Tech Lead – Backend & AI Systems
Location: On-site, Full-time
Experience: 3–8 years
Education Requirement: Graduate from the 2021 batch or earlier
Role Type: Backend Engineering / AI Systems / Technical Ownership
Focus Areas: Python, FastAPI, AI Systems, LLMs, Real-Time APIs, Cloud Infrastructure
About the Role
We are hiring a Tech Lead – Backend & AI Systems to design, build, deploy, and manage production-ready backend and AI systems.
This role is for someone with strong backend engineering experience who has worked on real production systems, not only basic APIs or demos. The candidate should be comfortable building scalable backend services, integrating AI/ML models, and working with LLMs, embeddings, inference APIs, databases, cloud infrastructure, and real-time systems.
Key Responsibilities
The candidate will be responsible for:
- Designing and developing high-performance backend systems using Python and FastAPI
- Building scalable APIs for AI-powered applications
- Developing and deploying services involving LLMs, embeddings, vector search, and inference pipelines
- Creating real-time backend systems with low-latency APIs, async processing, queues, and streaming workflows
- Owning the full backend lifecycle: architecture → development → deployment → monitoring → scaling
- Optimizing applications for speed, reliability, throughput, cost, and scalability
- Integrating AI models, databases, vector databases, APIs, cloud services, and internal tools
- Building robust production infrastructure for AI-driven systems
- Debugging production issues and improving system stability
- Designing systems that can handle failures and recover smoothly
- Working with frontend, AI/ML, product, and leadership teams to convert business requirements into working systems
- Ensuring backend systems are secure, scalable, maintainable, and production-ready
Must-Have Skills
The candidate should have hands-on experience in:
- Backend development using Python
- FastAPI (mandatory)
- REST API design and backend architecture
- Async programming and scalable API development
- Distributed systems and event-driven architecture
- AI/ML system integration in production or near-production environments
- LLM integration, embeddings, inference APIs, and model-serving workflows
- Vector databases and similarity-search libraries such as FAISS, Pinecone, Weaviate, Milvus, ChromaDB, or similar
- Relational and NoSQL databases such as PostgreSQL, MySQL, MongoDB, or similar
- Caching, messaging, and task-queue systems such as Redis, Kafka, RabbitMQ, Celery, or similar
- Cloud platforms such as AWS, Azure, or GCP
- Docker and containerized deployments
- CI/CD pipelines and deployment workflows
- Monitoring, logging, debugging, and handling production issues
Good to Have Skills
These are added advantages but not mandatory:
- Kubernetes experience
- Node.js or other backend frameworks
- WebSockets and real-time communication systems
- Speech, video, or streaming system experience
- GPU optimization or model-serving infrastructure
- Experience with tools such as LangChain, LlamaIndex, vLLM, Triton, TensorRT, or similar
- Frontend awareness with React, Angular, or Next.js
- DevOps, security, IAM, and infrastructure automation exposure
- Experience in high-performance or low-latency systems
Eligibility Criteria
- Candidate must be a graduate from the 2021 batch or earlier
- Candidate must have 3–8 years of relevant experience
- Candidate must be available for an on-site, full-time role
- Candidate should have strong hands-on backend development experience
- Candidate must have practical experience with FastAPI
- Candidate should have worked on production systems, scalable platforms, or AI/ML-integrated applications
What We Expect
We are looking for someone who can take ownership of backend and AI systems, not just write assigned code.
The candidate should be able to:
- Take responsibility for systems working properly in production
- Solve problems instead of passing them around
- Design systems that can handle failure, recovery, and scale
- Think about latency, performance, monitoring, security, and reliability
- Work in a fast-moving environment
- Understand business requirements and convert them into strong technical execution
- Take initiative and work independently when required
- Build systems that are clean, maintainable, and ready for real usage
Non-Negotiables
- Graduate from the 2021 batch or earlier
- On-site role
- 3–8 years of relevant experience
- Strong Python backend experience
- FastAPI experience is mandatory
- Practical experience with AI/ML or LLM-based system integration
- Strong debugging and problem-solving ability
- Ownership mindset
- Ability to work with ambiguity and move fast
- Experience building systems beyond simple CRUD APIs
Why This Role Matters
We are building real-time AI-driven systems where speed, accuracy, reliability, correctness, and security matter.
The person in this role will help build the backend and AI infrastructure behind intelligent agents, AI dashboards, LLM workflows, enterprise AI tools, inference services, vector search systems, and real-time applications.
This is an important role for someone who wants to work on serious AI systems that are actually deployed, monitored, scaled, and used in real environments.
Compensation
Competitive salary plus performance-linked incentives, based on experience, technical ability, and demonstrated impact.
Final Note
This role is not for someone who has only built basic APIs or small demos.
We are looking for someone who has worked on backend systems that run properly, scale, handle failures, recover smoothly, and support real AI applications.
Pay: ₹946,359.21 - ₹2,235,582.77 per year
Work Location: In person