Job Description:
- Large Language Models (LLMs):
Experience with LangChain and LangGraph
Proficiency in building agentic patterns such as ReAct, ReWOO, and LLMCompiler
- Multi-modal Retrieval-Augmented Generation (RAG):
Expertise in multi-modal AI systems (text, images, audio, video)
Experience designing and optimizing chunking and clustering strategies for large-scale data processing
- Streaming & Real-time Processing:
Experience in audio/video streaming and real-time data pipelines
Low-latency inference and deployment architectures
- Natural Language-driven SQL Generation:
Experience with natural language interfaces to databases and query optimization
- API Development:
Building scalable APIs with FastAPI for AI model serving
- Containerization & Orchestration:
Proficiency with Docker for containerized AI services
Experience with orchestration tools for deploying and managing services