Job Summary
This AI/ML Engineer role requires minimum 3 years of experience and focuses on developing advanced AI systems in vision, speech, generative AI, and edge deployment, particularly using NVIDIA technologies. The position involves building real-time pipelines for video analytics, voice agents, and LLM-based applications, with backend integration and MLOps. It suits candidates with strong Python skills and expertise in computer vision, speech processing, and model optimization for edge devices like NVIDIA Jetson.
Key Responsibilities
- AI/ML Model Development: Train models via TAO Toolkit, PyTorch/TensorFlow; optimize with TensorRT/ONNX; build OCR (EasyOCR) and generative models like Stable Diffusion.
- LLM, RAG & GenAI: Deploy Llama/Mistral via Ollama; implement RAG with FAISS/Chroma; perform SFT/LoRA tuning for conversational AI. Edge AI & NVIDIA Ecosystem: Optimize for Jetson using TensorRT, CUDA, TAO; integrate with Omniverse/Isaac Sim.
- Backend, Integrations & MLOps: Create FastAPI/Flask services; use MQTT/Kafka/WebSockets; deploy Dockerized microservices with logging/monitoring.
- Vision AI & Video Analytics: Develop pipelines with NVIDIA DeepStream, GStreamer, RTSP streams; implement YOLO, DeepSORT tracking, OpenCV processing, and face recognition analytics.
Required Skills
- Strong Python programming.
- Vision AI: YOLO, DeepStream, OpenCV, tracking.
- LLM/GenAI: RAG, SFT/LoRA, prompt engineering.
- NVIDIA stack: Jetson, TensorRT, TAO.
- Streaming: RTSP, WebRTC, GStreamer.
- Backend: FastAPI, MQTT/Kafka.
- Models: CNN, RNN, transformers.
Good-to-Have Skills
NVIDIA Riva for speech.
Cloud (AWS/GCP/Azure).
MLOps (MLflow, Kubeflow).
Embedded Linux (JetPack).
Audio DSP basics.
Education Requirements
Bachelor’s or Master’s in Computer Science, Electronics, AI/ML, or related field.
Job Types: Full-time, Permanent
Pay: ₹230,000.00 - ₹700,000.00 per year
Work Location: In person