Job Title: Senior Software Engineer – Cloud & Edge Inference (NVIDIA Jetson)
Location: Bangalore [Remote/Hybrid]
Salary: As per Industry Standards
Education: Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
Experience: 10+ Years in Software Engineering
Job Summary:
As a Senior Software Engineer – Cloud & Edge Inference (NVIDIA Jetson), you will design and implement a unified Docker-based deployment pipeline for cloud and edge environments. You will optimize real-time inference services for GenAI models (LLMs and beyond) running on NVIDIA Jetson devices, while collaborating with AI/ML teams to enable scalable hybrid deployments. This role combines cloud orchestration expertise with edge performance optimization, making it critical to our AI-driven solutions strategy.
Key Responsibilities:
- Design & implement a Docker-based deployment pipeline for seamless cloud and edge integration.
- Optimize and adapt Python/FastAPI inference services for real-time GenAI performance on edge devices.
- Build & maintain Kubernetes deployments for hybrid workloads (cloud + NVIDIA Jetson).
- Collaborate with AI/ML teams to integrate and deploy inference models to edge environments.
- Troubleshoot, optimize, and stabilize inference workloads under constrained edge hardware conditions.
Skills & Qualifications:
- 10+ years of professional software engineering experience.
- Strong proficiency in Python (FastAPI experience preferred).
- Proven expertise with Kubernetes and container orchestration at scale.
- Hands-on experience with real-time AI inference on embedded GPUs (NVIDIA Jetson or similar).
- Solid knowledge of performance optimization and resource constraints in edge environments.
- Strong problem-solving ability and collaborative mindset across AI/ML and infrastructure teams.
Preferred Qualifications:
- Experience with GPU acceleration frameworks (CUDA, TensorRT).
- Familiarity with CI/CD pipelines for containerized deployments.
- Knowledge of network optimization for cloud-to-edge communication.
- Background in distributed systems and scalable architectures.
What We Offer:
- Opportunity to work on cutting-edge AI inference solutions across cloud and edge.
- Exposure to hybrid cloud architectures with global clients.
- A collaborative and innovation-driven work culture.
- Competitive salary packages aligned with industry standards.
- Continuous learning opportunities in AI, edge computing, and advanced cloud technologies.
Job Application Details:
Candidates who meet the above requirements may email their resume to [email protected], walk in, or submit their resume online.