Position: Voice AI Engineer
Experience: 2–5 Years (Relevant Experience Preferred)
Location: Remote
Employment Type: Full-Time
About the Role
We are seeking a skilled and passionate Voice AI Engineer with hands-on experience in building, deploying, and maintaining real-time voice AI solutions using a self-hosted LiveKit environment. The ideal candidate should have strong expertise in voice communication systems, speech technologies, and cloud-native deployments, along with the ability to troubleshoot complex audio and media pipeline issues.
Key Responsibilities
- Design, deploy, and maintain voice AI applications using self-hosted LiveKit infrastructure.
- Configure, optimize, and manage LiveKit servers for production-grade environments.
- Integrate and manage Text-to-Speech (TTS) and Speech-to-Text (STT) providers.
- Develop and enhance real-time voice agents and conversational AI solutions.
- Implement and optimize real-time audio streaming pipelines with low latency.
- Monitor, troubleshoot, and resolve WebRTC, media routing, and audio quality issues.
- Deploy and manage applications using Docker containers and cloud platforms.
- Collaborate with AI, backend, and product teams to build scalable voice-enabled solutions.
- Ensure system reliability, security, performance, and scalability.
- Maintain technical documentation and deployment procedures.
Required Skills & Qualifications
- Strong hands-on experience with LiveKit (Self-Hosted) setup, deployment, and maintenance.
- Experience integrating Text-to-Speech (TTS) and Speech-to-Text (STT) services.
- Solid understanding of voice agent frameworks and conversational AI architectures.
- Experience working with real-time audio streaming technologies.
- Strong knowledge of WebRTC, RTP, media servers, and audio processing concepts.
- Proficiency with Docker, containerized deployments, and Linux environments.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP).
- Strong troubleshooting and debugging skills for media pipeline and connectivity issues.
- Understanding of networking concepts including NAT, TURN/STUN, and firewalls.
- Good problem-solving skills and ability to work independently.
Preferred Qualifications
- Experience integrating Large Language Models (LLMs) such as OpenAI, Anthropic, or open-source models.
- Familiarity with AI agent frameworks and orchestration tools.
- Experience with Python, Node.js, or Golang.
- Knowledge of monitoring and observability tools such as Prometheus, Grafana, or ELK Stack.
- Experience building production-grade conversational AI systems.
What We Offer
- Opportunity to work on cutting-edge Voice AI and Conversational AI products.
- Exposure to real-time communication technologies and AI-driven applications.
- Flexible work environment and collaborative team culture.
- Career growth opportunities in the rapidly evolving AI ecosystem.
Pay: ₹30,000.00 - ₹40,000.00 per month
Application Question(s):
- What is your current monthly salary, expected monthly salary and notice period?
- Do you have experience in LiveKit (self-hosted) setup and deployment?
Experience:
- LiveKit (Self-Hosted) setup, deployment: 3 years (Required)
- Text-to-Speech (TTS) and Speech-to-Text (STT): 2 years (Required)
- AWS, Azure, or Google Cloud Platform (GCP).: 2 years (Required)
- NAT, TURN/STUN, and firewalls: 1 year (Required)
- WebRTC and media pipeline issues: 1 year (Required)
- LLM integrations: 1 year (Required)
Work Location: Remote