Voice AI Engineer (LiveKit)

UMENIT SOLUTIONS LLP
Remote

Quick apply

Job details

Full-time
₹30,000 - ₹40,000 a month
1 day ago

Qualifications

Azure
Go
Node.js
Law
Software deployment
Google Cloud Platform
Master's degree
AWS
Docker
Linux
AI
Communication skills
Python
Debugging

Full job description

Position: Voice AI Engineer
Experience: 2–5 Years (Relevant Experience Preferred)
Location: Remote
Employment Type: Full-Time

About the Role

We are seeking a skilled and passionate Voice AI Engineer with hands-on experience in building, deploying, and maintaining real-time voice AI solutions using a self-hosted LiveKit environment. The ideal candidate should have strong expertise in voice communication systems, speech technologies, and cloud-native deployments, along with the ability to troubleshoot complex audio and media pipeline issues.

Key Responsibilities

Design, deploy, and maintain voice AI applications using self-hosted LiveKit infrastructure.
Configure, optimize, and manage LiveKit servers for production-grade environments.
Integrate and manage Text-to-Speech (TTS) and Speech-to-Text (STT) providers.
Develop and enhance real-time voice agents and conversational AI solutions.
Implement and optimize real-time audio streaming pipelines with low latency.
Monitor, troubleshoot, and resolve WebRTC, media routing, and audio quality issues.
Deploy and manage applications using Docker containers and cloud platforms.
Collaborate with AI, backend, and product teams to build scalable voice-enabled solutions.
Ensure system reliability, security, performance, and scalability.
Maintain technical documentation and deployment procedures.

Required Skills & Qualifications

Strong hands-on experience with LiveKit (Self-Hosted) setup, deployment, and maintenance.
Experience integrating Text-to-Speech (TTS) and Speech-to-Text (STT) services.
Solid understanding of voice agent frameworks and conversational AI architectures.
Experience working with real-time audio streaming technologies.
Strong knowledge of WebRTC, RTP, media servers, and audio processing concepts.
Proficiency with Docker, containerized deployments, and Linux environments.
Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform (GCP).
Strong troubleshooting and debugging skills for media pipeline and connectivity issues.
Understanding of networking concepts including NAT, TURN/STUN, and firewalls.
Good problem-solving skills and ability to work independently.

Preferred Qualifications

Experience integrating Large Language Models (LLMs) such as OpenAI, Anthropic, or open-source models.
Familiarity with AI agent frameworks and orchestration tools.
Experience with Python, Node.js, or Golang.
Knowledge of monitoring and observability tools such as Prometheus, Grafana, or ELK Stack.
Experience building production-grade conversational AI systems.

What We Offer

Opportunity to work on cutting-edge Voice AI and Conversational AI products.
Exposure to real-time communication technologies and AI-driven applications.
Flexible work environment and collaborative team culture.
Career growth opportunities in the rapidly evolving AI ecosystem.

Pay: ₹30,000.00 - ₹40,000.00 per month

Application Question(s):

What is your current monthly salary, expected monthly salary and notice period?
Do you have experience in LiveKit (self-hosted) setup and deployment?

Experience:

LiveKit (Self-Hosted) setup, deployment: 3 years (Required)
Text-to-Speech (TTS) and Speech-to-Text (STT): 2 years (Required)
AWS, Azure, or Google Cloud Platform (GCP).: 2 years (Required)
NAT, TURN/STUN, and firewalls: 1 year (Required)
WebRTC and media pipeline issues: 1 year (Required)
LLM integrations: 1 year (Required)

Work Location: Remote

Quick apply

Jobseeker tools

Employer Tools

Browse

Stay Connected