Job Summary:
We are seeking a motivated Generative AI Developer with 2+ years of experience developing and deploying AI-powered applications. The ideal candidate has hands-on experience integrating Large Language Models (LLMs), AI APIs, and AI tools into business applications, as well as deploying and managing local LLM infrastructure with Ollama.
Key Responsibilities:
- Design, develop, and deploy Generative AI solutions using modern LLM technologies.
- Integrate AI models and tools into web, mobile, and enterprise applications.
- Build and maintain AI workflows using APIs and agent frameworks.
- Deploy, configure, and manage local LLMs using Ollama.
- Optimize model performance, inference speed, and resource utilization.
- Develop Retrieval-Augmented Generation (RAG) solutions using vector databases.
- Apply prompt engineering, model fine-tuning, and model evaluation techniques.
- Collaborate with product, engineering, and business teams to deliver AI-driven features.
- Monitor AI systems for reliability, scalability, and security.
Required Skills:
Technical Skills:
- 2+ years of software development experience.
- Strong proficiency in Python.
- Experience with LLM APIs such as the OpenAI Platform, the Anthropic Claude API, and Google AI Studio.
- Hands-on experience with Ollama, LangChain, and LlamaIndex.
- Knowledge of RAG architectures and vector databases.
- Experience with databases such as PostgreSQL and MongoDB.
- Familiarity with REST APIs and microservices architecture.
- Understanding of Docker and containerized deployments.
Educational Requirements:
- Bachelor's Degree in Computer Science, Information Technology, Engineering, or a related field.