In-depth understanding of various Gen AI and Agentic AI models and frameworks Experience in designing and engineering production grade Gen AI and Agentic solutions Experience with evaluation frameworks (human eval loops, LLM-as-judge, hallucination detection, toxicity testing) Experience with measuring and benchmarking AI solutions performance and accuracy
Conduct research on Gen AI and Agentic AI models and frameworks. Design and engineer production-grade Gen AI and Agentic AI solutions. Implement evaluation frameworks including human eval loops and LLM-as-judge. Develop methods for hallucination detection and toxicity testing in AI models. Measure and benchmark the performance and accuracy of AI solutions.