Project Role : Large Language Model Architect
Project Role Description : Architect large language models (LLM) that can process and generate natural language. Design neural network parameters, trained on large quantities of unlabeled text data.
Must have skills : Large Language Models (LLMs)
Good to have skills : NA
Minimum
3 year(s) of experience is required
Educational Qualification : 15 years full time education
Summary:
As a Large Language Model Architect, a typical day involves designing and structuring advanced language models capable of understanding and generating human-like text. This role requires careful planning of neural network configurations and managing extensive datasets to ensure the models perform effectively. The work includes continuous refinement of model architectures to enhance their ability to process natural language, collaborating with various teams to align model capabilities with project goals, and overseeing the integration of these models into broader applications. The position demands a thoughtful approach to innovation and problem-solving within the evolving landscape of language technologies.
Roles & Responsibilities:
- Expected to perform independently and become an SME.
- Required active participation/contribution in team discussions.
- Contribute in providing solutions to work related problems.
- Collaborate with cross-functional teams to align model development with business objectives.
- Continuously evaluate and improve model performance through experimentation and analysis.
- Document architectural decisions and maintain clear communication with stakeholders.
- Support junior team members by sharing knowledge and providing guidance.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in Large Language Models (LLMs).
- Experience in designing and tuning neural network architectures for natural language processing tasks.
- Strong understanding of language model training techniques using large-scale unlabeled datasets.
- Ability to optimize model parameters to improve accuracy and efficiency.
- Familiarity with state-of-the-art methods in natural language generation and understanding.
- Skilled in evaluating model outputs and implementing improvements based on feedback.
Additional Information:
- The candidate should have minimum 3 years of experience in Large Language Models (LLMs).
- This position is based at our Chennai office.
- A 15 years full time education is required.