Required Technical Skillset:-
1. Programming: Python (strong), SQL
2. ML Frameworks: scikit-learn, TensorFlow / PyTorch
3. GenAI / LLMs: OpenAI / Azure OpenAI, LangChain, vector databases
4. Data Engineering: ETL pipelines, data modeling, data validation
5. Cloud: Azure / AWS / GCP (at least one)
Must Have Skills:-
1. Design and maintain ETL / ELT pipelines for large-scale data ingestion
2. Work with structured and semi-structured data (CSV, JSON, Parquet)
3. Ensure data quality, lineage, and reliability
4. Optimize pipelines for performance and scalability
5. Deploy ML/AI models using APIs or batch pipelines
6. Implement CI/CD for ML workflows
7. Monitor model performance, model drift, and data quality issues