We are seekinga strong Data Scientist to support a strategic data transformation initiative focused on building scalable, AI-enabled data quality and data management capabilities.
The role combines Generative AI, traditional machine learning, and data engineering to develop intelligent solutions for metadata generation, DQ rule recommendation, anomaly detection, entity resolution, profiling automation, and data quality monitoring.
The ideal candidateshould be hands-on,technically strong, and able to translate advanced AI/ML methods into practical, production-ready solutions within an enterprise environment.
-
Develop AI-powered solutions usingLLMs for metadataenrichment, semantic classification, summarization, and data quality automation
-
Build and optimize RAG pipelines groundedin enterprise metadata,profiling outputs, rule libraries, and technical documentation
-
Develop ML models for anomaly detection, entity resolution, clustering, predictive analytics, and pattern recognition
-
Build scalable Python and SQL pipelinesto automate profiling, data onboarding, quality monitoring, and AI-assisted recommendations
-
Design Human-in-the-Loop workflows to ensure AI outputs are validated, auditable, and aligned with business requirements
-
Collaborate with business, governance, and technical teams to operationalize AI/ML solutions in enterprise environments
-
Strong hands-on experience with LLMs, promptengineering, and production-grade RAG architectures
-
Strong experience in Python,SQL, and ML frameworks such as Scikit-learn, LangChain, LlamaIndex, or equivalent
-
Experience with supervised and unsupervised ML techniques, including XGBoost, Random Forest, clustering, PCA, Isolation Forest, and anomaly detection
-
Good understanding of metadatamanagement, data qualitydimensions, data governance, and enterprise data platforms
-
Ability to evaluate modelperformance, improve outputquality, and explaintechnical results to non-technical stakeholders
-
Proven experience as a Data Scientistdelivering AI/ML solutionsin enterprise or large- scale data environments
-
Strong analytical, problem-solving, and communication skills
-
Understanding of responsible AI, data security, privacy, and governance practices
-
Bachelor’s or Master’s degreein Computer Science,Data Science, Engineering, Statistics, Mathematics, or a related field