Ph.D. degree, Master's degree, or equivalent experience in computer science, artificial intelligence, machine learning, operations research, statistics, or a related technical field.
5+ years with a Master's degree or 3+ years with a Ph.D. applying machine learning to real-world problems.
Strong Python programming skills and experience building production-quality ML, GenAI, or data systems.
Hands-on experience with PyTorch and modern deep learning stacks; experience with Hugging Face, LLMs, VLMs, diffusion models, or multimodal models is strongly preferred.
Experience with data-centric AI or GenAI methods such as synthetic data generation, data quality measurement, dataset curation, weak supervision, model-based labeling, active learning, deduplication, or data augmentation.
Experience designing experiments and interpreting results through statistical analysis, ablation studies, benchmark evaluation, and error analysis.
Strong understanding of model training, inference, evaluation, and production monitoring.
Ability to read research papers, identify practical value, and implement useful techniques in real systems.
Strong written and verbal communication skills, including technical proposals, design documents, experiment reports, and stakeholder presentations.
Experience building scalable data or ML pipelines using distributed compute, cloud storage, batch processing, or workflow orchestration.