We are an AI-led, platform-driven Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what's next. Our offerings and proven solutions create a unique competitive advantage for our clients by giving them the power to see beyond and rise above. We work with many industry-leading organizations across the world, including 20 Fortune 50 companies, 4 of the 5 top banks in both the US and India, and numerous innovators across the healthcare ecosystem.
Our disruptor's mindset, commitment to client success, and agility to thrive in a dynamic environment have enabled us to sustain our growth momentum. Persistent has been recognized across top industry platforms for innovation, leadership, and inclusion. We reported $1,654.4M in FY26 revenue, with 17.4% Y-o-Y growth. We have delivered 24 sequential quarters of growth, with $436.0M in Q4 FY26 revenue, up 3.2% Q-o-Q and 16.2% Y-o-Y. Our 27,500+ global team members, located in 18 countries, have been instrumental in helping market leaders transform their industries. We have been recognized as the Fastest Growing IT Services Brand Globally in the 2026 Brand Finance IT Services 25 Report. We were named a Leader in the Everest Group Private Equity (PE) Services PEAK Matrix® Assessment 2026 and the Software Product Engineering PEAK Matrix® Assessment 2026.
About Position:
We are seeking an experienced and highly skilled Data Scientist specialising in Generative AI and Synthetic Data Generation to design, develop, and deploy advanced data-driven solutions. This role focuses on building and optimising generative models capable of producing high-quality synthetic data that closely mimics real-world datasets across domains such as text, images, and structured data.
The ideal candidate will play a critical role in leveraging machine learning and statistical techniques to enhance model performance, scalability, and reliability. You will collaborate closely with AI researchers, engineers, and domain experts to drive innovation in generative AI systems, particularly in data-constrained or regulated environments.
- Role: Data Scientist - Synthetic Data
- Location: All Persistent Locations
- Experience: 8 to 12 years
- Job Type: Full Time Employment
What You'll Do:
- Synthetic Data Generation (Must-Have):
Design and develop machine learning models for synthetic data generation using techniques such as GANs, VAEs, diffusion models, and other deep generative approaches. Ensure generated data maintains statistical fidelity, diversity, and privacy compliance.
- Data Collection & Preprocessing:
Identify, acquire, and curate relevant datasets. Perform data cleansing, transformation, and structuring to ensure high-quality inputs for training generative models.
- Model Development & Training:
Build end-to-end data pipelines and workflows for training generative AI models using state-of-the-art architectures including GANs, Variational Autoencoders, Normalizing Flows, and Diffusion Networks.
- Model Optimisation:
Perform hyperparameter tuning and experiment with architectures to improve model accuracy, stability, and output quality.
- Data Augmentation:
Implement advanced data augmentation strategies to enhance dataset size, diversity, and model generalisation.
- Performance Evaluation:
Define, track, and improve model evaluation metrics to ensure objective assessment of generative model performance and synthetic data quality.
- Bias Detection & Mitigation:
Analyse datasets and model outputs to identify biases and implement techniques to ensure fairness, robustness, and ethical AI practices.
- Transfer Learning & Adaptation:
Apply transfer learning approaches to fine-tune pre-trained models for new domains and specific use cases.
- Collaborative Development:
Work cross-functionally with AI researchers, software engineers, product teams, and domain SMEs to integrate generative AI and synthetic data solutions into production systems.
- Documentation & Knowledge Sharing:
Maintain clear and comprehensive documentation of methodologies, experiments, and findings to ensure reproducibility and knowledge dissemination.
Expertise You'll Bring:
- Master's or Ph.D. in Computer Science, Data Science, Machine Learning, or a related discipline with a focus on AI or generative modelling.
- Mandatory experience in synthetic data generation using machine learning or deep learning models.
- Strong proficiency in Python and leading AI frameworks such as TensorFlow or PyTorch.
- In-depth understanding of generative modelling techniques including GANs, VAEs, Diffusion Models, and Normalizing Flows.
- 10+ years of experience in data science, including large-scale data processing, feature engineering, and model training.
- Strong foundation in statistics, probability, and quantitative analysis.
Preferred Skills:
- Experience working with complex datasets such as biomedical, clinical, genomic, imaging, or omics data.
- Familiarity with Natural Language Processing (NLP) and Computer Vision techniques in generative AI contexts.
- Hands-on experience with privacy-preserving techniques (e.g., differential privacy, synthetic data validation).
- Knowledge of Responsible AI principles, including fairness, transparency, explainability, and data privacy.
- Experience working in regulated domains such as healthcare, life sciences, or finance.
Core Competencies:
- Strong problem-solving and analytical thinking capabilities.
- Ability to work independently as well as in cross-functional teams.
- Excellent communication and stakeholder management skills, with the ability to explain complex concepts to both technical and non-technical audiences.
Benefits:
- Competitive salary and benefits package
- Culture focused on talent development with quarterly growth opportunities and company-sponsored higher education and certifications
- Opportunity to work with cutting-edge technologies
- Employee engagement initiatives such as project parties, flexible work hours, and Long Service awards
- Annual health check-ups
- Insurance coverage: group term life, personal accident, and Mediclaim hospitalization for self, spouse, two children, and parents
Values-Driven, People-Centric & Inclusive Work Environment:
Persistent is dedicated to fostering diversity and inclusion in the workplace. We invite applications from all qualified individuals, including those with disabilities, and regardless of gender or gender preference. We welcome diverse candidates from all backgrounds.
- We support hybrid work and flexible hours to fit diverse lifestyles.
- Our office is accessibility-friendly, with ergonomic setups and assistive technologies to support employees with physical disabilities.
- If you are a person with disabilities and have specific requirements, please inform us during the application process or at any time during your employment.
Let's unleash your full potential at Persistent - persistent.com/careers
"Persistent is an Equal Opportunity Employer and prohibits discrimination and harassment of any kind."