8+ years of hands-on experience in Data Science and 5+ years in Machine Learning, with a proven track record, demonstrated through a robust portfolio of projects.
Strong programming skills in languages such as Python and familiarity building ETL pipelines.
Expertise in SQL and experience with both relational (preferably Postgres) and NoSQL databases (Open Search or Elastic Search)
Familiarity with AWS cloud platform and its services.
Experience with version control systems (e.g., Git) and CI/CD pipelines.
Ability to build scalable infrastructure to embed and search very large number of documents.
Ability to move fast in an environment where things are sometimes loosely defined and may have competing priorities or deadlines.
Expertise in ML inference optimizations
Solid experience with Hybrid RAG, chunking/segmentation refinements, embedding/index update workflows, metadata filtering, caching, etc.
Knowledge of network optimization for distributed ML training and inference.
Understanding of distributed training patterns and checkpointing strategies.
Strong English skills (B2 and higher)
Strong verbal and written communication skills.
Ability to work independently and collaborate in a group.