Experience: 6–8 years total, with at least 4 years in modern data engineering for AI/ML or SaaS environments
Must-Have Skills:
Core Data Engineering:
Expert in SQL and Python for data processing, transformation, and validation.
Strong hands-on experience with data pipeline orchestration tools (Airflow, Prefect, or similar).
Proven experience with ETL/ELT frameworks and tooling (dbt, Apache Beam, Kafka).
Deep understanding of data modelling, partitioning, and schema evolution for large-scale systems.
Experience building real-time streaming pipelines (Kafka, Kinesis).
Prior startup or AI platform experience; comfortable working with evolving requirements.
Working experience with a graph database such as Neo4j or Amazon Neptune.
Cloud & Infrastructure:
Proficiency in AWS data platforms and infrastructure.
Solid understanding of data lake / lakehouse architectures.
Knowledge of CI/CD, IaC (Terraform), Docker/Kubernetes, and version control workflows.
Experience setting up data observability tools.
Exposure to financial or transactional data systems.
Experience in event-driven architecture or CDC frameworks.
AI/ML Enablement:
Understanding of ML data needs: feature engineering, training/inference parity, and model input/output formats.
Experience delivering data to AI model pipelines (batch + online inference).
Nice to Have:
Experience working with vector databases or embedding pipelines (FAISS, Pinecone, Milvus).
Familiarity with LLM data preparation workflows (chunking, retrieval indexing, evaluation data).
Awareness of MLOps principles: model retraining triggers, drift detection inputs, and experiment tracking data.
Success in 6–12 Months
Deliver robust data pipelines supporting at least 2 production AI features.
Establish foundational feature store and data validation frameworks.
Ensure data quality, latency, and compliance SLAs for AI workflows.
Enable reproducible, automated data workflows for training and inference.
Team Context
You’ll be part of Solifi’s AI Product Team, a lean, senior, cross-functional group responsible for building AI-driven capabilities end-to-end, from discovery through deployment to operations.
Why This Role Matters
Data is the foundation of every intelligent system. As Senior Data Engineer, you’ll architect that foundation, ensuring Solifi’s AI products are powered by accurate, reliable, and production-grade data pipelines that scale with our ambitions.
Your work directly enables faster experimentation, better model performance, and more trustworthy AI on Solifi’s platform.