Bei Roche kannst du ganz du selbst sein und wirst für deine einzigartigen Qualitäten geschätzt. Unsere Kultur fördert persönlichen Ausdruck, offenen Dialog und echte Verbindungen. Hier wirst du für das, was du bist, wertgeschätzt, akzeptiert und respektiert. Dies schafft ein Umfeld, in dem du sowohl persönlich als auch beruflich wachsen kannst. Gemeinsam wollen wir Krankheiten vorbeugen, stoppen und heilen und sicherstellen, dass jeder Zugang zur Gesundheitsversorgung hat – heute und in Zukunft. Werde Teil von Roche, wo jede Stimme zählt.
Die Position
Data Engineer
Experience - 4 to 8 years
Location- Pune
Job description
The Senior IT Data Engineer is responsible for leading, designing, developing, and maintaining scalable and robust data pipelines and infrastructure. This role involves independently building ETL/ELT processes, optimizing data storage solutions (such as data warehouses and data lakes), ensuring data quality and reliability, and monitoring data systems. You will collaborate closely with data scientists and analysts to meet their specific data requirements, utilizing strongprogramming skills in Tableau, Snowflake, Talend, Python or Scala, expert SQLknowledge, and proficiency in big data technologies like Spark.
The ideal candidate will possess a strong background in the pharmaceutical or biotechnology industry, with experience working within Regulatory Affairs, Clinical Operations, or Pharmacovigilance / Safety team with a solid understanding on the E2E process flow across R Additionally, a proven track record of navigating the stringent requirements of GxP environments (GCP, GMP, GVP) and managing complex, cross-functional data workflows is highly desirable.
Description of the area
Job Responsibilities
- End-to-End Pipeline Delivery: Independently leads the design, build, and maintenance of scalable data pipelines, managing specific data engineering projects autonomously from inception to deployment.
- Performance Optimization& Problem Solving: Solves complex data ingestion and processing challenges, actively optimizing data flows to enhance overall system performance and reliability.
- Stakeholder Alignment& Integration: Partners directly with business units and data scientists to understand data requirements, effectively bridging technical execution with non-technical business needs.
- Strategic Infrastructure Impact: Owns large-scale data engineering initiatives, implementing robust strategies that significantly modernize and strengthen the organization’s data infrastructure.
- Complex Data Integration: Manages large, intricate data ecosystems by seamlessly integrating multiple diverse data sources to ensure efficient, secure cross-platform data flows.
Qualifications
Education / Experience
- Large-Scale Data Systems Management: Demonstrated experience owning major data engineering initiatives and managing complex, high-volume enterprise data systems.
- Autonomous ETL/ELT Pipeline Development: Proven track record of independently architecting, building, and maintaining automated ETL/ELT data ingestion and transformation processes.
- Storage Solution Optimization: Hands-on experience designing and optimizing modern data storage environments, including data warehouses and data lakes, for peak performance and cost efficiency.
- Data Quality& Reliability Assurance: Expert capability in implementing rigorous data quality checks, data cleansing rules, and reconciliation frameworks across all pipelines.
- System Monitoring& Observability: Strong experience building robust monitoring, alerting, and logging systems to ensure continuous high availability and minimal downtime of data workflows.
•
Technical Skills
- Programming& ETL/ELT Mastery: Advanced proficiency in SQL, Python, and Scala combined with expert use of tools like Talend to build, ingest, and process complex structured and unstructured data streams.
- Cloud& Big Data Architecture: Deep expertise leveraging distributed computing frameworks (Spark, Hadoop) and cloud-native data platforms (Snowflake) to manage and scale high-volume, enterprise-level data systems.
- Optimization& Performance Engineering: Proven capability to solve complex data processing bottlenecks, tune analytical environments for tools like Tableau, and continuously optimize end-to-end data flows for maximum efficiency and reliability.
- Good to have : AI expertise
Additional Qualifications
- Pharma& GxP Compliance: Extensive experience in pharma or biotech, architecting data pipelines that strictly comply with GxP frameworks (GCP/GMP/GVP), data integrity principles, and computer systems validation (CSV).
- Compliant Data Delivery: Proven capability to build scalable data solutions within regulated R environments, aligning technical execution with critical Clinical, Regulatory, and Safety milestones.
- Workflow& Data Optimization: Skilled at identifying data bottlenecks, eliminating operational silos, and optimizing fragmented workflows to ensure automated, streamlined cross-platform data transfers.
Wer wir sind
Eine gesündere Zukunft treibt uns zur Innovation an. Mehr als 100.000 Mitarbeiter weltweit arbeiten gemeinsam daran, wissenschaftliche Fortschritte zu erzielen und sicherzustellen, dass jeder Zugang zur Gesundheitsversorgung hat – heute und für zukünftige Generationen. Durch unser Engagement werden über 26 Millionen Menschen mit unseren Medikamenten behandelt und mehr als 30 Milliarden Tests mit unseren Diagnostik-Produkten durchgeführt. Wir ermutigen uns gegenseitig, neue Möglichkeiten zu erkunden, Kreativität zu fördern und hohe Ziele zu setzen, um lebensverändernde Gesundheitslösungen zu liefern.
Gemeinsam können wir eine gesündere Zukunft gestalten.
Roche ist ein Arbeitgeber, der die Chancengleichheit fördert.