Job Description:
Job Overview
We are seeking a skilled ETL Tester with strong expertise in Python and PySpark to join our QA team. The role involves validating large-scale data pipelines, ensuring data quality, and automating test processes on modern data platforms.
Key Responsibilities
Perform ETL testing across multiple data sources, transformations, and targets.
Validate data ingestion, transformation, and loading in Big Data and cloud environments.
Design, develop, and execute test cases, test scripts, and SQL queries for data validation.
Use Python and PySpark for test automation, data comparison, and framework enhancements.
Identify, analyze, and report defects with detailed logs, and collaborate with developers to resolve them.
Ensure compliance with data quality, integrity, and governance standards.
Work with Agile teams and participate in sprint planning, stand-ups, and retrospectives.
Required Skills
Strong experience in ETL/Data Warehouse testing.
Hands-on expertise with Python scripting for automation.
Proficiency in PySpark for validating and testing large data sets.
Solid knowledge of SQL for complex queries and data validation.
Experience with Big Data platforms (Hadoop, Spark, Hive, etc.) is a plus.
Familiarity with test management tools (JIRA, ALM, etc.).
Strong analytical and problem-solving skills.