TCS Walk-in at Bangalore - Bhuwalkha
Date: 18-Apr-26
JD
Responsibilities:
- Develop Glue scripts primarily in Python (PySpark) to handle complex transformations, data quality checks, and business rules
- Design, develop, and maintain ETL/ELT jobs using AWS Glue (Jobs, Crawlers, Triggers) to ingest, transform, and load data from multiple sources
- Design end-to-end data workflows leveraging Glue Workflows, Triggers, and, where applicable, Step Functions or other orchestration tools
- Automate pipeline deployments and job scheduling, integrating with CI/CD pipelines and infrastructure-as-code frameworks (e.g., CloudFormation, Terraform)
- Troubleshoot ETL issues including job failures, performance bottlenecks, schema evolution, and data quality problems
- Integrate Glue with AWS data services such as S3, Redshift, Athena, RDS, DynamoDB, and Lake Formation
Required:
- Must have 6+ years of total IT experience, including 4+ years working as an AWS Glue Developer
- Experience with Python or another scripting language for ETL orchestration and automation
- Hands-on experience with AWS Glue (Jobs, Crawlers, Workflows, Data Catalog)
- Strong Python and PySpark skills for ETL development and data processing at scale
- Experience with SQL and integrating Glue with query engines (e.g., Redshift, Athena)
- Familiarity with orchestration tools such as AWS Step Functions or Apache Airflow