Key Responsibilities
-
Administer and maintain AWS environments supporting data pipelines, including S3, EMR, Athena, Glue, Lambda, CloudFormation, and Redshift.
-
Cost Analysis – use AWS Cost Explorer to analyze services and usages, create dashboards to alert outliers on usage and cost
-
Performance and Audit – use AWS Cloud Trail and Cloud Watch to monitory system performance and usage
-
Monitor, troubleshoot, and optimize infrastructure performance and availability.
-
Provision and manage cloud resources using Infrastructure as Code (IaC) tools (e.g., AWS CloudFormation, Terraform).
-
Collaborate with data engineers working in PySpark, Hive, Kafka, and Python to ensure infrastructure alignment with processing needs.
-
Support code integration with GIT repositories
-
Implement and maintain security policies, IAM roles, and access controls.
-
Participate in incident response and support resolution of operational issues, including on-call responsibilities.
-
Manage backup, recovery, and disaster recovery processes for AWS-hosted data and services.
-
Interface directly with client teams to gather requirements, provide updates, and resolve issues professionally.
-
Create and maintain technical documentation and operational runbooks
Required Qualifications
-
3+ years of hands-on administration experience managing AWS infrastructure, particularly in support of data-centric workloads.
-
Strong knowledge of AWS services including but not limited to S3, EMR, Glue, Lambda, Redshift, and Athena.
-
Experience with infrastructure automation and configuration management tools (e.g., CloudFormation, Terraform, AWS CLI).
-
Proficiency in Linux administration and shell scripting, including Installing and managing software on Linux servers
-
Familiarity with Kafka, Hive, and distributed processing frameworks such as Apache Spark.
-
Ability to manage and troubleshoot IAM configurations, networking, and cloud security best practices.
-
Demonstrated experience in monitoring tools (e.g., CloudWatch, Prometheus, Grafana) and alerting systems.
-
Excellent verbal and written communication skills.
-
Comfortable working with cross-functional teams and engaging directly with clients.
Preferred Qualifications
-
AWS Certification (e.g., Solutions Architect Associate, SysOps Administrator)
-
Experience supporting data science or analytics teams
-
Familiarity with DevOps practices and CI/CD pipelines
Familiarity with Apache Iceberg–based data pipelines