- Should have good analytical skills and experienced in writing SQL queries
-
Should have hands-on experience in developing Spark applications using Data frame API, PySpark-SQL in Data bricks for data extraction, transformation, and aggregation from multiple file formats for Analyzing & transforming the data using Python to uncover insights into the customer usage patterns.
-
Should have hands-on experience Extract Transform and Load data from sources Systems (On-prem) to Azure Data Storage services using a combination of Azure Data factory, T-SQL, Spark SQL.
-
Metadata driven ingestion pipeline framework by using Azure Data Factory
-
Should have real time experience of implementing spark optimization techniques and adaptive query execution configuration in data bricks.
-
Involved in conducting the code review and Results with the Developers
-
Good understanding and real time architecture experience on Azure cloud (ADLS, ADFS, ADF)
-
Knowledge of using GitHub for code versioning and devops deployment process.
-
Consume data and generate reports using Power BI
Good to have:
Have good exposure to USA Healthcare Insurance domain