KPMG India

Data Engineer Azure Synapse and Data Modeling

1 June 2024
Apply Now
Deadline date:
£90000 - £158000 / year

Job Description

We are looking for a Data Engineer with a strong foundation in SQL and data modeling to join our team. The ideal candidate will be responsible for creating Feature Engineering Tables (FETs) from Erwin, utilizing Azure Synapse, and working with Spark notebooks to support our data warehouse project.

  • Develop and maintain Feature Engineering Tables (FETs) using Erwin data modeling tools, ensuring adherence to the 3rd Normal Form (3NF) for optimal database design.
  • Design, build, and optimize data pipelines using Azure Synapse Analytics, ensuring efficient data flow and storage.
  • Utilize Spark notebooks within Azure Synapse for complex data processing and analytics tasks.
  • Collaborate with stakeholders to understand data requirements and translate them into technical specifications.
  • Implement and maintain data security and compliance protocols within the data warehouse environment.
  • Conduct data quality assurance and implement measures to ensure data accuracy and integrity.
  • Stay uptodate with the latest trends and best practices in data engineering and Azure Synapse.
  • Strong knowledge of SQL and data modeling, particularly in 3NF, to ensure efficient and scalable database structures.
  • Proficiency in Azure Synapse Analytics, with experience in data migration and querying large datasets.
  • Experience with Azure technologies, including Azure Data Factory, Azure Databricks, and Azure Data Lake Storage.
  • Familiarity with big data technologies (Hadoop, Hive, HBase, Spark, etc.).
  • Proficiency in Python, SQL, and advanced SQL techniques.
  • Understanding of both batch and streaming data architectures.
  • Deep knowledge of data warehousing, ETL development, and data modeling.
  • Experience with software engineering best practices, including agile methodologies and coding standards.
  • Knowledge of data management fundamentals and data storage principles.
  • Advanced statistics and machine learning model implementation is a plus.