Job Description
The Senior Data Engineer will be responsible for leading the delivery of data on data and analytics platform. These services are at the forefront of building out data engineering practice cloud-native technologies. The Senior Data Engineer Service Provider must have experience in leading, designing, implementing, and collaborating with stakeholders to achieve the best results for our clients..
Specific Project Requirements:
- Increase the overall speed in which data is onboarded to the Data and Analytics platform.
- Building robust data pipelines to enable larger data consumption on the Data and Analytics Platform
- Increase the overall quality of data pipeline development through DevSecOps
Primary Responsibilities:
• Proven design, build and implementation of batch and real-time data pipelines. Driven by automated repeatable delivery of data that aligns to enterprise data governance standards.
• Experience in developing and proposing design patterns that conform to requirements. Responsible to ensure the proposed design, optimally addresses access and query patterns; data consumption and adheres to internal architecture standards.
• Experienced collaboration working with various stakeholders across the business, data scientists and IT. Working closely building relationships, refining data requirements to meet various data and analytics initiatives and data consumption requirements
Top Skills Required:
• Programming experience in Spark using modern languages such as Python, Scala
• Experience working with modern data architectures like Azure Data Lake Storage, Azure Databricks, Azure Synapse (formerly SQL Data Warehouse) and Delta Lakes
• Experience leading Data Engineering principles within an organization/team.
Other Skills Required:
• Experience working with Integration patterns and technologies such as Azure Event Hub, Function App and C#
• Knowledge and expertise of database modeling techniques: Data Vaults, Star, Snowflake, 3NF, etc.
• Experience working with streaming data architecture and technologies for real-time: Spark Streaming, Kafka, Flink, Storm
• Experience working with relational and non-relational database technologies: SQL Server, Oracle, Cassandra, MongoDB, CosmosDB, HBase
• Experience working with source code and configuration management environments such as Azure DevOps, Git, Maven, Nexus
Assets:
• Experience within Azure environment
• Strong Python, Scala and Spark experience
• Experience modernizing data platforms.
• Experience with Azure Functions and C#
• 2 to 3 projects developing Data Vault