About the Position
We are looking for a Senior Data Engineer to design, build, and optimize a modern cloud data platform on AWS with Databricks. You will contribute to developing scalable data pipelines, supporting analytics use cases, and ensuring reliable data processing across the platform.
About the Project
The project focuses on building a centralized cloud data platform that supports large scale data ingestion, processing, and analytics workloads. The platform enables advanced reporting, analytics, and data science use cases across multiple domains. Project nickname: Data Platform
About the Team
You will work in a cross functional team that includes data engineers, data architects, analysts, and platform specialists. The team collaborates closely using Agile practices, sharing responsibility for building scalable and reliable data solutions.
Responsibilities
- Develop and maintain scalable data pipelines for ingestion, transformation, and storage
- Implement data processing solutions using Databricks and Spark
- Collaborate with data architects to align implementations with target architecture and standards
- Build and optimize data models to support analytics and reporting use cases
- Ensure data quality, reliability, and performance across the platform
- Integrate data from various sources using cloud native services and tools
- Support data governance, security, and compliance requirements
- Participate in code reviews and contribute to continuous improvement of engineering practices
- Communicate technical concepts to both technical and non technical stakeholders
Requirements
- Experience working as a Data Engineer in complex data environments
- Hands on experience building data platforms on AWS
- Practical experience with Databricks and Spark based data processing
- Strong knowledge of data modeling and data warehousing concepts
- Experience building and maintaining large scale data pipelines
- Strong programming skills in Python and SQL
- Fluent English and French communication skills for professional environments
Nice to Have
- Experience with data governance or data quality frameworks
- Exposure to real time or streaming data architectures
- Experience working with data science or machine learning teams
- Background in enterprise scale environments
Technologies
AWS, S3, Redshift, Databricks, Spark, PySpark, dbt, Python, SQL, Git, MLflow, CI CD pipelines, data orchestration tools, data governance and security tools