We are looking for an experienced Senior Data Software Engineer to play a key role in designing, developing, and maintaining scalable data systems and pipelines.
Responsibilities
- Build, optimize, and maintain data processing pipelines to handle large-scale datasets
- Develop software solutions to integrate, transform, and analyze data effectively
- Collaborate with cross-functional teams, including data scientists, analysts, and cloud architects, to deliver high-quality solutions
- Ensure the reliability, scalability, and security of data systems
- Leverage cloud platforms to deploy and manage data solutions
- Work with distributed computing technologies like Apache Spark to process and analyze large datasets
- Monitor and debug systems, resolve issues, and optimize performance
- Document technical processes and ensure adherence to best practices
- Stay up to date with the latest advancements in data engineering tools, frameworks, and technologies
Requirements
- 3+ years of experience in a data engineering role
- Proficiency in Python, with solid experience developing robust and scalable code
- Hands-on experience with cloud platforms (e.g., AWS, Microsoft Azure, or Google Cloud Platform) for deploying and managing data solutions
- Expertise in working with Apache Spark/PySpark for large-scale data processing
- Strong understanding of data modeling, ETL processes, and database design
- Familiarity with containerization technologies (e.g., Docker, Kubernetes)
- Knowledge of version control systems such as Git
- Strong command of professional English (B2 level), both written and spoken, to communicate effectively with the engineering team
Nice to have
- Experience with workflow orchestration tools (e.g., Apache Airflow)
- Familiarity with machine learning pipelines and tools
- Knowledge of CI/CD processes for automating deployments