We are seeking a Senior Data Software Engineer. If you are looking to give your career a real boost with a global leader in digital transformation, EPAM is the perfect choice. If you’d like to learn more about this position and project, apply now.
Responsibilities
- Design, construct, install, test and maintain highly scalable and optimized data pipelines with state-of-the-art monitoring and logging practices
- Bring together large, complex, and sparse data sets to meet functional and non-functional business requirements and use a variety of languages, tools and frameworks to marry data
- Design and implement data tools for analytics and data scientist team members to help them build, optimize, and tune use cases
- Tackle challenging and varied problems related to gaining further maturity in Data Platform tools, processes, and engineering capabilities
- Focus on engineering excellence to deliver high-quality solutions that provide leverage for the company's objectives
Requirements
- 3+ years of hands-on experience with data processing-focused projects
- Proficiency in Java/Python/Scala and SQL
- Knowledge of Apache Spark
- Experience with one of the major cloud providers: AWS, Azure, or GCP
- Hands-on experience with selected data processing technologies, e.g., Hadoop, MongoDB, Cassandra, Kafka, Elasticsearch, Python libraries (Pandas/NumPy/etc.), and data processing tools of cloud providers (EMR, Glue, Data Factory, Big Table, etc.)
- Relevant experience with version control and code review
- Understanding of Agile methodologies
- Basic knowledge of Linux and Bash scripting
- Fluency in English, both written and spoken, at a minimum B2 level
Nice to have
- Good hands-on experience with Databricks and Delta Lake
- Ability to build Apache Airflow pipelines
- Experience with the Snowflake platform