We are searching for a Senior Data Software Engineer to join our progressive team. If you are looking to give your career a real boost with a global leader in digital transformation, EPAM is the perfect choice. If you’d like to learn more about this position and project, apply now.
Responsibilities
- Design, construct, install, test, and maintain highly scalable and optimized data pipelines with state-of-the-art monitoring and logging practices
- Bring together large, complex, and sparse data sets to meet functional and non-functional business requirements, and use a variety of languages, tools, and frameworks to marry data
- Design and implement data tools for analytics and data scientist team members to help them in building, optimizing, and tuning use cases
- Tackle challenging and varied problems related to gaining further maturity in Data Platform tools, processes, and engineering capabilities
- Focus on engineering excellence to deliver high-quality solutions that provide leverage for the company's objectives
Requirements
- 3+ years of hands-on experience in data processing focused projects
- Proficiency with Java, Python, or Scala, and SQL
- Knowledge of Apache Spark
- Experience with one of the major cloud providers: AWS, Azure, or GCP
- Hands-on experience with selected data processing technologies, such as Hadoop, MongoDB, Cassandra, Kafka, Elasticsearch, Python libraries (e.g., Pandas, NumPy), and data processing tools from cloud providers (e.g., EMR, Glue, Data Factory, BigTable)
- Relevant experience with version control and code review
- Knowledge of Agile methodologies
- Linux and Bash scripting basics
- English language proficiency at an Upper-Intermediate level (B2) or higher
Nice to have
- Good hands-on experience with Databricks and Delta Lake
- Ability to build Apache Airflow pipelines
- Experience with the Snowflake platform