We're looking for a Senior Python Data Engineer (Dataiku) to design, implement and optimize a Client Tax Data Provider application that will serve as the strategic data integration layer for client tax reporting obligations. Built on S3-compatible storage and Dataiku DSS, this solution will streamline data sourcing, enforce standards, and ensure compliance with EasyTax delivery requirements across jurisdictions. This is a hands-on engineering role ideal for professionals who thrive on building robust, scalable data pipelines within regulated environments and can ensure quality, security and accuracy throughout the delivery lifecycle.
This is a hybrid role based in Madrid, ideal for those eager to thrive in a dynamic environment and make a significant impact in private banking technology. Join EPAM and contribute to shaping the future of financial services in Spain!
Responsibilities
- Design and develop a tax data provider application leveraging Dataiku DSS, Python and S3-based storage
- Implement data pipelines for ingestion, transformation, normalization and validation of client tax-relevant data
- Integrate data from multiple banking systems in line with architectural standards and migration objectives
- Enable secure data access and enforce strict access control mechanisms
- Supply tax-relevant data to downstream consumers including tax calculation engines (e.g., EasyTax), reporting tools and analytics platforms
- Ensure continuity of historical reporting by retaining a read-only legacy tax database while enabling seamless migration to new structures
- Optimize performance through efficient use of processing engines like PySpark and Trino
- Collaborate with stakeholders to align technical solutions with regulatory and compliance requirements
- Maintain clear documentation to support sustainability and audit-readiness
Requirements
- Proven experience in Python for complex data transformation, API integration and Dataiku automation
- Strong expertise with Dataiku DSS for data processing and integration
- Hands-on experience with distributed processing engines such as Spark/PySpark or Trino
- Solid understanding of S3-compatible object storage and integration practices
- Familiarity with Oracle database and SQL for advanced queries
- Knowledge of data security, access control and data governance frameworks
- Ability to design and deliver resilient, maintainable data pipelines in financial services or similarly regulated environments
- Strong problem-solving abilities with attention to detail and quality
- Excellent communication skills to collaborate across technical and business teams
- Fluent in English, written and spoken
Nice to have
- Experience in tax reporting or financial data domains
- Exposure to EasyTax or similar tax calculation engines
- Familiarity with core banking transformation programs and legacy migration strategies
- Knowledge of regulatory compliance requirements related to data processing and storage