We are seeking a Lead Platform Engineer to design and maintain internal platforms that empower development teams with self-service deployment and management capabilities.
You will focus on creating standardized tools and frameworks that improve workflows and promote best practices across multiple teams. This role involves collaboration with various teams to ensure platform reliability, high availability, and seamless integration of development tools. Join us to contribute to enhancing the developer experience and advancing platform performance.
Responsibilities
- Design and maintain internal platforms that enable self-service deployment and application management for development teams
- Develop and update standardized tools, libraries, and frameworks to optimize development workflows
- Integrate diverse development tools and services within the platform ecosystem to support multiple teams
- Enhance developer experience by providing comprehensive documentation, training, and support for platform tools
- Implement strategies to ensure platforms deliver high availability, reliability, and performance
- Apply DORA metrics to assess and improve development and operational processes
- Monitor industry trends to incorporate emerging technologies and boost platform capabilities
- Collaborate with development, operations, and security teams to facilitate smooth application integration
- Offer technical guidance to resolve platform issues and enhance system health
Requirements
- Minimum of 5 years experience in platform engineering or a related field
- Strong knowledge of software development processes, CI/CD pipelines, and DevOps methodologies
- Proficiency with cloud platforms such as AWS, Azure, or Google Cloud and container technologies including Docker and Kubernetes
- Experience with infrastructure as code tools like Terraform, CloudFormation, or Bicep and GitOps solutions such as Flux or ArgoCD
- Familiarity with Crossplane, Pulumi, and AWS Controllers for Kubernetes to manage cloud resources
- Experience using internal developer platforms like Backstage, Humanitec, or OpsLevel to improve developer productivity
- Understanding of high availability, high performance multi-data center systems and hybrid cloud environments
- Scripting skills in Bash, Shell, or Python
- Strong troubleshooting abilities covering system resources and application stack issues
- Proficient English communication skills in speaking, writing, and reading