Description
We’re looking for a Senior DevOps Engineer to join the infrastructure team responsible for the stability, reliability, and evolution of Muse Group’s production platforms. This team runs the infrastructure behind our global music products — including high-traffic websites and services used by millions of musicians worldwide. As a senior engineer, you will play a key role in maintaining highly available systems while helping modernize our infrastructure.
We are actively integrating AI and automation into all areas of the business and expect candidates with mindset focused on improving efficiency and deliver a better user experience.
Key responsibilities
- Own and operate CI/CD pipelines (Jenkins, GitHub Actions) and the self-hosted runner fleet (Linux/macOS/Windows); keep deploys and rollbacks reliable and fast.
- Operate Kubernetes (Hetzner) and Helm-managed services; containerize and ship services with Docker.
- Operate and recover the data layer: MySQL/Percona XtraDB (multi-region replication), ClickHouse, Redis/Memcached, Kafka.
- Own the network/edge layer: nginx load balancing, Cloudflare/CDN, DNS, TLS certificates.
- Maintain the observability stack (self-hosted Sentry, Prometheus/Grafana, Graylog) and run its migrations.
- Maintain build infrastructure for mobile/desktop apps
- Serve as infrastructure on-call: incident response, coordination, and root-cause analysis.
- Use AI tooling to accelerate configuration, automation, diagnostics, and documentation.
Requirements
- Production CI/CD: Jenkins + GitHub Actions.
- Kubernetes + Helm; Docker.
- Strong Linux administration; nginx; operating PHP/php-fpm applications.
- MySQL/Percona XtraDB: replication, recovery, troubleshooting.
- Configuration management with Ansible.
- Scripting: Bash + (Python or PHP).
- CDN/Cloudflare or other S3-compatible storage experience.
- MacOS/iOS builds (fastlane, certificates, provisioning, DSYM), Android builds.
- Nexus or other private registries.
- On-call / incident-management experience.
- Practical, daily use of AI coding/ops assistants.
Nice to have
- ClickHouse, Redis/Memcached, Kafka, ZooKeeper.
- Self-hosted Sentry, Prometheus/Grafana, DataDog
- Multi-region infrastructure; Go (for tooling/exporters)