We are looking for a DevOps Engineer with 3–5 years of experience to join our team in Bangalore. The ideal candidate will have strong expertise in observability, log management, and infrastructure automation, with hands-on experience in building scalable monitoring and alerting solutions. This role requires working in a night shift and follows a hybrid work model (3 days in office).
Essential functions
- Design, implement, and maintain scalable log ingestion and observability platforms.
- Deploy and manage ClickHouse for high-performance log storage and analytics.
- Build and maintain dashboards and monitoring solutions using Grafana.
- Implement distributed tracing and telemetry using OpenTelemetry (OTEL).
- Configure and optimize log ingestion, processing, and retention pipelines.
- Develop and manage automated alerting solutions using Everbridge or similar enterprise alerting platforms.
- Manage multi-tenant observability environments with appropriate access controls.
- Automate log retention policies, archival, and lifecycle management.
- Collaborate with engineering and security teams to improve system reliability and operational efficiency.
- Perform production support, incident response, troubleshooting, and root cause analysis.
Qualifications
- 3–5 years of experience in DevOps, SRE, or Platform Engineering.
- Hands-on experience with:
- ClickHouse
- Grafana
- OpenTelemetry (OTEL)
- Log ingestion and observability pipelines
- Monitoring and alerting platforms
- Experience with enterprise alerting tools such as Everbridge or equivalent.
- Strong understanding of multi-tenant environments and observability architecture.
- Experience with retention policy automation and data lifecycle management.
- Good knowledge of Linux, networking, and cloud infrastructure.
- Scripting experience using Bash, Python, or similar.
- Willingness to work night shifts.
- Ability to work in a hybrid model (3 days from the Bangalore office).
- Immediate to 30 days' notice period preferred.
Would be a plus
- Experience in SecOps or security monitoring environments.
- Exposure to cloud platforms such as AWS, Azure, or GCP.
- Experience with Kubernetes, Docker, Terraform, or Ansible.
- Familiarity with CI/CD pipelines and Infrastructure as Code (IaC).
- Experience working in high-availability, production-scale environments.
We offer
- Opportunity to work on bleeding-edge projects
- Work with a highly motivated and dedicated team
- Competitive salary
- Flexible schedule
- Benefits package - medical insurance, sports
- Corporate social events
- Professional development opportunities
- Well-equipped office
About us
Grid Dynamics (NASDAQ: GDYN) is a leading provider of technology consulting, platform and product engineering, AI,
and advanced analytics services. Fusing technical vision with business acumen, we solve the most pressing technical
challenges and enable positive business outcomes for enterprise companies undergoing business transformation.
A key differentiator for Grid Dynamics is our 8 years of experience and leadership in
enterprise AI, supported by profound expertise and ongoing investment in
data,
analytics,
cloud & DevOps,
application modernization
and
customer experience.
Founded in 2006, Grid Dynamics is headquartered in Silicon Valley with offices across the Americas, Europe, and India.