EPAM is hiring a Senior Site Reliability Engineer to join the team in Vietnam. You’ll design and optimize infrastructure, automate processes and ensure the reliability of our education platforms. At EPAM, engineering is in our DNA, so when you join our growing team, you will work with top global clients and make significant contributions to the ever-changing technology landscape that keeps us, our communities and our clients moving forward.
Responsibilities
- Design and maintain robust on-premises infrastructure to support business operations
- Automate repetitive tasks using scripting languages to reduce manual effort and errors
- Manage deployment, monitoring and administration tools for on-premises and cloud systems
- Monitor system performance, troubleshoot proactively and ensure high availability
- Perform capacity planning and scalability assessments to support business growth
- Administer Kubernetes and containerization technologies
- Uphold security best practices and compliance in production environments
- Recommend new technologies to improve reliability, performance and efficiency
- Document all processes, configurations and procedures for system management
Requirements
- At least 3 years of experience in an Application Support, SRE or DevOps role
- Strong background in software engineering and system administration (Windows and Linux)
- Proficiency in scripting/programming languages (PowerShell)
- Experience with database management, especially SQL and MSSQL database monitoring
- Hands-on experience with virtualization and containerization technologies (VMware, Hyper-V, Docker, Kubernetes)
- Familiarity with ITSM processes and procedures
- Good problem-solving skills and experience with Agile methodologies and DevOps practices
- Strong English communication skills and experience interacting directly with business users