Talent Pool: UNIX/Linux Support Engineer (On-site) в Quantori — вакансия в геймдеве | OfferClawResponsibilities
- Monitor infrastructure alerts and tickets through ServiceNow and TrueSight.
- Diagnose and resolve filesystem, CPU, memory, and agent issues across Linux and Unix platforms.
- Restart services and daemons, clean up logs or disk space, manage processes, and apply configuration corrections.
- Perform standard operational changes, including agent reinstalls, disk extensions, and configuration updates.
- Support VMware ESX hosts (Dell/Cisco UCS), perform VM health checks, validate resource and datastore usage.
- Monitor and troubleshoot GPFS (IBM Spectrum Scale) and LSF workload schedulers.
- Provide basic OpenShift and RStudio support for system availability.
- Execute and enhance Ansible Tower and Red Hat Satellite workflows for patching and configuration management.
- Maintain and improve Puppet modules and shell scripts for operational efficiency.
- Ensure alerts are validated; tickets are updated and resolved in accordance with SLAs.
- Perform hardware interventions, including disk replacements, NIC reseating, and console access.
- Coordinate hardware replacements with Dell, IBM, Cisco, and HP vendors.
- Validate data center connectivity, participate in DR testing, and ensure accurate asset documentation.
- Participate in the on-call rotation approximately once per month, providing after-hours support for critical production incidents.
- Serve as an escalation point for critical system alerts during off-hours to ensure service continuity.
- Coordinate with data center or vendor teams for emergency actions when needed.
Requirements
- 3–6 years of experience in enterprise Linux/Unix administration and support.
- Strong experience in administering RHEL 6–9 and Ubuntu 20/22/24, with working knowledge of legacy AIX, HP-UX, and Solaris systems.
- Confident in managing Dell PowerEdge, IBM Frame, and Cisco UCS hardware, including iDRAC/iLO operations and rack-level maintenance.
- Hands-on experience with VMware ESX (console operations, vMotion, datastore validation).
- Familiarity with clustered storage environments such as GPFS (Spectrum Scale) and workload scheduling with IBM LSF.
- Automation experience with Ansible Tower, Red Hat Satellite, Puppet, and shell scripting
- Practical experience with ServiceNow for incident/change management and TrueSight/PATROL for monitoring.
- A good understanding of networking fundamentals,such as VLANs, bonding, and interface validation.
- Red Hat Certified System Administrator (RHCSA) or equivalent certification
- ITIL v4 Foundation certification.
- Strong analytical, documentation, and communication skills.
- Eligibility for compliance-based background checks (PHI exposure).
- Availability for working from 9 AM – 5 PM EST (On-site).
We offer
- Competitive compensation
- Remote or office work
- Flexible working hours
- Healthcare benefits: medical insurance and paid sick leave
- Continuous education, mentoring, and professional development programs
- A team with an excellent tech expertise
- Certifications paid by the company