Emplacement: India
Your tasks
- Administer and optimize Enterprise Linux systems across global data centers, ensuring uptime, performance, and adherence to security best practices (e.g., SELinux, firewallD, OpenSCAP).
- Deploy, configure, and maintain Linux-based applications such as HAProxy, NGINX, and other critical services, focusing on high-availability, scalability, and performance optimization.
- Design, implement, and operate monitoring and logging solutions using Prometheus, Sensu, OpsGenie, Grafana, and Loki to ensure visibility, proactive alerting, and rapid incident resolution.
- Drive automation with Infrastructure as Code (IaC) using Ansible to enable repeatable, reliable, and scalable deployments and configurations.
- Manage Identity Management solutions like FreeIPA and integrate them securely into the infrastructure, supporting authentication and access control across services.
- Assist in operating containerized workloads (Docker/Podman) and Kubernetes environments as part of a broader infrastructure strategy
Your profile
- Solid, hands-on expertise in Linux system administration (RHEL-based and Debian-based), including troubleshooting, tuning, and lifecycle management â this is the foundation of the role.
- Proven experience with Linux application services such as HAProxy and NGINX, including high-availability design and performance tuning.
- Practical experience with centralized monitoring and logging tools (Prometheus, Grafana Loki, Sensu, OpsGenie) in production environments.
- Strong skills in automation using Ansible and familiarity with Infrastructure as Code principles.
- Knowledge of Linux security and compliance frameworks (SELinux, firewallD, OpenSCAP) and distributed failover/high-availability strategies.
- Fluent English communication skills (written & spoken) with the ability to document clearly, collaborate effectively, and solve complex problems