Principal Site Reliability Engineer

Zscaler • Remote - USA; San Jose, California, USA

No Relocation

Posted: May 1, 2026

Job Description

Role

We are looking for a Principal Site Reliability Engineer to join our Infrastructure services and architecture team. The role is either hybrid, with 3 days a week onsite in San Jose, CA, or remote, and it reports to the Senior Manager, Cloud Operations. As a lead engineer, you will apply deep expertise in Infrastructure as Code and Configuration as Code (IaC/CaC), Linux virtualization, and physical hardware management to drive our infrastructure forward. You will also oversee networking services and contribute general software engineering to keep our systems scalable, resilient, and high-performing.

What You’ll Do (Role Expectations)

Mentor junior engineers and lead high-impact infrastructure projects
Support business operations by responding to alerts and triaging systems to restore mission-critical capabilities
Ensure the reliability and performance of all customer-facing services
Design, document, and build technical solutions that align with evolving organizational needs
Support Agile processes to maintain high velocity and a collaborative environment

Who You Are (Success Profile)

You possess deep expertise in Kubernetes management and deployment, ensuring containerized environments are optimized and secure.
You are a champion for Infrastructure/Configuration as Code, maintaining a strict IaC/CaC mindset in every solution you build.
You operate with a security-first focus, embedding protection and compliance into the foundation of the infrastructure.
You bring a "go-getter" attitude, proactively identifying problem areas and proposing innovative solutions.
You are highly proficient in CI/CD pipelines and believe that automated testing is non-negotiable for stable releases.

What We’re Looking for (Minimum Qualifications)

Expert-level proficiency with Kubernetes
Deep professional experience with Terraform and Ansible
Expert-level programming skills in Python or Go
Hands-on experience with Enterprise Linux distributions such as Rocky, Red Hat, or Alma
Proven experience using Git within a structured SDLC
U.S. citizenship due to the nature of the customers assigned to this role

What Will Make You Stand Out (Preferred Qualifications)

Deep knowledge of Linux hypervisors and virtualization platforms, such as OpenStack, Proxmox, libvirt, or QEMU
Technical experience working with FreeBSD
Familiarity with Identity and Access Management tools and protocols such as HashiCorp Vault, LDAP, or OIDC
