Principal Site Reliability Engineer
Zscaler • Remote - USA; San Jose, California, USA
Posted: May 1, 2026
Job Description
Role
We are looking for a Principal Site Reliability Engineer to join our Infrastructure Services and Architecture team. This role is either hybrid, with 3 days a week onsite in San Jose, CA, or remote, and reports to the Senior Manager, Cloud Operations. As a lead engineer, you will leverage deep expertise in IaC/CaC, Linux virtualization, and physical hardware management to drive our infrastructure forward. You will oversee networking services and general software engineering to ensure our systems remain scalable, resilient, and high-performing.
What You’ll Do (Role Expectations)
- Mentor junior engineers and lead high-impact infrastructure projects
- Support business operations by responding to alerts and triaging systems to restore mission-critical capabilities
- Ensure the reliability and performance of all customer-facing services
- Design, document, and build technical solutions that align with evolving organizational needs
- Support Agile processes to maintain high velocity and a collaborative environment
Who You Are (Success Profile)
- You possess deep expertise in Kubernetes management and deployment, ensuring containerized environments are optimized and secure.
- You are a champion for Infrastructure/Configuration as Code, maintaining a strict IaC/CaC mindset in every solution you build.
- You operate with a security-first focus, embedding protection and compliance into the foundation of the infrastructure.
- You bring a "go-getter" attitude, proactively identifying problem areas and proposing innovative solutions.
- You are highly proficient in CI/CD pipelines and believe that automated testing is non-negotiable for stable releases.
What We’re Looking For (Minimum Qualifications)
- Expert-level proficiency with Kubernetes
- Deep professional experience with Terraform and Ansible
- Expert-level programming skills in Python or Go
- Hands-on experience with Enterprise Linux distributions such as Rocky, Red Hat, or Alma
- Proven experience using Git within a structured SDLC
- U.S. citizenship due to the nature of the customers assigned to this role
What Will Make You Stand Out (Preferred Qualifications)
- Deep knowledge of Linux Hypervisors, including OpenStack, Proxmox, libvirt, or QEMU
- Technical experience working with FreeBSD
- Familiarity with Identity and Access Management (IAM) tools and protocols such as HashiCorp Vault, LDAP, or OIDC