Senior Site Reliability Engineer - Security and Data Systems

Bellevue, WashingtonFull-TimeSeniorDevOps

You will be redirected to the company career page

Senior Site Reliability Engineer (SRE) - Security and Data Systems

  • Our company is seeking a highly skilled Senior Site Reliability Engineer to join our team. We are a SaaS company specializing in securing large-scale systems. This role is a blend of software engineering and systems administration, where you'll be responsible for building and maintaining highly reliable, scalable, and secure infrastructure. You will be a key contributor, applying your expertise to automate manual processes and proactively solve complex problems before they become incidents, handling incidents, and includes on-call shifts.

Responsibilities

  • Platform & Reliability: Design, build, and maintain the core infrastructure that underpins our security SaaS offerings, ensuring high availability, performance, and scalability. This includes building and operating the tooling for our Snowflake data systems.
  • Automation: Develop robust automation using code to eliminate toil and ensure consistency across our environments. You'll be a key driver in automating everything from infrastructure provisioning to application deployment and incident response.
  • Security & Compliance: Work closely with our security teams to embed a security-first mindset into all our processes and infrastructure. You will be responsible for ensuring our systems and data platforms are compliant with industry standards.
  • Incident Response: Participate in on-call rotations and be a primary responder for critical incidents, leading root cause analysis and implementing preventative measures to ensure issues don't recur.
  • Collaboration: Partner with development, data science, and security teams to provide expert guidance on architectural decisions, best practices, and the implementation of new services.

Key Skills & Qualifications

  • Strong Coding Skills: You are a developer at heart and are comfortable writing production-level code to solve complex operational challenges.
  • Infrastructure as Code (IaC): Deep experience with Terraform for provisioning and managing cloud infrastructure and services.
  • Continuous Delivery: Familiarity with modern CI/CD practices and tools, particularly Spinnaker, to automate and standardize our release pipelines.
  • Containerization & Orchestration: Expertise in container technologies and hands-on experience managing large-scale, production-ready clusters with Kubernetes.
  • Database Migrations: Experience with database schema management tools like Flyway for safely and reliably handling database changes.
  • Data Systems: Direct experience with large-scale data systems, specifically with the Snowflake platform.
  • AI/ML Experience (a plus): Experience or a strong interest in AI/ML, particularly how these technologies can be applied to improve reliability, security, and operational efficiency (e.g., AIOps, predictive analysis).
  • Problem-Solving: Excellent analytical and problem-solving skills with a proactive approach to identifying and addressing potential issues.

This role requires in-person onboarding and travel to our San Francisco Office during the first week of employment.

  • #LI-Hybrid#LI-TM(P18058_3355591)
  • Below is the annual base salary range for candidates located in California (excluding San Francisco Bay Area), Colorado, Illinois, New York and Washington. Your actual base salary will depend on factors such as your skills, qualifications, experience, and work location. In addition, Okta offers equity (where applicable), bonus, and benefits, including health, dental and vision insurance, 401(k), flexible spending account, and paid leave (including PTO and parental leave) in accordance with our applicable plans and policies. To learn more about our Total Rewards program please visit: https://rewards.okta.com/us.

What you can look forward to as a Full-Time Okta employee!

  • Amazing Benefits
  • Making Social Impact
  • Developing Talent and Fostering Connection + Community at Okta
  • Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today! https://www.okta.com/company/careers/.Some roles may require travel to one of our office locations for in-person onboarding.

Job Summary

CompanyOkta
LocationBellevue, Washington
TypeFull-Time
LevelSenior
DomainDevOps
Senior Site Reliability Engineer - Security and Data Systems at Okta (Bellevue, Washington) | WorkWay