Manager- Site Reliability Engineering
At a glance
Highlights
- Career-defining mission
- Global community
- In-person onboarding
- Well-being support
- Social impact focus
Heads up
- On-call rotation required
Why this role might suit you
The position provides an opportunity to lead a technically diverse SRE team, drive security-focused automation, and influence reliability metrics for critical services within a globally connected organization that emphasizes continuous learning and community involvement.
About the role
Secure Every Identity, from AI to Human
Identity is the key to unlocking the potential of AI. Okta secures AI by building the trusted, neutral infrastructure that enables organizations to safely embrace this new era. This work requires a relentless drive to solve complex challenges with real-world stakes. We are looking for builders and owners who operate with speed and urgency and execute with excellence.
This is an opportunity to do career-defining work. We're all in on this mission. If you are too, let's talk.
At Okta, our motto is "Always On," and nowhere do we embrace that more than in Technical Operations. We strive to build the most reliable and performant systems on the planet through the skillful use of automation. If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you. The ideal candidate is someone who exemplifies the ethic of "If you have to do something more than once, automate it" and who can rapidly self-educate on new concepts and tools.
You will work on:
- Mentoring, managing, and leading a team of SREs with a broad range of expertise and experience.
- Being an evangelist and advocate for security best practices, leading initiatives and projects to strengthen our security posture for our most critical infrastructure.
- Responding to production incidents, driving us to remediation as quickly as possible and determining how we can prevent them in the future.
- Triaging and troubleshooting complex production issues to ensure reliability and performance.
- Working closely with our stakeholders across the organization to ensure our new capabilities are aligned to our competing constraints of reliability, security, and delivery velocity.
- Partnering directly with recruiting and people ops to hire and retain the best talent in the world.
- Keeping a sharp eye on our metrics, including vulnerability scanning and security posture, cloud spend, RPO and RTO, and toil overhead, and ensuring our projects are driving our metrics in the right direction.
- Supporting a 24x7 online environment as part of an on-call rotation.
You are an ideal candidate if you:
- Are always willing to go the extra mile: see a problem, fix the problem.
- Are passionate about encouraging the development of engineering peers and leading by example.
- Have experience managing teams running large-scale production Java/Tomcat and containerized services in AWS (EC2, ECS, KMS, Kinesis, RDS) or other cloud providers.
- Have deep knowledge of CI/CD principles, Linux fundamentals, OS hardening, networking concepts, and IP protocols.
Minimum Required Knowledge, Skills, Abilities, and Qualities:
- 4+ years of experience managing SRE or SWE teams, ideally in a cloud-native environment.
- 13+ years Strong leadership, communication, and project management skills.
- Strong security background and knowledge.
- BS in computer science (or equivalent experience).
The Okta Experience
Supporting Your Well-Being
Driving Social Impact
Developing Talent and Fostering Connection + Community
We are intentional about connection. Our global community, spanning over 20 offices worldwide, is united by a drive to innovate. Your journey begins with an immersive, in-person onboarding experience designed to accelerate your impact and connect you to our mission and team from day one.
If reasonable accommodation is needed to complete any part of the job application, interview process, or onboarding, please use this Form to request an accommodation.
Notice for New York City Applicants & Employees: Okta may use Automated Employment Decision Tools (AEDT), as defined by New York City Local Law 144, that use artificial intelligence, machine learning, or other automated processes to assist in our recruitment and hiring process. In accordance with NYC Local Law 144, if you are an applicant or employee residing in New York City, please click here to view our full NYC AEDT Notice.
Okta is committed to complying with applicable data privacy and security laws and regulations. For more information, please see our Personnel and Job Candidate Privacy Notice at https://www.okta.com/legal/personnel-policy/.