Skip to content

Model Policy Manager

OpenAI

San Francisco, USremote country$207k-$295k/yrPosted May 13, 2026

At a glance

Highlights

  • safety-focused research
  • high-impact policy work
  • collaboration with leading ai researchers
  • hybrid work model
  • competitive compensation

Why this role might suit you

The role allows a professional to design and enforce safety policies for frontier AI models, work with top researchers, and influence real-world AI risk mitigation within a mission-driven organization.

Skills

policy-developmentrisk-assessmentthreat-modelingbehavioral-specificationevaluation-criteriared-teamingdeployment-data-analysismodel-failure-analysishuman-data-campaignsgold-set-constructionlabeling-guidancecalibrationadjudicationsystem-cardssafety-reportsmodel-spec

About the role

About the Team

Our Safety Systems https://openai.com/safety/safety-systems team is at the forefront of OpenAI's mission to build and deploy safe AGI, driving our commitment to AI safety and fostering a culture of trust and transparency.

Within Safety Systems, the Model Policy team aligns model behavior with desired human values and norms. We co-design policy with models and for models by driving rapid policy taxonomy iteration based on data and defining evaluation criteria for foundational models’ ability to reason about safety.

About the Role

If you have a specific expertise or speciality related to this work, please note it in your application via your resume, cover letter or application note.

Frontier AI systems are expanding what people can do across domains, creating both enormous opportunities and difficult safety questions: when should a model help, when should it refuse, and how do we make those boundaries clear enough to train, evaluate, and enforce?

In this role, you will help define how OpenAI’s models should behave in high-risk or high-ambiguity contexts, such as agentic systems, multimodal systems, user safety, privacy, and other emerging risk domains.

This is an ideal role for someone who can move across unfamiliar topics, reason from first principles, and turn ambiguity into practical model behavior. You will work closely with research, engineering, product, preparedness, and operations teams to build policies that are technically grounded, measurable, and responsive to real-world risk.

In this role, you will:

- Design and maintain model policies across safety-relevant domains, including dual-use, agentic, and emerging frontier-risk areas.

- Translate risk and harm models into clear behavioral specifications, evaluation criteria, grading guidance, and system-level safeguards.

- Define practical boundaries between beneficial uses of AI and assistance that could materially enable harm, exploitation, misuse, or unsafe outcomes.

- Build policy artifacts that support model training, evaluation, and deployment.Partner with safety researchers, engineers, product teams, and other stakeholders to operationalize policy into scalable model behavior and measurable safeguards.

- Use red-teaming results, deployment data, model failures, over-refusals, under-refusals, and ambiguous edge cases to improve policy and evaluation quality over time.

- Identify emerging capability areas where frontier AI systems could create new safety challenges or lower barriers to harm.

- Study real-world deployments to identify where model behavior succeeds, fails, or drifts from the intended safety posture.

- Combine longer-horizon safety research with hands-on launch and deployment work.

- Contribute to system cards, safety reports, policy documentation, launch reviews, and external communications on OpenAI's approach to model safety and risk mitigation.

- Design and run human data campaigns, including gold set construction, labeling guidance, calibration, adjudication, and eval coverage analysis, to ensure policies can be reliably measured and improved.

You might thrive in this role if you:

- Have strong judgment about how advanced AI systems may affect real-world risk, especially in ambiguous, fast-moving, or high-impact areas.

- Have experience building or applying policies, taxonomies, harm models, threat models, or risk frameworks for complex technical, social, or adversarial systems.

- Can move across domains without needing to be the deepest subject-matter expert in every area, while knowing when to seek expert input.

- Can turn fuzzy questions into structured policy frameworks, evaluation criteria, operational guidance, and enforceable model behavior.

- Are comfortable using empirical evidence, including evaluations, red-teaming results, deployment observations, and model failure modes, to inform policy decisions.

- Think in systems across policy, data, graders, classifiers, training, deployment safeguards, measurement, monitoring, and escalation workflows.

- Have technical judgment about what model behavior can realistically be trained, measured, evaluated, and enforced at scale.

- Work well across research, engineering, product, policy, domain experts, and operational teams.

- Write clearly about complex tradeoffs where safety, user value, and implementation constraints all matter.

- Take a pragmatic approach to safety, focused on reducing real-world risk while preserving legitimate, beneficial, and socially valuable uses of AI.

- Enjoy fast-paced, collaborative research environments where priorities shift as models, evidence, and risks change.

- Stay grounded in implementation details, empirical results, and what can actually be trained or measured.

Our relevant publications:

- Accelerating the cyber defense ecosystem that protects us all https://openai.com/index/accelerating-cyber-defense-ecosystem/

- Trusted Access https://openai.com/index/scaling-trusted-access-for-cyber-defense/

- Safety at every step https://openai.com/safety/

- Safety evaluations hub https://openai.com/safety/evaluations-hub/

- OpenAI GPT5.5 System Card https://openai.com/index/gpt-5-5-system-card/

- Improving Model Safety Behavior with Rule-Based Rewards https://openai.com/index/improving-model-safety-behavior-with-rule-based-rewards/

- OpenAI Model Spec https://openai.com/index/introducing-the-model-spec/

Workplace & Location

This role is based in our San Francisco office. We do encourage you to apply even if you prefer a different work location as factors may change over time.

We offer relocation support to new employees, and we use a hybrid model: three days in the office per week with optional work from home on Thursdays and Fridays.

Our open-plan offices have height-adjustable desks, conference rooms, phone booths, well-stocked kitchens full of snacks and drinks, three in-house prepared meals daily, a private outdoor space for working in the sun or socializing, nap rooms, private bike storage, and more.

About OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.

To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form https://form.asana.com/?d=57018692298241&k=5MqR40fZd7jlxVUh5J-UeA. No response will be provided to inquiries unrelated to job posting compliance.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link https://form.asana.com/?k=bQ7w9h3iexRlicUdWRiwvg&d=57018692298241.

OpenAI Global Applicant Privacy Policy https://cdn.openai.com/policies/global-employee-and-contractor-privacy-policy.pdf

At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.

Compensation

This Other role pays $207k-$295k/yr. Within typical range for other roles in United States.

Questions about this role

  • How do I apply to this Model Policy Manager role at OpenAI?

    Click "Apply with AI Applyd" above. We auto-fill the application from your resume and answer screening questions in seconds. No copy and paste, no juggling tabs.

  • What's the typical salary for Other in United States?

    Compensation for Other roles in United States varies widely by seniority, employer size, and remote vs onsite arrangement. Check the salary range on this listing when published, or browse our Other hub for United States medians across recent openings.

  • How fast does AI Applyd auto-apply?

    Most applications complete in under 90 seconds. You can track the status in your dashboard and watch the screenshot proof land the moment the application submits.

  • What ATS does OpenAI use?

    AI Applyd supports Greenhouse, Lever, Ashby, Workday, iCIMS, SmartRecruiters, LinkedIn Easy Apply, and most other ATS platforms. If we can submit through the platform, we do.

Want AI Applyd to auto-apply to roles like this?

We tailor your resume per posting, fill the forms, and track replies for you.