Nuvepro - Task Intelligence for the Enterprise
Anthropic · Safeguards (Trust & Safety) · San Francisco, CA | New York City, NY | Washington, DC

Policy Design Manager, Age-Appropriate Design

Classified Tasks (16)

Automate 0% · Augment 88% · Human-Only 13%

Augment (14)

AI assists, human decides

Develop usage policies governing responsible use of models for emerging capabilities and use cases

leadership

Clarify enforcement guidelines and escalation processes for policy violations

operational

Advise product and engineering teams on safety interventions for products and services

technical

Define best practices for developers building on Claude for deployments across different developmental stages

technical

Design age-assurance policies to protect minors from inappropriate content and interactions

leadership

Establish clear boundaries and guidelines for adult sexual content and experiences

leadership

Design evaluation frameworks to test model performance in child safety, age-assurance, content classification, and adult content areas

analytical

Conduct regular reviews and tests of existing policies to identify and remediate gaps and ambiguities

operational

Review flagged content to inform enforcement actions and drive policy improvements

operational

Update usage policies based on feedback from external experts, enforcement teams, and reviewed edge cases

operational

Collaborate with safeguards product teams to identify and mitigate safety concerns and design age-targeted user interventions

communication

Advise Enforcement, Product, Engineering, and Legal teams on age-assurance approaches and content classification frameworks

leadership

Monitor AI policy norms, regulatory requirements (e.g., age-appropriate design codes), and industry standards and incorporate them into policy decision-making

analytical

Shape policy creation and development to enable safe user interactions and developer integrations

leadership

Human-Only (2)

Requires human judgment

Serve as an internal subject matter expert on child safety, adult content, youth development, and age-appropriate design

leadership

Educate and align internal stakeholders on policies and safety approaches in relevant focus areas

communication

Job description

About Anthropic

Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role

As a Safeguards Policy Design Manager, you will be responsible for developing usage policies, clarifying enforcement guidelines, and advising on safety interventions for our products and services. Your core focus will be on age-appropriate design and experiences, including child safety, age assurance, content classification, and adult sexual content. You will help define best practices for developers building on Claude for deployment to users across different developmental stages, design age-assurance policies that protect minors from inappropriate content and interactions, and establish clear boundaries for adult content and experiences. In addition, you will advise cross-functional teams on opportunities for age-appropriate helpfulness, including beneficial use cases for younger users where appropriate. Safety is core to our mission, and you’ll help shape policy creation and development so that our users can safely interact with and build on top of our products in a harmless, helpful, and honest way.

*Important context for this role: In this position you may be exposed to and engage with explicit content spanning a range of topics, including those of a sexual, violent, or psychologically disturbing nature.

Responsibilities:

Serve as an internal subject matter expert, leveraging deep expertise in child safety, adult content, youth development, and age-appropriate design to:

Draft new policies that help govern the responsible use of our models for emerging capabilities and use cases

Design evaluation frameworks for testing model performance in areas of expertise

Conduct regular reviews and testing of existing policies to identify and address gaps and ambiguities

Review flagged content to drive enforcement and policy improvements

Update our usage policies based on feedback collected from external experts, our enforcement team, and edge cases that you will review

Work with safeguards product teams to identify and mitigate concerns, and collaborate on designing appropriate interventions for users across different age groups

Advise on age assurance approaches and content classification frameworks in partnership with Enforcement, Product, Engineering, and Legal teams

Educate and align internal stakeholders around our policies and our approach to safety in your focus area(s)

Keep up to date with new and existing AI policy norms, regulatory requirements (e.g., age-appropriate design codes), and industry standards, and use these to inform our decision-making on policy areas

You may be a good fit if you have experience:

As a researcher, subject matter expert, or trust & safety professional working in one or more of the following focus areas: child safety, youth online safety, age assurance, developmental science, content classification and rating systems, or adult content policy. Note: For this role, an advanced degree in developmental psychology, child development, education, or a related field is preferred.

Drafting or updating product and/or user policies, with the ability to effectively bridge technical and policy discussions

Designing or implementing age-appropriate experiences, age assurance mechanisms, or content classif
Source: Anthropic careers · scraped 2026-05-22