Senior Software Development Engineer, AWS Mantle

Amazon Web Services

Seattle, US | Onsite | $168k-$227k/yr | Posted Jun 16, 2026

Skills

openai, aws, ml

About the role

DESCRIPTION

We're looking for an experienced Senior Software Development Engineer to help build and scale the distributed inference engine that powers Amazon Bedrock. As part of the AWS Mantle team, you will design and deliver critical systems that enable millions of customers to access the world's leading foundation models—securely, reliably, and at global scale. This is an opportunity to work on one of the most impactful AI infrastructure platforms at AWS, where your code and design decisions will directly shape how generative AI is served to enterprises worldwide.

Design, build, and operate high-performance distributed systems that serve ML inference at massive scale across all AWS regions

Own the end-to-end delivery of complex features—from requirements through design, implementation, testing, deployment, and production operations

Collaborate with cross-functional teams to solve challenging problems in capacity management, model serving, and API compatibility

Contribute to a culture of engineering excellence by writing clean, maintainable code and driving continuous improvement in system reliability

Influence technical direction within your team while contributing to broader architectural discussions across Mantle and Amazon Bedrock

Key job responsibilities

As a Senior SDE on the Mantle team, you will be a hands-on technical leader who owns significant components of our inference platform. You will balance deep technical execution with thoughtful design, delivering solutions that are scalable, secure, and operationally excellent—while mentoring teammates and raising the bar for the team.

Design and implement core components of Mantle's distributed inference engine, including request routing, load balancing, model lifecycle management, and quality-of-service enforcement

Build and operate services that onboard new foundation models rapidly while maintaining strict performance SLAs and Zero Operator Access (ZOA) security guarantees

Drive operational excellence by owning your team's services in production—monitoring, alarming, incident response, and continuous reliability improvement

Partner with applied scientists, ML engineers, and partner teams to integrate new model architectures and optimize inference performance across the GPU/accelerator fleet

Mentor junior and mid-level engineers through code reviews, design reviews, and hands-on guidance that elevates the team's technical capabilities

About the team

The AWS Mantle team is building the next-generation inference engine that powers Amazon Bedrock—providing secure, enterprise-grade access to high-performing foundation models from the world's leading AI companies. Our mission is to simplify and accelerate how models are served at global scale, with an unwavering commitment to customer trust through innovations like our Zero Operator Access architecture, designed so that no person—whether from AWS, a customer, or a model provider—can ever access customer inference data.

We operate at massive scale, serving inference requests across all major AWS regions with sophisticated automated capacity management and unified resource pools

Our team values builders who thrive in ambiguity, think long-term, and are excited to define the future of AI infrastructure from the ground up

We foster a collaborative, inclusive environment where diverse perspectives drive better solutions—and where the best ideas win regardless of where they originate

We ship fast and iterate with purpose, having rapidly expanded from launch to supporting models from OpenAI, DeepSeek, Google, Mistral, NVIDIA, and more

We believe work should be meaningful and fun—you'll join a team that takes pride in making history at the forefront of generative AI

BASIC QUALIFICATIONS

5+ years of non-internship professional software development experience

5+ years of programming experience with at least one software programming language

5+ years of experience leading design or architecture (design patterns, reliability, and scaling) of new and existing systems

Experience as a mentor, tech lead or leading an engineering team

Bachelor's degree in Computer Science, Engineering, a related field, or equivalent experience

PREFERRED QUALIFICATIONS

Master's degree in computer science, machine learning, engineering, or related fields

Experience in machine learning, data mining, information retrieval, statistics or natural language processing, or experience in developing and deploying LLMs in production on GPUs, Neuron, TPU or other AI acceleration hardware

Experience designing APIs at scale, particularly RESTful or streaming APIs with strict latency and availability requirements

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, WA, Seattle - 168,100.00 - 227,400.00 USD annually

Compensation

This Software Engineer role pays $168k-$227k/yr, which is within the typical range for software engineer roles in the United States.
