Skip to content

Backend Engineer - Studio Media Platform

Sarvam

Bengaluru, INonsitePosted Jun 1, 2026

Skills

postgreskubernetesprometheussqlalchemygrafanapytorchdockerpythonazurehelmcicdgooglecloudnaturallanguageprocessingawsllmml

About the role

About Sarvam

Sarvam is building the bedrock of Sovereign AI for India. The company is developing India’s full-stack sovereign AI platform, building across research, models, infrastructure and applications with a singular focus on making AI genuinely work for India. Sarvam works with leading enterprises and public institutions and is backed by Lightspeed, Peak XV, and Khosla Ventures. Sarvam partners with India’s leading brands, including Tata Capital, SBI Life, CRED, IDFC, and LIC.

About the Role

We are hiring a Backend Engineer to work across Sarvam’s Studio media platform — spanning AI dubbing, live translation, and the shared service foundation that powers all Studio products (voice cloning, stem separation, lip sync, music generation, and more). You will build and maintain production services, ML pipeline libraries, and platform SDKs that together enable multilingual media processing at scale for enterprise customers and Sarvam Studio users.

The work cuts across multiple codebases: a core ML pipeline library (ASR, translation, TTS, audio processing), production services for dubbing and live translation, and a shared platform SDK that provides common capabilities to every Studio service.

What You’ll Do

Service & Infrastructure

Design and optimize production FastAPI services for dubbing and live translation — multi-stage task orchestration, rate-limited scheduling, and backpressure controls for concurrent workloads

Build and maintain distributed worker architectures with independent scaling per pipeline stage and automatic recovery of stuck or failed tasks

Own the data layer — async ORM models, schema migrations, and query optimization on PostgreSQL

Implement real-time features — WebSocket-based job tracking for dubbing and streaming audio pipelines for live translation

Manage Kubernetes deployments — Helm charts, secrets management, ingress configuration, and multi-role container images

ML Pipeline & Library

Extend and maintain the core dubbing library across all pipeline stages: audio extraction, VAD, speech recognition, translation, QC, TTS, and final video stitching

Integrate and optimize ML model serving — remote inference server clients and local model inference for audio analysis and vocal separation

Build and improve QC orchestration — automated scoring, tempo analysis, guided normalization, and pronunciation verification

Design async-first pipelines with efficient concurrency patterns for CPU-bound audio processing

Maintain and evolve LLM integration layers for translation, QC, and pre-processing across multiple provider backends

Platform SDK & Shared Services

Build and maintain the shared Studio service SDK — reusable FastAPI middleware and routers for authentication, billing, workspace isolation, and input validation

Design media storage abstractions — upload, signed URL generation, retention policies, and cloud blob storage integration

Implement cross-cutting concerns: rate limiting, metering, audit trails, and request history across all Studio services

Build observability foundations — OpenTelemetry instrumentation, structured logging, and metrics collection shared across services

Quality & Operations

Maintain high test coverage with strong CI gates, parallel test execution, and thorough mocking of external services

nstrument services with custom metrics and structured error tracking for production observability

Manage CI/CD pipelines — automated testing, linting, container builds, artifact publishing, and version management

What We’re Looking For

4–6 years of experience in backend engineering, with a focus on building and operating production services at scale

Strong proficiency in Python with hands-on experience building production FastAPI or similar async web services (non-negotiable)

Deep understanding of async programming — asyncio, concurrent execution patterns, and designing for high-throughput workloads

Experience with distributed task systems: task queues (Celery or similar), message brokers, and designing fault-tolerant job orchestration

Hands-on with PostgreSQL and an async ORM (SQLAlchemy preferred) — comfortable with query optimization, schema design, and migrations

Familiarity with audio/media processing: FFmpeg, common audio formats, and processing libraries like soundfile or librosa

Experience integrating ML models into production — API-based (REST/gRPC) or local inference (PyTorch, ONNX Runtime)

Experience building reusable libraries or SDKs — designing clean APIs, managing backward compatibility, and publishing packages for internal consumers

Proficiency with Docker, Kubernetes, Helm charts, and at least one major cloud platform (Azure/GCP/AWS)

Strong testing discipline — writing thorough tests, mocking external services, and maintaining CI/CD pipelines

Bonus Points

Prior experience with speech/NLP systems: ASR, TTS, or machine translation

Experience with ML model serving infrastructure (Triton, TorchServe, or similar)

Familiarity with LLM orchestration for structured output and multi-step agent workflows

Experience with real-time streaming — WebSocket, WebRTC, or Server-Sent Events in production

Familiarity with Indic languages and the nuances of multilingual content (code-mixing, transliteration, regional dialects)

Experience designing platform middleware — auth, billing, rate limiting, or multi-tenant isolation

Experience with observability tooling — Prometheus, Grafana, OpenTelemetry, or similar stacks

Familiarity with video processing pipelines and media localization workflows

Contributions to open-source backend, audio, or NLP projects

Why Sarvam?

Sarvam is a fast-moving, high talent-density team building full-stack AI for India, working on problems that push the frontiers of AI with real population-scale impact.

Work alongside researchers, engineers, builders, and business leaders who move fast and hold each other to a very high bar

High ownership and high impact, from day one

Everything we do is AI-first, from the way we build and ship to the way we think about problems

You can work on problems that could change how an entire country learns, works, and communicates

If you want to work on problems at the frontier of AI in India, Sarvam is the place to be.

Questions about this role

  • How do I apply to this Backend Engineer - Studio Media Platform role at Sarvam?

    Click "Apply with AI Applyd" above. We auto-fill the application from your resume and answer screening questions in seconds. No copy and paste, no juggling tabs.

  • What's the typical salary for Backend Engineer in India?

    Compensation for Backend Engineer roles in India varies widely by seniority, employer size, and remote vs onsite arrangement. Check the salary range on this listing when published, or browse our Backend Engineer hub for India medians across recent openings.

  • How fast does AI Applyd auto-apply?

    Most applications complete in under 90 seconds. You can track the status in your dashboard and watch the screenshot proof land the moment the application submits.

  • What ATS does Sarvam use?

    AI Applyd supports Greenhouse, Lever, Ashby, Workday, iCIMS, SmartRecruiters, LinkedIn Easy Apply, and most other ATS platforms. If we can submit through the platform, we do.

Want AI Applyd to auto-apply to roles like this?

We tailor your resume per posting, fill the forms, and track replies for you.