DevOps / Site Reliability Engineer (SRE)
Skills
About the role
Job Title: DevOps / Site Reliability Engineer (SRE)
Location: Coimbatore, Tamil Nadu (Hybrid/Remote)
Job Type: Full-time
About the Role
We are looking for a high-performance DevOps / Site Reliability Engineer (SRE) to own the stability, deployment, and performance scaling of our real-time, low-latency meta-dispatch kernel. Unlike typical cloud-only roles, this position bridges elite software engineering with bare-metal Linux infrastructure management.
You will work directly with our core architecture team to ensure our concurrent Go ingestion layers, Python heuristic engines, and real-time gRPC communication pipelines operate with deterministic microsecond latency. You will design, implement, and maintain the infrastructure that keeps hundreds of high-throughput mobile and aerial assets synchronized across Country.
Key Responsibilities
Infrastructure Ownership: Configure, optimize, and maintain our enterprise bare-metal Dell PowerEdge server environment running high-density Linux host distributions.
Kernel & Network Tuning: Maximize packet-processing throughput by implementing advanced Linux system configurations, including CPU core isolation (isolcpus), interrupt affinity, socket re-use parameters (SO_REUSEPORT), and 1GB HugePages allocation.
SRE Framework Implementation: Codify the reliability of our network core. Define and monitor precise Service Level Indicators (SLIs) and Service Level Objectives (SLOs) around memory allocation boundaries, network saturation, and gRPC payload latency.
CI/CD Pipeline Architecture: Build and automate robust deployment pipelines that securely compile cross-platform, statically linked Go binaries and containerized Python workloads.
Observability & Monitoring: Design and scale high-fidelity telemetry dashboards monitoring the "Four Golden Signals" (Latency, Traffic, Errors, Saturation) to proactively mitigate performance degradation.
Security & Fail-safe Engineering: Implement and maintain mutual TLS (mTLS) cryptographic handshakes across public wireless networks and manage root security permissions within Linux systemd service units.
Required Technical Skills
Systems & Infrastructure: 3+ years of experience in Linux System Administration managing dedicated bare-metal servers (compute sharding, hardware offloading, storage arrays).
Programming/Scripting: Proficiency in Go (Golang) and Python for writing automation scripts, monitoring tools, and understanding low-level execution paths.
Networking Protocols: Deep understanding of high-concurrency network architectures, specifically handling low-level UDP sockets, TCP, gRPC, and protocol buffer serialization.
Process Management: Strong experience deploying, performance-capping, and securing system services via Linux systemd.
Databases: Hands-on experience scaling and managing high-frequency write operations inside MongoDB or PostgreSQL.
Preferred Qualifications
Familiarity with spatial indexing libraries (specifically the Uber H3 spatial grid system or PostGIS).
Experience configuring network elements over commercial 5G/LTE cellular backhauls or Machine-to-Machine (M2M) SIM communication channels.
A strong background in blameless post-mortem operational cultures and automated toil reduction.
What We Offer
The opportunity to work on a cutting-edge, high-impact proprietary kernel platform.
A tech-first environment where performance engineering takes priority over boilerplate cloud configuration.
Competitive compensation and growth opportunities within a fast-scaling venture.
Pay: ₹160,446.92 - ₹567,232.34 per year
Benefits:
Flexible schedule
Internet reimbursement
Work from home
Work Location: Remote
Questions about this role
How do I apply to this DevOps / Site Reliability Engineer (SRE) role at Garuda Spacex Technologies?
Click "Apply with AI Applyd" above. We auto-fill the application from your resume and answer screening questions in seconds. No copy and paste, no juggling tabs.
What's the typical salary for DevOps / SRE in your country?
Compensation varies by seniority, employer size, and location. When this listing publishes a salary band you'll see it in the badge row above the description.
How fast does AI Applyd auto-apply?
Most applications complete in under 90 seconds. You can track the status in your dashboard and watch the screenshot proof land the moment the application submits.
What ATS does Garuda Spacex Technologies use?
AI Applyd supports Greenhouse, Lever, Ashby, Workday, iCIMS, SmartRecruiters, LinkedIn Easy Apply, and most other ATS platforms. If we can submit through the platform, we do.
Want AI Applyd to auto-apply to roles like this?
We tailor your resume per posting, fill the forms, and track replies for you.