Site Reliability Engineer (AWS & Kubernetes)

Posted:
6/2/2026, 5:00:00 PM

Location(s):
Bengaluru, Karnataka, India ⋅ Haryana, India ⋅ Tamil Nadu, India ⋅ Karnataka, India ⋅ Chennai, Tamil Nadu, India ⋅ Gurgaon, Haryana, India

Experience Level(s):
Junior ⋅ Mid Level ⋅ Senior

Field(s):
DevOps & Infrastructure ⋅ Software Engineering

Join us as a Site Reliability Engineer

In this key role, you’ll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
You’ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
We're offering this role at associate level

What you'll do

As our Site Reliability Engineer, As our Site Reliability Engineer, you’ll contribute to the reliability, monitoring and operational excellence of cloud-native platforms.

You’ll work closely with senior engineers to support production systems, implement SRE practices, and ensure services are observable, scalable and resilient. You’ll also participate in the 24/7 support and on-call rotation, gaining experience in incident response and platform operations.

You'll also be:

Supporting the operation of AWS-based Kubernetes platforms (EKS)
Contributing to monitoring, alerting and observability implementations using tools like Grafana and Prometheus
Assisting in incident management, troubleshooting and root cause analysis
Participating in on-call rotations and production support activities
Implementing infrastructure changes using Terraform and GitOps workflows
Supporting CI/CD pipelines (GitLab, Argo CD) and deployment processes
Helping improve system reliability through automation and operational improvements
Following SRE practices such as runbooks, documentation and post-incident reviews
Working with DevOps and engineering teams to improve system performance and stability
Ensuring solutions align with security, compliance and operational standards

The skills you'll need

We’re looking for an engineer with solid foundational experience in cloud platforms and a keen interest in reliability engineering and production operations.

You'll also need:

Experience working with AWS and Kubernetes (EKS) in a production or pre-production environment
Familiarity with monitoring and observability tools such as Grafana and Prometheus
Understanding of CI/CD pipelines and Git-based workflows (GitLab preferred)
Exposure to Terraform or infrastructure-as-code concepts
Basic understanding of SRE practices and production support models
Experience troubleshooting applications or infrastructure issues
Awareness of networking and security fundamentals in cloud environments
Willingness to participate in on-call rotations and incident response
Strong problem-solving mindset and eagerness to learn
Good communication and collaboration skills

Hours

Job Posting Closing Date:

16/06/2026

Notify

postings

pricing

login

Site Reliability Engineer (AWS & Kubernetes)

What you'll do

The skills you'll need

Digi Ventures Ltd

Related Postings

Senior Staff Engineer, DevAI

Senior Security Engineer

Software Engineer

Sr. System Engineer- ArcGIS

Lead Software Engineer - C#

Notify

postings

our prices

login

contact us

privacy policy