Posted:
March 02, 2026
Location:
Mumbai, India
Job Description
We are seeking a highly skilled Site Reliability Engineer (SRE) with strong experience in Kubernetes troubleshooting, incident response, and deep knowledge of monitoring and alerting systems, along with solid experience in CI/CD pipeline design and maintenance. You will play a key role in building and maintaining reliable infrastructure, enhancing observability, and ensuring uptime for mission-critical systems.
**In this role, you will…**
+ Diagnose and resolve issues in Kubernetes clusters, including deployments, pod failures, networking issues, and autoscaling.
+ Lead incident management efforts including on-call response, root cause analysis, and continuous improvement of incident playbooks.
+ Design and maintain monitoring, logging, and alerting systems using tools such as Prometheus, Grafana, and ELK (Elasticsearch, Logstash, Kibana).
+ Set up and manage Kibana dashboards and maintain the ELK stack to ensure high availability and performance of logging i...
Apply for this Job
Submit your application for the Cloud Site Reliability Engineer position at Cornerstone onDemand.
Job Overview
Job Type:
Full-time
Location:
Mumbai, India
Posted:
March 02, 2026
Deadline:
March 08, 2026