Software Engineer
Requisition #: CREQ233818
We are seeking a skilled and proactive Site Reliability Engineer (SRE) to join our growing engineering team. The SRE will be responsible for ensuring the availability, performance, scalability, and reliability of our production systems. You will work at the intersection of software development and operations, driving best practices in observability, automation, and incident response.
Key Responsibilities:
Design, build, and maintain scalable and resilient infrastructure using cloud-native technologies (e.g., AWS, GCP, Azure).
Develop automated solutions for system provisioning, monitoring, deployments, and incident response.
Improve system observability using logging, monitoring, and alerting tools (e.g., Prometheus, Grafana, ELK, Datadog).
Collaborate with development teams to ensure reliable deployment pipelines (CI/CD) and service rollouts.
Conduct and participate in blameless postmortems and continuously improve system reliability.
Champion SRE principles such as service level indicators (SLIs), service level objectives (SLOs), and error budgets.
Maintain runbooks, architecture diagrams, and incident reports for transparency and consistency.
Ensure security and compliance standards are met in operational practices.
