Site Reliability Engineer
We are looking for a developer with 5+ years of experience in AWS infrastructure, Terraform, and Site Reliability Engineering (SRE) to join our team. The ideal candidate will have a strong background in cloud infrastructure and Terraform-based automation, and a solid understanding of SRE practices, to help maintain and optimize system reliability, scalability, and performance. You will develop and maintain cloud-based solutions, contribute to infrastructure automation, and ensure the reliability of critical systems.
Design, develop, and maintain cloud infrastructure solutions using AWS services such as EC2, S3, RDS, Lambda, and VPC.
Implement and maintain infrastructure as code (IaC) using Terraform to provision, configure, and manage cloud resources.
Apply Site Reliability Engineering (SRE) principles to monitor, optimize, and improve the availability and reliability of cloud-based systems.
Automate and streamline operational tasks, including monitoring, logging, incident management, and alerting.
Contribute to the development and maintenance of CI/CD pipelines to automate deployments and improve operational efficiency.
Work closely with development teams to ensure that infrastructure aligns with application requirements and business needs.