Skip to main content

Job Description

   Back

Sr. Dev Ops Engineer

18-12-2024 16:01:10

4 - 8 years

  • Pune, Maharashtra, India (PUN)

Responsibilities:

Design, implement, and maintain scalable and reliable infrastructure solutions to support our applications and services.

Monitor the health and performance of systems and services using Datadog, Sumo Logic, Grafana, and other monitoring tools. 

Proficiency in Terraform

Address issues, and implement preventive measures to minimize downtime and service disruptions.

Automate repetitive tasks, streamline operational workflows, and improve efficiency through infrastructure as code (IaC) and automation tools.

Collaborate with development teams to ensure that applications are designed and implemented with reliability and scalability in mind.

Participate in on-call rotation and respond to incidents and emergencies in a timely manner, effectively triaging and resolving issues to minimize impact on customers.

Conduct post-incident reviews, analyse root causes, and implement corrective actions to prevent recurrence.

Continuously evaluate and improve monitoring, alerting, and logging solutions to enhance visibility into system behaviour and performance.

Stay informed of industry trends and best practices in site reliability engineering and contribute to the adoption of new technologies and methodologies.