As a Site Reliability Engineer, you will work as a member of a Cloud Operations team, collaborating with other engineers to release world-class cloud solutions for use by IT professionals.
You will be expected to take ownership of reliability, scalability, automation and other issues related to the function, performance and availability of these solutions.
You're right for the job if you're comfortable demonstrating a deep technical understanding of the Microsoft Azure platform, networking topics and distributed architectures.
The goal of a Site Reliability Engineer is to build, scale and guard the systems that our customers use to manage their environments.
To do so, you will need strong skills in the following areas:

Responsibilities:
· Collaborate with engineers to deliver solutions, working through the entire development lifecycle to release to Production
· Participate in conversations and provide technical solutions on escalated production, performance and availability issues
· Engage with other Site Reliability Engineers and Operations Engineers to discuss and define best practices
· Work closely with the Development Team to keep critical technical issues in the product backlog updated
· Work closely with the Cloud SecOps Team to keep the development pipeline compliant with security policies, standards and procedures
· Ensure that proper telemetry, logging and metrics are implemented to measure service health and conform to error budgeting
· Build architecture and operations tools to run, validate and analyze multiple enterprise-level SaaS services
· Engender reliability and availability, starting with metrics and measurements
· Build tools and automation to prevent recurrence of problems in mission-critical products and services
· Practice sustainable incident response and participate in peer reviews and blameless postmortems

Job Requirements:
· Three or more (3+) years' experience in a Site Reliability or Operations role working on a SaaS product running on Microsoft Azure
· Experience with performance tuning (applications, SQL servers, etc.)
· Experience with enterprise monitoring solutions (e.g., AppDynamics, New Relic, Application Insights, Splunk)
· Experience with infrastructure automation and monitoring tools (e.g., SaltStack, Kubernetes, Terraform, Ansible, Containers, Docker, Puppet, Chef)
· Familiarity with continuous integration and deployment processes and tools (e.g., VSTS, Jenkins, Maven, Nexus)

Education:
· Bachelor of Science degree in Computer Science or a related Engineering field preferred

Apply now! We would love to get to know you.