Apply Now: Site Reliability Engineer

CAPTCHA code Image not clear?Get new image

Job Details

Site Reliability Engineer


Site Reliability Engineer


Our client's product is changing the Cloud security game. They are doing it with Cloud Native applications that deploy continuously, bringing value to their customers at a lightning pace. This requires SRE excellence in all stages of development and deployment through to feedback and improvement.

If you have the skills, experience, and passion for leading SRE activities, then this is the position for you.

         SRE Passion is automating everything, removing the toil, and seeing the entire world in code.

         SRE Passion is feeling the energy of enabling development teams to safely go faster.

         SRE Passion is loving Cloud technology and how it enables acceleration.

         SRE Passion is incomplete without security and compliance - DevSecOps.

         SRE Passion is solving complex problems with simple solutions.

         SRE Passion is being on a continuous path for improvement.

They are are looking for someone who:

         Has the SRE Passion

         Has the proven skills in handling incidents, with a customer first focus

         Measures extensively and creatively applies improvements

         Has a strong focus on security and compliance within a cloud environment

         Has a proven track record for influencing a group of individuals with the same passion, both in SRE and operations

         Has exceptional prioritization skills, including transparently communicating and justifying time investments

         Effectively collaborates with designers, developers, architects and operations, about SRE evolutions

         Writes code, regularly

         Embraces and presents new technologies and processes with the goals of learning and improving service quality while reducing costs

Duties and Responsibilities:

         Design and create automations that remove toil from the operational requirements of the services while enabling an increase of velocity for service teams

         Influence teams through SRE improvements, compliance audits, disaster recovery simulations and operational incidents

         Contribute to the enhancement of tools and practices used within the business units, keeping a keen eye on customer outcomes

         Available for off-hours on-call rotation 25%

To be successful with this role, the following skills are needed:

         Proven SRE experience utilizing SLO/SLIs in a commercial, cloud-based service

         At least four years of experience designing and implementing software with high-level languages including Python or NodeJS

         At least two years of experience in cloud computing with a preference to Amazon AWS

         Proven experience in a high impact team in both development and operations activities

         Familiarity with secure coding practices and compliance adherence within a cloud environment including PCI, ISO27001 and SOC2

         Familiarity with FedRAMP a plus

         Proven experience in developing infrastructure as code using technologies like CloudFormation, Terraform and Serverless

         Hands on infrastructure, systems and application architecture experience in large scale, web-based applications

         Experience with testing and managing high availability environments

         Excellent communication, interpersonal, and influencing skills in a cross functional role

         Bachelor of Science degree in related discipline nice to have, advanced degree desirable

         Must be a naturalized US citizen or permanent resident due to data sovereignty requirements