Apply

Apply Now: Site Reliability Engineer (Buffer)

 
Submit
CAPTCHA code Image not clear?Get new image

Job Details

Site Reliability Engineer (Buffer)

Dallas, TX, 75201

Job Description: Site Reliability Engineer (Buffer) 

• Bachelor's Degree in Computer Science or related; or equivalent combination of education and experience 

• 5~~@~~ yrs overall experience in Software Application Development & Engineering 

• 2~~@~~ years of SRE experience 

• 1~~@~~ yrs experience in AWS services 

• Experience in Typescript, NodeJs, and web development technologies 

• Proficient in scripting languages such as Powershell and/or Python 

• Knowledge of DevOps methodologies and the tools involved such as CI/CD concepts, CI/CD tools (Jenkins, CodePipeline, etc.), automation and config • Help build a Site Reliability Engineering culture by sharing best practices, approaches, documentation, and code with other engineering teams 

• Define and setup KPIs to monitor Error Budgets 

• Implement strategies to ensure Error Budgets stay above the defined-acceptance levels 

• Define and implement response mechanisms when Error Budget thresholds are breached 

• Apply automation and software to any tasks or parts of the system that would benefit from it or are performed manually; 

• Able to troubleshoot complicated issues handling OS, Networking, Database in a cloud-based SaaS environment and handle live production incidents, debug/troubleshoot infrastructure and application issues, including development and testing 

• Monitor application performance, take steps to improve overall application performance and stability and follow through with implementation (design, develop and test); 

• Conduct system analysis, configuration management and develops improvements for system software performance, availability and reliability; 

• Design, write, ship, and motivate the creation of software and systems to increase observability, product reliability and organizational efficiency; 

• Work closely with software engineers and QAs to ensure the system is responding properly to non-functional requirements such as performance, security, and availability; 

• Document your system knowledge as you acquire it over time, create runbooks, and ensure critical system information is readily available to those who need it; 

• Maintain and monitoring deployment, orchestration, of the servers, docker containers, databases, and general backend infrastructure; 

• Design, Develop & Test Terraform based Infrastructure as Code scripts to automate AWS infrastructure setup 

• Develop Typescript, NodeJS based REST/JSON Web Services deployed on AWS.


Compensation: 55-64.52 Hourly W2 (Open to C2C)