Advertisement
Job Description:
Design, code, test, and deliver software to automate manual operational work
Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
Engage with development team throughout the life cycle to help develop software for reliability and scale, ensuring minimal refactoring or changes
Identify application patterns and analytics in support of better service level objectives
Design self-healing and resiliency patterns
Design self-healing and resiliency patterns
Design performance tests, identify bottlenecks and opportunities for optimization and capacity demands, and present solutions for continuous improvements
Design best in class monitoring frameworks to accomplish end-to-end flow monitoring and noiseless alerting
Design automated software and product upgrades, change management, and release management solutions
Coach or manage teams as applicable
Participate in the support coverage including weekends as needed
About Company:



