Site Reliability Engineer-Cloud

ATC Search, Santa Clara-Bay Area

Site Reliability Engineer: Join a rapidly growing team building the next-generation cloud infrastructure service, with a focus on enterprise workloads.
In this visible role, you will be the seasoned systems operations expert leading initiatives focused on systems infrastructure management in a high-volume, fast-scaling environment.

Responsibilities




● You will be responsible for systems deployment, operations, and monitoring of infrastructure, including the design and development of infrastructure automation.




● You will get your hands dirty troubleshooting infrastructure and architectural challenges using your existing knowledge and toolkits.




● You will drive the reliability and supportability of the cloud service by creating a knowledge base and, working with DevOps, coordinating change management policies, deploying a ticket/incident management system, and setting up service request queue triage and auto-remediation.




● You will apply your advanced system architecture and administration skills in collaboration with engineering, product management, test, and automation teams to architect and develop strategic and tactical solutions.




● You will engage with suppliers on infrastructure equipment procurement, technical design exercises, and supplier roadmap reviews.




● You will help develop requirements for customer onboarding processes, target environment sizing, and migration automation.