Azure Reliability (in Azure Customer Experience) is transforming Microsoft's cloud services to meet the scale and reliability required to help Azure's customers achieve more. Our team designs and builds tools and processes to manage a more reliable infrastructure, with an emphasis on the physical layer (datacenter, server, network). We work across Microsoft, blending expertise in supply chain, hardware engineering, power, security, and site reliability. Our work enables the broader Azure organization to scale globally through the use of intelligent automation. Our distributed team has a presence in Redmond (WA), Sunnyvale (CA), and Atlanta (GA).
This role collaborates with teams responsible for datacenter, server, storage, cloud products, risk/change management, and incident management. You will partner with them to identify and integrate processes and instrumentation, and to specify or build automation for fleet operating tools and platforms. Your work will fundamentally improve the agility of Microsoft's cloud capacity and reduce production risk.
As a principal, you will need knowledge of non-abstract large system design using container-based solutions, hardware management processes, and inventory systems. The right candidate will have a history of partnering with a variety of engineering teams to develop horizontally scalable management systems.
Responsibilities:
* Defining and promoting secure and scalable engineering standards and processes.
* Partnering with internal teams on automation of deployment and risk forecasting
* Management of highly automated operational measurement systems
* Analyzing existing and custom datasets to support change management and the risk assessment of opportunities or suspected defects
* Design and implementation of systems to capture and integrate data from purchasing, deployment, repair, RMA, and EOS/EOL tools.
* Design and automation of systems to capture and integrate data from proprietary scanning and behavioral monitoring tools / agents.
* Automating support for monitoring and alerting of changes based on utilization, trends, planned maintenance, etc.
* Engaging and fostering opportunities to improve existing planning, processes, and automation.
Skills, Experience & Knowledge:
* Strong analytical skillset in the context of Cloud Capacity and Service Delivery
* Strong experience managing server or network components at scale
* Moderate to extensive experience with cloud datacenter components
* Previous role-specific experience organizing and presenting to executive leadership
Required Qualifications:
* BS/MS in Computer Science or related field, or equivalent industry experience.
* 7+ years software development experience.
* Experience with either Go or Rust as a primary language
* Prior experience with shipping cloud/network services at a cloud provider
* Experience with Linux and container deployments
Preferred Qualifications and Experience:
* Working knowledge of TLS and OAuth
* Experience with .NET is a plus, but not required
The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Redmond, WA
Microsoft Corporation develops, licenses, and supports software, services, devices, and solutions worldwide. The company’s Productivity and Business Processes segment offers Office 365 commercial products and services, such as Office, Exchange, SharePoint, Skype for Business, Microsoft Teams, and related Client Access Licenses (CALs); Office 365 consumer services, including Skype, Outlook.com, and OneDrive; LinkedIn online professional network; and Dynamics business solutions comprising financial management, enterprise resource planning, customer relationship management, supply chain management, and analytics applications for small and medium businesses, large organizations, and divisions of enterprises.
The company’s Intelligent Cloud segment licenses server products and cloud services, such as SQL Server, Windows Server, Visual Studio, System Center, and related CALs, as well as Azure, a cloud platform; and offers enterprise services, including premier support and Microsoft consulting services, to assist customers in developing, deploying, and managing Microsoft server and desktop solutions. It also provides training and certification to developers and IT professionals.
Its More Personal Computing segment offers Windows OEM, volume, and other non-volume licensing of the Windows operating system; patent licensing, Windows Internet of Things, and MSN display advertising; Surface, PC accessories, and other devices; Xbox hardware and software and services; and Bing and Bing Ads search advertising. It markets its products through original equipment manufacturers, distributors, and resellers; and online and Microsoft retail stores.
Microsoft Corporation collaborates with E.ON, NIIT Technologies Ltd., CUNA Mutual Group, and Mastercard Incorporated; has a strategic alliance with Nielsen Holdings plc and PAREXEL International Corp.; and has a strategic partnership with SK Telecom Co., Ltd. The company was founded in 1975 and is headquartered in Redmond, Washington.