Back to jobsJob overview

About the role

Senior Site Reliability Engineer at Microsoft

Required Skills

site reliability engineeringazurecloud servicesinfrastructuremonitoringautomationsecuritynetworkinggpu

About the Role

Senior Site Reliability Engineer role in Azure Specialized team, focusing on designing, developing, deploying, and monitoring product features and infrastructure for Azure workloads. Involves working across control and data plane technologies, service architecture, datacenter networking, and security while collaborating with partner teams.

Key Responsibilities

  • Acts as Designated Responsible Individual (DRI) on call to monitor service and respond to issues within SLA
  • Contributes to data collection, classification, and analysis to refine product features and inform decisions
  • Develops automation for production and deployment of complex product features
  • Ensures compliance with security, privacy, safety, and accessibility processes
  • Maintains operations of live service, implements solutions for issues, and writes postmortems

Required Skills & Qualifications

Must Have:

  • Master's Degree in Computer Science/IT + 2+ years technical experience OR Bachelor's + 4+ years OR equivalent experience
  • 1+ years experience with support of physical infrastructure
  • 1+ years experience with GPU and/or Infiniband support
  • Ability to pass Microsoft Cloud Background Check upon hire and every two years

Nice to Have:

  • 7+ years technical experience in software engineering, network engineering, or systems administration
  • Bachelor's Degree in Computer Science/IT + 4+ years technical experience
  • Master's Degree in Computer Science/IT + 3+ years technical experience

Benefits & Perks

  • Industry leading healthcare