Back to jobsJob overview

About the role

Software Reliability Engineer II at Microsoft

Required Skills

azurecloud computingdistributed systemsautomationc#javac/c++networkingvirtualization

About the Role

Software Reliability Engineer II role focused on improving reliability, performance, and scalability of Azure Edge services. Responsibilities include automation, incident response, telemetry analysis, and on-call rotations to maintain live site operations.

Key Responsibilities

  • Acts as Designated Responsible Individual (DRI) for on-call monitoring and incident response
  • Develops automation tools and scripts to reduce operational burden
  • Analyzes telemetry data to identify patterns and drive continuous improvement
  • Maintains communication with partners across Microsoft ecosystem
  • Participates in incident response and writes postmortem reports

Required Skills & Qualifications

Must Have:

  • 4+ years of experience in Software Development/SRE
  • Bachelor's/master's degree in Computer Science or related field
  • Strong Computer Science background with C#, Java, C/C++ programming
  • Debugging skills

Nice to Have:

  • Experience with AI/ML and LLMs
  • Knowledge of Microsoft Azure, AWS or similar cloud platforms
  • Prior experience working in hyperconverged infrastructure
  • Prior experience working with Fortune 500 customers

Benefits & Perks

  • Industry leading healthcare