Software Reliability Engineer II at Microsoft
Required Skills
Azure, cloud computing, distributed systems, automation, C#, Java, C/C++, networking, virtualization
About the Role
Software Reliability Engineer II role focused on improving reliability, performance, and scalability of Azure Edge services. Responsibilities include automation, incident response, telemetry analysis, and on-call rotations to maintain live site operations.
Key Responsibilities
- Acts as Designated Responsible Individual (DRI) for on-call monitoring and incident response
- Develops automation tools and scripts to reduce operational burden
- Analyzes telemetry data to identify patterns and drive continuous improvement
- Maintains communication with partners across the Microsoft ecosystem
- Participates in incident response and writes postmortem reports
Required Skills & Qualifications
Must Have:
- 4+ years of experience in Software Development/SRE
- Bachelor's or Master's degree in Computer Science or a related field
- Strong Computer Science background with C#, Java, C/C++ programming
- Debugging skills
Nice to Have:
- Experience with AI/ML and LLMs
- Knowledge of Microsoft Azure, AWS or similar cloud platforms
- Prior experience working with hyperconverged infrastructure
- Prior experience working with Fortune 500 customers
Benefits & Perks
- Industry-leading healthcare