Software Engineer II at Microsoft
Required Skills
distributed systems, C++, Python, AI/ML, generative AI, agile methodologies, observability, cloud computing, Azure
About the Role
Software Engineer II role on the Azure Compute Availability Platform team, focusing on ensuring high SLA for Azure VMs through AI-driven monitoring and repair services. Responsibilities include designing hyperscale services, integrating predictive AI models, and collaborating with data scientists to enhance platform resilience and automation.
Key Responsibilities
- Partners with stakeholders to determine project requirements and leads design of change management features
- Develops high-quality, maintainable code and supports livesite as Designated Responsible Individual (DRI)
- Collaborates with data scientists to integrate predictive models for hardware anomaly detection and live migration
- Leads initiatives to embed AI-driven diagnostics and root cause analysis into availability services
- Evaluates and integrates large-scale AI models into control plane services for smarter repair decisions
Required Skills & Qualifications
Must Have:
- Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including C, Rust, C++, C#, Java, JavaScript, or Python OR equivalent experience
- Ability to meet Microsoft, customer and/or government security screening requirements, including Microsoft Cloud Background Check
Nice to Have:
- Master's Degree in Computer Science or related technical field AND technical engineering experience
- Ability and passion for designing and building highly available distributed systems at scale
- Ability to exercise sound judgment in ambiguous situations
- Experience with agile methodologies and willingness to adopt them
Benefits & Perks
- Industry-leading healthcare