Back to jobsJob overview
About the role
Software Engineer II at Microsoft
Required Skills
c++pythonai infrastructurevirtualizationdistributed systemshigh performance computingperformance analysisgpuoperating systems
About the Role
Software Engineer II role in Azure HPC/AI team, focusing on designing and delivering next-gen AI training platforms. Responsibilities include hardware/software interactions, device virtualization, and performance analysis of GPU workloads in VMs. This position impacts Microsoft's AI infrastructure and initiatives.Key Responsibilities
- Analyzes functionality, integration, and performance issues at hardware/software stack levels
- Designs and codes solutions to improve correctness, stability, and performance of AI training VMs
- Optimizes, debugs, refactors, and reuses code for performance and maintainability
- Acts as Designated Responsible Individual (DRI) and collaborates on-call for system monitoring
- Develops playbooks and maintains communication with partners across Microsoft ecosystem
Required Skills & Qualifications
Must Have:
- Bachelor's Degree in Computer Science or related technical field OR equivalent experience
- Technical engineering experience with coding in languages like C, C++, C#, Java, JavaScript, or Python
- Experience in HPC or Machine Learning
- Ability to pass Microsoft Cloud Background Check security screening
Nice to Have:
- Master's Degree in Computer Science or related technical field OR equivalent experience
- Familiarity with Machine Learning, AI Infrastructure, Operating Systems fundamentals, virtualization, Accelerators
- Experience on Distributed Systems, HPC/ML middleware, Co-Designing Hardware-Software, Profiling and Performance Analysis Tools
Benefits & Perks
- Industry leading healthcare