Senior Software Engineer at Microsoft
Required Skills
Python, C++, AI/HPC, cloud infrastructure, high-speed networks, HPC storage, virtualization, container technologies, GPU computing
About the Role
Senior Software Engineer role on Microsoft's Azure AI and HPC team, developing tools and systems to support large-scale supercomputing deployments in the cloud. Responsibilities include debugging HPC systems, ensuring platform performance and scalability, and contributing to architectural decisions for cloud-native supercomputers.
Key Responsibilities
- Build tools and analyze system metrics to debug HPC system issues
- Partner with customers and teams to drive solutions for operating supercomputers in the cloud
- Ensure Azure platform performance, scalability, and reliability for customer workloads
- Contribute to a test-driven engineering culture to reduce regressions and bugs
- Be part of the systems management team focused on operational excellence
Required Skills & Qualifications
Must Have:
- Bachelor's Degree in Computer Science or related field AND 4+ years technical engineering experience with coding (C, C++, C#, Java, JavaScript, Python) OR equivalent experience
- 3+ years experience operating AI/HPC systems, developing/running AI/HPC applications on clusters, or operating Cloud Infrastructure
- 2+ years specialized experience with AI/HPC system management OR High-Speed Networks OR HPC Storage OR managing Cloud Infrastructure
- Ability to pass Microsoft Cloud Background Check upon hire and every two years
Nice to Have:
- Bachelor's Degree AND 8+ years technical engineering experience OR Master's Degree AND 6+ years experience
- 1+ year experience running/troubleshooting machine learning workloads on GPU-based HPC systems
- 1+ year experience with cloud computing, virtualization, and container technologies
Benefits & Perks
- Industry-leading healthcare