Back to jobsJob overview

About the role

Senior Software Engineer at Microsoft

Required Skills

pythonc++ai/hpccloud infrastructurehigh-speed networkshpc storagevirtualizationcontainer technologiesgpu computing

About the Role

Senior Software Engineer role on Microsoft's Azure AI and HPC team, developing tools and systems to support large-scale supercomputing deployments in the cloud. Responsibilities include debugging HPC systems, ensuring platform performance and scalability, and contributing to architectural decisions for cloud-native supercomputers.

Key Responsibilities

  • Build tools and analyze system metrics to debug HPC system issues
  • Partner with customers and teams to drive solutions for operating supercomputers in cloud
  • Ensure Azure platform performance, scalability, and reliability for customer workloads
  • Contribute to test-driven engineering culture to reduce regressions and bugs
  • Be part of systems management team focused on operational excellence

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related field AND 4+ years technical engineering experience with coding (C, C++, C#, Java, JavaScript, Python) OR equivalent experience
  • 3+ years experience operating AI/HPC systems, developing/running AI/HPC applications on clusters, or operating Cloud Infrastructure
  • 2+ years specialized experience with AI/HPC system management OR High-Speed Networks OR HPC Storage OR managing Cloud Infrastructure
  • Ability to pass Microsoft Cloud Background Check upon hire and every two years

Nice to Have:

  • Bachelor's Degree AND 8+ years technical engineering experience OR Master's Degree AND 6+ years experience
  • 1+ year experience running/troubleshooting machine learning workloads on GPU-based HPC systems
  • 1+ year experience with cloud computing, virtualization, and container technologies

Benefits & Perks

  • Industry leading healthcare