Back to jobsJob overview

About the role

Principal Software Engineer at Microsoft

Required Skills

pythonc++ai/hpccloud infrastructurehigh-speed networkshpc storagecontainer technologiessystem management

About the Role

Principal Software Engineer role at Microsoft Azure AI/HPC team focused on building tools for supercomputing reliability and performance. Responsibilities include system management, architectural changes, and influencing software/hardware roadmaps. Requires expertise in AI/HPC systems, cloud infrastructure, and coding in languages like C++ or Python.

Key Responsibilities

  • Partners with stakeholders to determine user requirements
  • Leads development of design documents for products/services
  • Mentors others to produce maintainable code
  • Drives project plans and work items across groups
  • Acts as Designated Responsible Individual (DRI) for system monitoring

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related field AND 6+ years technical engineering experience with coding (C, C++, C#, Java, JavaScript, Python) OR equivalent experience
  • 5+ years experience operating AI/HPC systems or developing/running AI/HPC applications on clusters or operating Cloud Infrastructure
  • 3+ years specialized experience with AI/HPC system management OR High-Speed Networks OR HPC Storage OR managing Cloud Infrastructure
  • Ability to pass Microsoft Cloud Background Check

Nice to Have:

  • Bachelor's Degree in Computer Science AND 10+ years technical engineering experience OR Master's Degree AND 8+ years experience
  • 1+ years operational experience running large scale HPC systems in Cloud environments
  • 1+ years experience running/troubleshooting machine learning workloads on GPU-based HPC systems
  • Expertise in Cloud Computing, Virtualization and Container Technologies
  • Familiarity with HPC software stack

Benefits & Perks

  • Industry leading healthcare