Back to jobsJob overview
About the role
Principal Software Engineer - Azure Core at Microsoft
Required Skills
linuxkubernetesai infrastructuregopythondata center managementhigh-performance computingdistributed storage
About the Role
Principal Software Engineer role in Azure Core, focusing on building and managing the core platform for Microsoft's cloud services. Responsibilities include designing hyperscale infrastructure, working with AI data centers, and leading technical projects. Requires expertise in Linux, Kubernetes, AI infrastructure, and data center management.Key Responsibilities
- Partners with stakeholders to determine user requirements for complex scenarios
- Provides technical leadership for design documents and dependencies
- Leads by example and mentors others to produce maintainable code
- Leads project plans and work items across multiple products
- Acts as Designated Responsible Individual (DRI) for system monitoring and mentoring
Required Skills & Qualifications
Must Have:
- Bachelor's Degree in Computer Science or related field AND 8+ years technical engineering experience with coding in Go, Rust, Bash, or Python OR equivalent experience
- 5+ years experience building and managing data centers
- Networking and security expertise in high-performance computing, RDMA over InfiniBand or RoCE, and eBPF
- Experience with storage and acceleration technologies for AI workloads, including distributed storage systems
Nice to Have:
- 1+ years experience with AI/ML job scheduling and orchestration at scale using SLURM, Ray, or Kueue
- Model training optimization for performance and scalability
- 1+ years experience improving model serving and inference efficiency for low latency and high throughput
Benefits & Perks
- Industry leading healthcare