Back to jobsJob overview
About the role
Senior Software Engineer at Microsoft
Required Skills
pythondistributed systemsflinkkafkakubernetesgpu inferencetritononnxopentelemetry
About the Role
Senior Software Engineer role focused on building real-time data pipelines and GPU inference serving systems for large-scale ML models. Responsibilities include designing ETL pipelines, optimizing performance, and collaborating with applied scientists. Requires extensive experience with distributed systems, streaming frameworks, and GPU inference technologies.Key Responsibilities
- Design & code real-time ETL/feature pipelines (e.g., Flink or Spark Structured Streaming)
- Define and meet SLOs with OpenTelemetry/Prometheus/Grafana for metrics, tracing, and alerting
- Implement robust queuing/streaming with Kafka/Pulsar
- Optimize GPU inference services on Triton Inference Server (or ONNX Runtime/TensorRT)
- Profile & optimize end-to-end: CPU/GPU utilization, memory, I/O, vectorization, caching
Required Skills & Qualifications
Must Have:
- Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including C, C++, C#, Java, JavaScript, or Python OR equivalent experience
- 6+ years building distributed systems in production
- Production experience with streaming frameworks (Flink or Spark) and messaging (Kafka)
- Hands-on with Kubernetes and containers; comfort with service ops (logs, metrics, scaling)
- Practical experience with GPU inference on Triton or ONNX Runtime/TensorRT (model packaging, runtime tuning, batching)
Nice to Have:
- Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience
- Experience with real-time feature stores or embedding pipelines
- Prior contributions to GPU batching, dynamic scheduling, or multi-model serving
Benefits & Perks
- Industry leading healthcare