Data Infrastructure

Microsoft Corporation
Full-time Multiple Locations, United States other-general
Posted:
June 13, 2026
Location:
Multiple Locations, United States, United States

Job Description

**Overview**

Help build the world’s most advanced multimodal dataset at Microsoft AI

We are on a mission to create the largest and most advanced multimodal dataset in the world. This dataset, spanning all modalities from across the web and beyond, will power the training of the world’s most capable AI frontier models, pushing the boundaries of scale, performance, and product deployment.

The AI Data Infra team at Microsoft AI is responsible for building data infrastructure to help MAI teams to generate the biggest and best training dataset. Our work involves data pipelines, Spark, Ray, Vector Databases, and all other aspects of data infra.

We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. In particular, we are looking for candidates who:

Are passionate about the role of data in large-scale AI model training

Will thrive in a highly collaborative, fast...

Apply for this Job

Submit your application for the Data Infrastructure position at Microsoft Corporation.

Apply Now Save for Later

Job Overview

Job Type: Full-time
Location: Multiple Locations, United States
Posted: June 13, 2026
Deadline: June 18, 2026