This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Sr. AI Data Engineer based in United States.
This role operates at the intersection of data engineering and machine learning systems, building the foundational pipelines that power next-generation generative AI models. You will design and scale complex, AI-augmented data workflows that process billions of images and integrate model-driven enrichment at every stage. The position requires deep expertise in distributed systems, data pipelines, and ML inference orchestration in high-scale environments. You will work on systems that combine traditional SQL-based transformations with real-time model invocations, ensuring quality, reliability, and performance. A key focus of the role is enabling high-quality training datasets for image generation models, directly influencing model performance across multiple dimensions. You will collaborate closely with ML researchers and engineers in a fast-paced, research-driven environment. This is a highly technical and impactful role shaping the future of generative AI infrastructure.