About Liquid AI
Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to on-device hardware, ensuring low latency, minimal memory usage, privacy, and reliability. We partner with enterprises across consumer electronics, automotive, life sciences, and financial services. We are scaling rapidly and need exceptional people to help us get there.
The Opportunity
Our Data team builds the pipelines that power Liquid Foundation Models across pre-training, vision, audio, and emerging modalities. We need engineers who can collect, filter, and synthesize high-quality datasets at scale for pretraining, midtraining, SFT, and preference optimization.
Depending on your background, you'll work on foundation model pre-training data, post-training RL, vision-language datasets, audio pipelines, RL environments, or multimodal data challenges. We'll match you to the team where you'll have the most impact.
While San Francisco and Boston are preferred, we are open to other locations.
What We're Looking For
We need someone who: