ByteDance’s Seed team has open-sourced VeOmni, a PyTorch-based framework that helps train
AI models using text, images, audio, and video. Unlike conventional systems that mix model logic with parallel code, VeOmni separates the two, making it easier to add new modalities and scale training efficiently.