LingBot-World · 完整推理 Data Flow & Function Call 图

从首帧图像 → 文本编码 → 相机条件 → 扩散去噪循环 → 视频输出与行动解释（480P · 81帧示例）

← 架构总图