Omni-R1
reinforcement learning for omnimodal reasoning via two-system collaboration
Omni-R1 explores reinforcement learning for omnimodal reasoning under a two-system collaboration framework. I contributed to the multimodal RL pipeline, implementing training components and reward shaping for reasoning over multiple modalities.
Venue: NeurIPS 2025.