Omni-R1

Reinforcement learning for omnimodal reasoning via two-system collaboration

Omni-R1 explores reinforcement learning for omnimodal reasoning under a two-system collaboration framework. I contributed to the multimodal RL pipeline, where I implemented training components and reward shaping.

Venue: NeurIPS 2025.
