publications

Publications and preprints.

2026

  1. CVPR
    Preserving Source Video Realism: High-Fidelity Face Swapping for Cinematic Quality
    Zekai Luo, Zongze Du, Zhouhang Zhu, and 7 more authors
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026
  2. ICML
    ACTIVE-O3: Empowering MLLMs with Active Perception via Pure Reinforcement Learning
    Muzhi Zhu, Hao Zhong, Canyu Zhao, and 9 more authors
    In International Conference on Machine Learning, 2026
  3. ECCV
    Metric-Bench: Exploring In-context Spatial Metric Reasoning in VLMs for Indoor Scenes
    Yuling Xi, Haokai Zhang, Muzhi Zhu, and 10 more authors
    2026
    Under review

2025

  1. NeurIPS
    Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
    Hao Zhong, Muzhi Zhu, Zongze Du, and 6 more authors
    In Advances in Neural Information Processing Systems, 2025
    Equal contribution
  2. Preprint
    GAE: Unleashing Physical Potential of VLM with Generalizable Action Expert
    Mingyu Liu, Zheng Huang, Xiaoyi Lin, and 7 more authors
    2025
  3. Preprint
    NoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation
    Zheng Huang, Mingyu Liu, Xiaoyi Lin, and 8 more authors
    2025