publications
Publications and preprints.
2026
- CVPRPreserving Source Video Realism: High-Fidelity Face Swapping for Cinematic QualityIn IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2026
- ICMLACTIVE-O3: Empowering MLLMs with Active Perception via Pure Reinforcement LearningIn International Conference on Machine Learning, 2026
- ECCVMetric-Bench: Exploring In-context Spatial Metric Reasoning in VLMs for Indoor Scenes2026Under review
2025
- NeurIPSOmni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System CollaborationIn Advances in Neural Information Processing Systems, 2025Equal contribution
- PreprintGAE: Unleashing Physical Potential of VLM with Generalizable Action Expert2025
- PreprintNoTVLA: Narrowing of Dense Action Trajectories for Generalizable Robot Manipulation2025