Xu Gu




About

My name is Xu Gu. I am a PhD candidate at AIMind, Gaoling School of Artificial Intelligence (GSAI), Renmin University of China (RUC), advised by Prof. Ruihua Song.
I previously worked as a research intern at Shanghai AI Lab, and I am currently a visiting scholar at the National University of Singapore under the supervision of Prof. Roger Zimmermann, funded by the China Scholarship Council (CSC).
My research interests include multimodal large language models, video creation, storyboarding, and multimodal video understanding.



News



    • [08/2025] One paper accepted by PRCV 2025!
    • [08/2025] One paper accepted by ACM MM Workshop MUCG 2025!
    • [10/2024] One paper accepted by ACM MM Asia 2024!
    • [07/2023] One paper accepted by ACM Multimedia 2023!
    • [12/2022] Created my personal homepage!

Publications


All-in-One: Boosting Basic Capabilities in one Omni-MLLM to Enhance Movie Understanding
Shaojun Shi*, Yuchen Ren*, Xu Gu*, Wenhui Tan, Yuchong Sun, Jianghan Chao, Ruihua Song

Under Review.

A Mixture-of-Experts Framework based on Depth Images for Text to Video Storyboard Task
Xu Gu, Feiyue Ni, Ruihua Song

PRCV, 2025.

Towards Incremental Learning in Cross-Modal Text-Audio Retrieval
Xu Gu*, Yingfei Sun*, Wei Ji, Hanbin Zhao, Roger Zimmermann

Under Review.

ScaMo: Towards Text to Video Storyboard Generation Using Scale and Movement of Shots
Xu Gu, Xihua Wang, Chuhao Jin, Ruihua Song

ACM Multimedia Asia, 2024. [Paper] 

TeViS: Translating Text Synopses to Video Storyboards
Xu Gu*, Yuchong Sun*, Feiyue Ni, Shizhe Chen, Xihua Wang, Ruihua Song, Boyuan Li, Xiang Cao

ACM Multimedia, 2023. [Project]  [Paper]  [Code] 

(* denotes equal contribution)


Awards & Honors



    • 2024 SIGMM Student Travel Grant.
    • 2023 Third Prize of Outstanding Scholarship for Postgraduate Studies of Renmin University of China.
    • 2019 Merit Student of University of Chinese Academy of Sciences.
    • 2017 Merit Student of Henan Province.

Services



      ACM MM 2024, ACM MM 2025, CVPR 2025, ACL 2025, PRCV 2025, TOMM.