Shangzhe Di

Hi, I am a first-year PhD student at Shanghai Jiao Tong University (SJTU) under the guidance of Prof. Weidi Xie. My research focuses on video understanding and multimodal learning.

Prior to joining SJTU, I completed my master's and bachelor's degrees at Beihang University (BUAA). Under the supervision of Prof. Si Liu, I explored video BGM generation and visual object tracking.

Email  /  CV  /  Github  /  Google Scholar

profile photo
Education

  • PhD Student, Shanghai Jiao Tong University, Apr. 2023 -
  • M.Eng. in Computer Science, Beihang University, Sep. 2020 - Jan. 2023
  • Exchange Student, Technical University of Munich, Apr. 2019 - Aug. 2019
  • B.Eng. in Software Engineering, Beihang University, Sep. 2016 - Jun. 2020

  • Research
    Grounded Question-Answering in Long Egocentric Videos
    Shangzhe Di, Weidi Xie
    Technical Report, 2023.
    paper / project page

    Simultaneous query grounding and answering in long, egocentric videos.

    Linker: Learning Long Short-term Associations for Robust Visual Tracking
    Zizheng Xun*, Shangzhe Di*, Yulu Gao, Zongheng Tang, Gang Wang, Si Liu, Bo Li
    IEEE Transactions on Multimedia (TMM), 2023.
    paper


    Video Background Music Generation with Controllable Music Transformer
    Shangzhe Di*, Zeren Jiang*, Si Liu, Zhaokai Wang, Leyan Zhu, Zexin He, Hongming Liu, Shuicheng Yan
    ACM MM, 2021. (Best Paper Award)
    paper / project page / code / colab notebook / bibtex

    The first satisfying method for video background music generation.

    Honors and Awards

  • Best Paper Award, ACM MM 2021
  • Best Video Award, IJCAI 2021 Video Competition
  • First Prize Scholarship x 2 (Top 10%), Beihang University, 2019 & 2021
  • Full Scholarship for Exchange Program, China Scholarship Council, 2019
  • Special Prize Scholarship (Top 3%), Beihang University, 2018



  • The website template is borrowed from here.