Shijie Wang (王世杰)
Ph.D. Student

Department of Computer Science, Brown University

Office: Room 453, 115 Waterman Street, Providence, RI 02912
Email: shijie_wang [AT] brown [DOT] edu (prior)        wang98thu [AT] gmail [DOT] com

[Google Scholar] [GitHub] [LinkdeIn] [Twitter]

Short bio | Education | News | Papers | Experience | Awards | Service

Short Bio

I am a fourth year CS Ph.D. student at Brown University working with Prof. Chen Sun. Previously, I also worked in Google Research and Google DeepMind as a student researcher. I obtained my bachelor degree in software engineering at Tsinghua University in 2021.

My research interest is multimodal learning and video understanding & generation. In detail, I'm especially interested in squeezing knowledge from large-scale foundation models for downstream multimodal tasks and structural concept learning for vision-language models. Feel free to contact me for collaborations and casual chats. I'm now looking for opportunities for 2025 summer!

Education

News

Papers

    [8] MotiF: Making Text Count in Image Animation with Motion Focal Loss
    Shijie Wang, Samaneh Azadi, Rohit Girdhar, Sai Saketh Rambhatla, Chen Sun, and Xi Yin
    Under Review

    [7] Learning Visual Grounding from Generative Vision and Language Model [Link]
    Shijie Wang, Dahun Kim, Ali Taalimi, Chen Sun, and Weicheng Kuo
    WACV 2025

    [6] Vamos: Versatile Action Models for Video Understanding [Link] [Website] [Code]
    Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
    ECCV 2024

    [5] Do Pre-trained Vision-Language Models Encode Object States? [Link]
    Kaleb Newman, Shijie Wang, Yuan Zang, David Heffren, and Chen Sun
    ECCV 2024 Workshop EVAL-FoMo

    [4] AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? [Link] [Website] [Code]
    Qi Zhao*, Shijie Wang*, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
    ICLR 2024

    [3] Object-centric Video Representation for Long-term Action Anticipation [Link] [Code]
    Ce Zhang*, Changcheng Fu*, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, and Chen Sun
    WACV 2024

    [2] Goal-Conditioned Predictive Coding as an Implicit Planner for Offline Reinforcement Learning [Link] [Website] [Code]
    Zilai Zeng, Ce Zhang, Shijie Wang, and Chen Sun
    NeurIPS 2023

    [1] Pose Recognition with Cascade Transformers [Link] [Code]
    Ke Li*, Shijie Wang*, Xiang Zhang*, Yifan Xu, Weijian Xu, and Zhuowen Tu
    CVPR 2021

Experience

Awards

  • 2022, 3rd Prize of Ego4D Object State Change Classification Challenge, ECCV 2022.
  • 2021, Outstanding Graduate Awards, Tsinghua University.
  • 2018 & 2019 & 2020, Scholarship for Academic Excellence, Tsinghua Univeristy.
  • 2019, Fisrt Prize in Student Research Training Program, Tsinghua University.
  • 2019, Member of Tsinghua University Initiative Scientific Research Program (funding: 30,000¥).
  • 2018, Champion of Yuehan Ma Campus Football Cup, Tsinghua University.

Service

Conference Reviewer:

  • The International Conference on Learning Representations (ICLR)        2024, 2025
  • The International Conference on Machine Learning (ICML)        2024
  • The Conference on Neural Information Processing Systems (NeurIPS)        2023, 2024
  • The Conference on Computer Vision and Pattern Recognition (CVPR)        2022, 2023
  • The International Conference on Computer Vision (ICCV)        2023
  • The European Conference on Computer Vision (ECCV)        2022, 2024
  • AAAI Conference on Artificial Intelligence (AAAI)        2023, 2024
  • Winter Conference on Applications of Computer Vision (WACV)        2023, 2024

Part of the page is generated by jemdoc.

Last Updated: .