Shijie Wang (王世杰)

Shijie Wang (王世杰)
Ph.D. Student

Department of Computer Science, Brown University

Office: Room 453, 115 Waterman Street, Providence, RI 02912
Email: shijie_wang [AT] brown [DOT] edu (primary) wang98thu [AT] gmail [DOT] com

[Google Scholar] [GitHub] [LinkdeIn] [Twitter]

Short bio | Education | News | Selected Papers | Experience | Awards | Service

Short Bio

I am a final year CS Ph.D. student at Brown University working with Prof. Chen Sun. Previously, I also worked at Google and Meta as a research intern. I obtained my bachelor degree in software engineering at Tsinghua University in 2021.

My research interests involve building physically grounded, reasoning-capable vision-language models and exploring their effective integration into the physical world. Feel free to contact me for collaborations and casual chats. I'm actively looking for industry full-time opportunities in 2026.

Education

09/2021 - NOW Ph.D. in Department of Computer Science, Brown University
08/2016 - 06/2021 B.S. in School of Software, Tsinghua University. (Outstanding Undergrad)

News

05/2025, I will join Salesforce AI Research as a research intern.
05/2024, I joined Meta GenAI as a research scientist intern.
10/2023, I started to work at Google DeepMind as a Student Researcher.
06/2023, I became a PhD candidate.
10/2022, I got 3rd prize in Ego4D Object State Change Classification Challenge.
05/2022, I started to work at Google as a Student Researcher.
08/2021, I moved to Providence and started my PhD career at Brown University!
06/2021, I graduated from Tsinghua University as an outstanding undergrad!
03/2021, My first paper was accepted in CVPR 2021!

Selected Papers

[9] MotiF: Making Text Count in Image Animation with Motion Focal Loss [Link] [Website] [Benchmark]
Shijie Wang, Samaneh Azadi, Rohit Girdhar, Sai Saketh Rambhatla, Chen Sun, and Xi Yin
CVPR 2025

[8] How Can Objects Help Video-Language Understanding? [Link]
Zitian Tang, Shijie Wang, Junho Cho, Jaewook Yoo, and Chen Sun
ICCV 2025

[7] Learning Visual Grounding from Generative Vision and Language Model [Link]
Shijie Wang, Dahun Kim, Ali Taalimi, Chen Sun, and Weicheng Kuo
WACV 2025

[6] Vamos: Versatile Action Models for Video Understanding [Link] [Website] [Code]
Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ECCV 2024

[5] Do Pre-trained Vision-Language Models Encode Object States? [Link]
Kaleb Newman, Shijie Wang, Yuan Zang, David Heffren, and Chen Sun
ECCV 2024 Workshop EVAL-FoMo

[4] AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? [Link] [Website] [Code]
Qi Zhao*, Shijie Wang*, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ICLR 2024

[3] Object-centric Video Representation for Long-term Action Anticipation [Link] [Code]
Ce Zhang*, Changcheng Fu*, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, and Chen Sun
WACV 2024

[2] Goal-Conditioned Predictive Coding as an Implicit Planner for Offline Reinforcement Learning [Link] [Website] [Code]
Zilai Zeng, Ce Zhang, Shijie Wang, and Chen Sun
NeurIPS 2023

[1] Pose Recognition with Cascade Transformers [Link] [Code]
Ke Li*, Shijie Wang*, Xiang Zhang*, Yifan Xu, Weijian Xu, and Zhuowen Tu
CVPR 2021

Experience

06/2025 - NOW Research Intern at Salesforce AI Research with Dr. Juan Carlos Niebles and Dr. Honglu Zhou.
05/2024 - 11/2024 Research Scientist Intern at Meta GenAI with Dr. Xi Yin.
09/2023 - 03/2024 Student Researcher at Google DeepMind with Dr. Weicheng Kuo.
05/2022 - 12/2022 Student Researcher at Google Research with Dr. Yin Cui.
07/2020 - 03/2021 Research Assistant at UCSD with Prof. Zhuowen Tu.
07/2019 - 09/2019 Machine Learning Intern, Kwai.
12/2018 - 06/2020 Research Assistant at Tsinghua with Prof. Mingsheng Long.

Awards

2022, 3rd Prize of Ego4D Object State Change Classification Challenge, ECCV 2022.
2021, Outstanding Undergrad Awards, Tsinghua University.
2018 & 2019 & 2020, Scholarship for Academic Excellence, Tsinghua University.
2019, First Prize in Student Research Training Program, Tsinghua University.
2019, Member of Tsinghua University Initiative Scientific Research Program (funding: 30,000￥).
2018, Champion of Yuehan Ma Campus Football Cup, Tsinghua University.

Service

Reviewer:

IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
International Journal of Computer Vision (IJCV)
The International Conference on Learning Representations (ICLR) 2024, 2025
The International Conference on Machine Learning (ICML) 2024
The Conference on Neural Information Processing Systems (NeurIPS) 2023, 2024, 2025
The Conference on Computer Vision and Pattern Recognition (CVPR) 2022, 2023, 2025
The International Conference on Computer Vision (ICCV) 2023
The European Conference on Computer Vision (ECCV) 2022, 2024
AAAI Conference on Artificial Intelligence (AAAI) 2023, 2024
Winter Conference on Applications of Computer Vision (WACV) 2023, 2024

Part of the page is generated by jemdoc.

Last Updated: .