Shijie Wang (王世杰) |
[8] MotiF: Making Text Count in Image Animation with Motion Focal Loss
Shijie Wang, Samaneh Azadi, Rohit Girdhar, Sai Saketh Rambhatla, Chen Sun, and Xi Yin
Under Review
[7] Learning Visual Grounding from Generative Vision and Language Model
[Link]
Shijie Wang, Dahun Kim, Ali Taalimi, Chen Sun, and Weicheng Kuo
WACV 2025
[6] Vamos: Versatile Action Models for Video Understanding
[Link]
[Website]
[Code]
Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ECCV 2024
[5] Do Pre-trained Vision-Language Models Encode Object States?
[Link]
Kaleb Newman, Shijie Wang, Yuan Zang, David Heffren, and Chen Sun
ECCV 2024 Workshop EVAL-FoMo
[4] AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
[Link]
[Website]
[Code]
Qi Zhao*, Shijie Wang*, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ICLR 2024
[3] Object-centric Video Representation for Long-term Action Anticipation
[Link]
[Code]
Ce Zhang*, Changcheng Fu*, Shijie Wang, Nakul Agarwal, Kwonjoon Lee, Chiho Choi, and Chen Sun
WACV 2024
[2] Goal-Conditioned Predictive Coding as an Implicit Planner for Offline Reinforcement Learning
[Link]
[Website]
[Code]
Zilai Zeng, Ce Zhang, Shijie Wang, and Chen Sun
NeurIPS 2023
[1] Pose Recognition with Cascade Transformers
[Link]
[Code]
Ke Li*, Shijie Wang*, Xiang Zhang*, Yifan Xu, Weijian Xu, and Zhuowen Tu
CVPR 2021
Conference Reviewer:
Part of the page is generated by jemdoc.
Last Updated: .