Publications

A collection of my research work.

VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models

VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models

Chongkai Gao, Zixuan Liu, Zhenghao Chi, Junshan Huang, Xin Fei, Yiwen Hou, Yuxuan Zhang, Yudi Lin, Zhirui Fang, Lin Shao

Advances in Neural Information Processing Systems (NeurIPS) 2025

Unified benchmarking reveals visually grounded hierarchical planning excels in VLAs.

Code
Unifarn: Unified transformer for facial reaction generation

Unifarn: Unified transformer for facial reaction generation

Cong Liang, Jiahe Wang, Haofan Zhang, Bing Tang, Junshan Huang, Shangfei Wang, Xiaoping Chen

Proceedings of the 31st ACM International Conference on Multimedia (ACM MM) 2023

Unified transformer predicts expressive facial reactions from multimodal conversational input.