Publications
A collection of my research work.

VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models
Chongkai Gao, Zixuan Liu, Zhenghao Chi, Junshan Huang, Xin Fei, Yiwen Hou, Yuxuan Zhang, Yudi Lin, Zhirui Fang, Lin Shao
Advances in Neural Information Processing Systems (NeurIPS) 2025
Unified benchmarking reveals visually grounded hierarchical planning excels in VLAs.

Unifarn: Unified transformer for facial reaction generation
Cong Liang, Jiahe Wang, Haofan Zhang, Bing Tang, Junshan Huang, Shangfei Wang, Xiaoping Chen
Proceedings of the 31st ACM International Conference on Multimedia (ACM MM) 2023
Unified transformer predicts expressive facial reactions from multimodal conversational input.