Publications

For the updated publication list, check out the Google Scholar page.

(* denotes equal contribution or alphabetic ordering, † denotes corresponding author)

2025

  • Veu-bench: Towards comprehensive understanding of video editing
    Bozheng Li†, Yongliang Wu, Yi Lu, Jiashuo Yu, Licheng Tang, Jiawang Cao, Wenqing Zhu, Yuyang Sun, Jay Wu, Wenbo Zhu†
    CVPR 2025 (Highlight)
    [Arxiv] [Benchmark]

  • Fully fine-tuned CLIP models are efficient few-shot learners
    Mushui Liu*, Bozheng Li*, Jun Dan, Ziqian Lu, Zhao Wang, Yunlong Yu†
    Knowledge-Based Systems
    [Arxiv]

  • RSVP: Reasoning Segmentation via Visual Prompting and Multi-modal Chain-of-Thought
    Yi Lu, Jiawang Cao, Yongliang Wu, Bozheng Li, Licheng Tang, Yangguang Ji, Chong Wu, Jay Wu, Wenbo Zhu
    ACL 2025
    [Arxiv]

2024

  • Frame Order Matters: A Temporal Sequence-Aware Model for Few-Shot Action Recognition
    Bozheng Li†, Mushui Liu, Gaoang Wang, Yunlong Yu†
    AAAI 2025
    [Arxiv]

  • Envisioning Class Entity Reasoning by Large Language Models for Few-shot Learning
    Mushui Liu, Fangtai Wu, Bozheng Li, Ziqian Lu, Yunlong Yu†, Xi Li
    AAAI 2025
    [Arxiv]

  • Video Repurposing from User Generated Content: A Large-scale Dataset and Benchmark
    Yongliang Wu, Wenbo Zhu, Jiawang Cao, Yi Lu, Yongliang Wu, Wenbo Zhu, Jiawang Cao, Yi Lu, Bozheng Li, Weiheng Chi, Zihan Qiu, Lirian Su, Haolin Zheng, Jay Wu, Xu Yang†
    AAAI 2025
    [Arxiv]

  • OmniCLIP: Adapting CLIP for Video Recognition with Spatial-Temporal Omni-Scale Feature Learning
    YMushui Liu, Bozheng Li, Yunlong Yu†
    AAAI 2025
    [Arxiv]

Preprints

  • OpusAnimation: Code-Based Dynamic Chart Generation
    Bozheng Li, Miao Yang, Zhenhan Chen, Jiawang Cao, Mushui Liu, Yi Lu, Yongliang Wu, Bin Zhang, Yangguang Ji, Licheng Tang, Jay Wu, Wenbo Zhu
    Preprint 2025
    [Arxiv]