Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding
[Paper]
[Code]
Yan Shu, Zheng Liu, Peitian Zhang, Minghao Qin, Junjie Zhou, Zhengyang Liang, Tiejun Huang, Bo Zhao
arXiv, 2024
Enabling high-quality and efficient video understanding over thousands of frames on a single A100 GPU.
|
MemoRAG: Moving towards Next-Gen RAG Via Memory-Inspired Knowledge Discovery
[Paper]
[Code]
Hongjin Qian, Peitian Zhang, Zheng Liu, Kelong Mao, Zhicheng Dou
arXiv, 2024
Enhancing retrieval-augmented generation (RAG) with a memory module that acquires a global understanding of the entire database.
|
Long Context Compression with Activation Beacon
[Paper]
[Code]
Peitian Zhang, Zheng Liu, Shitao Xiao, Ninglu Shao, Qiwei Ye, Zhicheng Dou
arXiv, 2024
A plug-in module for transformer-based LLMs that enables effective, efficient, and flexible compression of long contexts.
|
Retrieve Anything to Augment Large Language Models
[Paper]
[Code]
Peitian Zhang, Shitao Xiao, Zheng Liu, Zhicheng Dou, Jian-Yun Nie
ACL, 2024
A unified embedding model that supports diverse retrieval augmentation scenarios.
|
C-Pack: Packed Resources For General Chinese Embeddings
[Paper]
[Code]
Shitao Xiao, Zheng Liu, Peitian Zhang, Niklas Muennighoff, Defu Lian, Jian-Yun Nie
SIGIR, 2024
A package of resources that significantly advance the field of general Chinese embeddings.
|
Education
|
- [2022-2025] M.E. Artificial Intelligence, Renmin University of China
- [2018-2022] B.E. Computer Science, Renmin University of China
|
Awards
|
- [2024] JingDong Special Scholarship
- [2022] Renmin University Excellent Graduate
|