I am now a final-year Ph.D. candidate at the Hong Kong University of Science and Technology, advised by Professor Tong Zhang. Currently, I am a visiting scholar in BLENDER LAB@UIUC, under the supervision of Professor Heng Ji. I was a research intern at ByteDance AI Lab with Dr. Hang Li and Dr. Xinsong Zhang, and Sinovation Ventures AI Institute. During my undergraduate study, I interned at WING Group @ NUS and PKU, where I was fortunate to work with Prof. Min-Yen Kan and Prof. Xiaojun Wan. I am passionate about the research in pre-training, efficient-tuning, and alignment of large foundation models.
I am currently active on the job market and seeking a position in the industry.

News



Publications


  1. Shizhe Diao*, Rui Pan*, Hanze Dong*, Ka Shun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang
    LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models [NEW]
    NAACL 2024 Demo Track
  2. Hanning Zhang*, Shizhe Diao*, Yong Lin*, Yi R. Fung, Qing Lian, Xingyao Wang, Yangyi Chen, Heng Ji, Tong Zhang
    R-Tuning: Teaching Large Language Models to Refuse Unknown Questions [NEW]
    NAACL 2024
  3. Xu Liu, Junfeng Hu, Yuan Li, Shizhe Diao, Yuxuan Liang, Bryan Hooi, Roger Zimmermann
    UniTime: A Language-Empowered Unified Model for Cross-Domain Time Series Forecasting [NEW]
    WWW 2024
  4. Quyet V. Do, Tianqing Fang, Shizhe Diao, Zhaowei Wang, Yangqiu Song.
    ConstraintChecker: A Plugin for Large Language Models to Reason on Commonsense Knowledge Bases
    EACL 2024
  5. Hanze Dong*, Wei Xiong*, Deepanshu Goyal, Rui Pan, Shizhe Diao, Jipeng Zhang, Kashun Shum, Tong Zhang.
    RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment
    TMLR
  6. Kashun Shum*, Shizhe Diao*, Tong Zhang.
    Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data
    Findings of EMNLP 2023
  7. Renjie Pi*, Jiahui Gao*, Shizhe Diao*, Rui Pan, Hanze Dong, Jipeng Zhang, Lewei Yao, Jianhua Han, Hang Xu, Lingpeng Kong, Tong Zhang
    DetGPT: Detect What You Need via Reasoning
    EMNLP 2023
  8. Shizhe Diao*, Yongyu Lei*, Liangming Pan, Tianqing Fang, Wangchunshu Zhou, Sedrick Scott Keh, Min-Yen Kan, Tong Zhang.
    Doolittle: Benchmarks and Corpora for Academic Writing Formalization
    EMNLP 2023
  9. Zhihong Chen*, Shizhe Diao*, Benyou Wang, Guanbin Li, Xiang Wan.
    Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts
    ICCV 2023
  10. Shizhe Diao*, Tianyang Xu*, Ruijia Xu, Jiawei Wang, Tong Zhang.
    Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memories
    ACL 2023
  11. Zhihong Chen, Guiming Hardy Chen, Shizhe Diao, Xiang Wan, Benyou Wang.
    On the Difference of BERT-style and CLIP-style Text Encoders
    Findings of ACL 2023
  12. Shizhe Diao, Wangchunshu Zhou, Xinsong Zhang, Jiawei Wang.
    Write and Paint: Generative Vision-Language Models are Unified Modal Learners
    ICLR 2023
  13. Shizhe Diao, Zhichao Huang, Ruijia Xu, Xuechun Li, Yong Lin, Xiao Zhou, Tong Zhang.
    Black-box Prompt Learning for Pre-trained Language Models
    TMLR
  14. Shizhe Diao*, Sedrick Scott Keh*, Liangming Pan, Zhiliang Tian, Yan Song, Tong Zhang.
    Hashtag-Guided Low-Resource Tweet Classification
    WWW 2023
  15. Wangchunshu Zhou*, Yan Zeng*, Shizhe Diao*, Xinsong Zhang*.
    VLUE: A Multi-Task Benchmark for Evaluating Vision-Language Models
    ICML 2022
  16. Xiao Zhou, Weizhong Zhang, Zonghao Chen, Shizhe Diao, Tong Zhang.
    Efficient Neural Network Training via Forward and Backward Propagation Sparsification
    NeurIPS 2021
  17. Shizhe Diao, Ruijia Xu, Hongjin Su, Yilei Jiang, Yan Song, Tong Zhang.
    Taming Pre-trained Language Models with N-gram Representations for Low-Resource Domain Adaptation
    ACL 2021
  18. Shizhe Diao*, Xinwei Shen*, KaShun SHUM, Yan Song, Tong Zhang.
    TILGAN: Transformer-based Implicit Latent GAN for Diverse and Coherent Text Generation
    Findings of ACL 2021
  19. Shizhe Diao, Jiaxin Bai, Yan Song, Tong Zhang, and Yonggang Wang.
    ZEN: Pre-training Chinese Text Encoder Enhanced by N-gram Representations
    Findings of EMNLP 2020

Preprints


  1. Rui Pan, Xiang Liu, Shizhe Diao, Renjie Pi, Jipeng Zhang, Chi Han, Tong Zhang
    LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning [NEW]
  2. Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang
    Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards [NEW]
  3. Xin Xu, Shizhe Diao, Can Yang, Yang Wang
    Can We Verify Step by Step for Incorrect Answer Detection? [NEW]
  4. Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang
    The Instinctive Bias: Spurious Images lead to Hallucination in MLLMs
  5. Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang
    Mitigating the Alignment Tax of RLHF
  6. Rui Pan, Shuo Xing, Shizhe Diao, Xiang Liu, Kashun Shum, Jipeng Zhang, Tong Zhang
    Plum: Prompt Learning using Metaheuristic
  7. Ziqiang Zheng, Jipeng Zhang, Tuan-Anh Vu, Shizhe Diao, Yue Him Wong Tim, Sai-Kit Yeung
    MarineGPT: Unlocking Secrets of Ocean to the Public
  8. Shizhe Diao, Pengcheng Wang, Yong Lin, Tong Zhang.
    Active Prompting with Chain-of-Thought for Large Language Models
  9. Hanze Dong*, Shizhe Diao*, Weizhong Zhang, Tong Zhang.
    Normalizing Flow with Variational Latent Representation
  10. Rui Pan*, Shizhe Diao*, Jianlin Chen, Tong Zhang.
    ExtremeBERT: A Toolkit for Accelerating Pretraining of Customized BERT
    [Paper] [Code] [Documentation]

Recent Talks




Honors and Awards



Experience


Oct. 2021 - Jul. 2022
Research Intern, ByteDance AI Lab, China
Vision-Language Foundation Models
Advisor: Dr. Hang Li and Dr. Xinsong Zhang

Jun. 2019 - Jan. 2020
Research Intern, Sinovation Ventures AI Institute, China
Pre-trained Language Models
Advisor: Prof. Yan Song

Apr. 2018 - Oct. 2018
Research Intern, National University of Singapore (NUS), Singapore
Semi-supervised End-to-End Dialogue system
Advisor: Prof. Min-Yen Kan and Dr. Wenqiang Lei

Mar. 2017 - Mar. 2019
Research Intern, Peking University (PKU), China
Multimodal Chinese Poem Generation
Advisor: Prof. Xiaojun Wan

Sept. 2017 - Dec. 2017
Exchange Student
The Chinese University of Hong Kong (CUHK), HKSAR

Jul. 2017 - Aug. 2017
Visiting Student, Ben-Gurion University of the Negev (BGU), Israel
Cyber Security and Business Intelligence

Academic Service



Teaching Assistant



Miscellaneous