Yang Shu

Assistant Professor
School of Data Science and Engineering
East China Normal University

Email: yshu@dase.ecnu.edu.cn
Previous email: shu-y18@mails.tsinghua.edu.cn (no longer in use)
Mail: Room 312, Geography Building, 3663 North Zhongshan Road, Shanghai
Google Scholar


My research aims at creating strong learning machines that adapt to complex real world. I am working on transfer learning, out-of-distribution (OOD) generalization, domain adaptation and few-shot learning. I am also interested in related topics in machine learning and deep learning, such as foundation models, time series analysis and multi-modal learning.

I am currently an Assistant Professor, Chenhui Scholar, in School of Data Science and Engineering, East China Normal University. I received my Ph.D. degree from School of Software, Tsinghua University, advised by Mingsheng Long. I received my B.S. degree from Department of Automation, Tsinghua University.

Preprints

  1. Transferability in Deep Learning: A Survey
    Junguang Jiang, Yang Shu, Jianmin Wang, Mingsheng Long*
    [Paper] [Website]

Publications

  1. Towards a General Time Series Forecasting Model with Unified Representation and Adaptive Transfer
    Yihang Wang, Yuying Qiu, Peng Chen, Kai Zhao, Yang Shu, Zhongwen Rao*, Lujia Pan, Bin Yang, Chenjuan Guo*
    International Conference on Machine Learning (ICML), 2025 (accepted)

  2. LightGTS: A Lightweight General Time Series Forecasting Model
    Yihang Wang, Yuying Qiu, Peng Chen, Yang Shu, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo
    International Conference on Machine Learning (ICML), 2025 (accepted)

  3. Enhancing Diversity for Data-free Quantization
    Kai Zhao, Zhihao Zhuang, Miao Zhang, Chenjuan Guo*, Yang Shu, Bin Yang
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025 (accepted, oral)

  4. AimTS: Augmented Series and Image Contrastive Learning for Time Series Classification
    Yuxuan Chen, Shanshan Huang, Yunyao Cheng, Peng Chen, Zhongwen Rao, Yang Shu*, Bin Yang,
    Lujia Pan, Chenjuan Guo
    IEEE International Conference on Data Engineering (ICDE), 2025 [Paper]

  5. AID-SQL: Adaptive In-Context Learning of Text-to-SQL with Difficulty-Aware Instruction and Retrieval-Augmented Generation
    Xiuwen Li, Qifeng Cai, Yang Shu*, Chenjuan Guo, Bin Yang
    IEEE International Conference on Data Engineering (ICDE), 2025 (accepted)

  6. Towards a General Time Series Anomaly Detector with Adaptive Bottlenecks and Dual Adversarial Decoders
    Qichao Shentu, Beibu Li, Kai Zhao, Yang Shu*, Zhongwen Rao, Lujia Pan, Bin Yang, Chenjuan Guo
    International Conference on Learning Representations (ICLR), 2025 [Paper]

  7. Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation
    Sicong Liu, Yang Shu*, Chenjuan Guo, Bin Yang
    International Conference on Learning Representations (ICLR), 2025 [Paper]

  8. RCRank: Multimodal Ranking of Root Causes of Slow Queries in Cloud Database Systems
    Biao Ouyang, Yingying Zhang, Hanyin Cheng, Yang Shu*, Chenjuan Guo, Bin Yang, Qingsong Wen,
    Lunting Fan, Christian S. Jensen
    International Conference on Very Large Data Bases (VLDB), 2025 [Paper]

  9. Assessing Pre-trained Models for Transfer Learning through Distribution of Spectral Components
    Tengxue Zhang, Yang Shu*, Xinyang Chen*, Yifei Long, Chenjuan Guo, Bin Yang
    AAAI Conference on Artificial Intelligence (AAAI), 2025 [Paper]

  10. Boosting Transferability and Discriminability for Time Series Domain Adaptation
    Mingyang Liu, Xinyang Chen*, Yang Shu*, Xiucheng Li*, Weili Guan, Liqiang Nie
    Neural Information Processing Systems (NeurIPS), 2024 [Paper] [Code]

  11. Pathformer: Multi-scale Transformers with Adaptive Pathways for Time Series Forecasting
    Peng Chen, Yingying Zhang, Yunyao Cheng, Yang Shu*, Yihang Wang, Qingsong Wen, Bin Yang, Chenjuan Guo
    International Conference on Learning Representations (ICLR), 2024 [Paper] [Code]

  12. Omni-Training: Bridging Pre-Training and Meta-Training for Few-Shot Learning
    Yang Shu, Zhangjie Cao, Jinghan Gao, Jianmin Wang, Philip S. Yu, Mingsheng Long*
    IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 [Paper]

  13. CLIPood: Generalizing CLIP to Out-of-Distributions
    Yang Shu, Xingzhuo Guo, Jialong Wu, Ximei Wang, Jianmin Wang, Mingsheng Long*
    International Conference on Machine Learning (ICML), 2023 [Paper] [Code]

  14. Hub-Pathway: Transfer Learning from A Hub of Pre-trained Models
    Yang Shu, Zhangjie Cao, Ziyang Zhang, Jianmin Wang, Mingsheng Long*
    Neural Information Processing Systems (NeurIPS), 2022 [Paper]

  15. Zoo-Tuning: Adaptive Transfer from A Zoo of Models
    Yang Shu, Zhi Kou, Zhangjie Cao, Jianmin Wang, Mingsheng Long*
    International Conference on Machine Learning (ICML), 2021 [Paper] [Code]

  16. Open Domain Generalization with Domain-Augmented Meta-Learning
    Yang Shu, Zhangjie Cao, Chenyu Wang, Jianmin Wang, Mingsheng Long*
    IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021 [Paper] [Code]

  17. Transferable Curriculum for Weakly-Supervised Domain Adaptation
    Yang Shu, Zhangjie Cao, Mingsheng Long*, Jianmin Wang
    AAAI Conference on Artificial Intelligence (AAAI), 2019 [Paper] [Code]

Academic Services

PC Member
  • International Conference on Machine Learning (ICML), 2022, 2023, 2024, 2025
  • Neural Information Processing Systems (NeurIPS), 2022, 2023, 2024, 2025
  • International Conference on Learning Representations (ICLR), 2024, 2025
  • IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2021, 2022, 2023
  • IEEE International Conference on Computer Vision (ICCV), 2021, 2023, 2025
  • European Conference on Computer Vision (ECCV), 2022, 2024
  • ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025
  • AAAI Conference on Artificial Intelligence (AAAI), 2025
  • ACM International Conference on Multimedia (ACMMM), 2025
  • International Joint Conference on Artificial Intelligence (IJCAI), 2020
Reviewer
  • IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  • IEEE Transactions on Knowledge and Data Engineering (TKDE)
  • International Journal of Computer Vision (IJCV)
  • Artificial Intelligence Journal (AIJ)
  • Information Sciences
  • IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • IEEE Transactions on Industrial Informatics (TII)
  • Transactions on Machine Learning Research (TMLR)