计算机科学与技术学院（软件学院）

个人简介

白琮，博士，浙江工业大学计算机科学与技术学院，教授，博士生导师。学院党委委员，院人才办主任，计算机视觉研究所执行所长。斯坦福大学全球前2%顶尖科学家入选者。研究方向为计算机视觉与智慧气象。主持国自然联合重点、面上、青年、科技部重点研发计划课题等省部级以上项目10项；在国内外知名学术期刊或会议上发表论文100余篇，包括Nature Communications、NeurIPS、ICML、AAAI、KDD、ACM MM、IEEE TGRS, IEEE TNNLS, IEEE TCBY, IEEE TMM, IEEE GRSL、GRL等领域内的权威期刊或会议等，其中ESI高被引论文1篇,《软件学报》2019年和2020年高影响力论文各1篇。曾5次获得国际/国内知名学术会议的最佳/优秀论文奖。担任Elsevier旗下SCI期刊《Displays》的编委会委员，IEEE 《Journal of Selected Topics in Applied Earth Observations and Remote Sensing》副编辑。

研究工作的数据及源代码请见：https://github.com/Zjut-MultimediaPlus

欢迎本校的优秀本科生进组进行科研实习！希望你四六级已过，绩点3.8以上。
请先通过邮件或电话（659417）进行联系。

欢迎27年保研学生随时邮件联系。
27年的考研学生，请出分后再联系。
招收:硕士生、博士生
招聘: 博士后，青年教师。

教育经历：
2009.9~2013.3 法国雷恩国立应用科学学院（INSA Rennes）信号与图像处理专业    工学博士
2006.9~2009.3 上海大学                                电路与系统            工学硕士
1999.9~2003.7 山东大学                               电子信息工程          工学学士

学术兼职：
教育部高等学校电子信息类专业教学指导委员会协作委员
中国气象学会气象人工智能专委会副秘书长
中国计算机学会高级会员、多媒体专委会委员
中国图形图像学会会员、多媒体专委会委员
Displays （An Elsevier journal indexed by SCI） Editorial Board Member
程序委员会委员：ICML 2025, KDD 2025，ECAI 2024,2025 IJCAI 2024, ACMMM Asia 2019， ACM MM 2018，2022，2023，2024，2025 MIPR 2020, 2018, PCM 2018
权威期刊审稿人：IEEE TNNLS，IEEE TMM， IEEE TAI,IEEE TBD,IEEE TGRS,IEEE TKDE等

育人成果

实验室传帮带氛围好。能保证充分的指导。
实验室不要求打卡，看实际科研进展！
毕业要求完成之后，鼓励有就业意向的学生去企业实习。

2025级硕士生：陈睿、黄天乐、杨梁（与产思贤副教授共同指导）、吴文强（与穆攀副教授共同指导）、洪升（与王铮博士共同指导）、张司扬（与马青副教授共同指导）、林佳杰（与王铮博士共同指导）、洪超（与穆攀副教授共同指导）、方正（与王琼博士共同指导）
2024级硕士生：朱宇超（与穆攀副教授共同指导）、卢珂宇（与马青副教授共同指导）、卢思宇、骆霖轩（研究生国奖，与穆攀副教授共同指导）、汪漾（与王琼博士共同指导）、李想（与穆攀副教授共同指导）、严明、陈浩然（与王铮博士共同指导）、洪滔（与王铮博士共同指导）
2023级硕士生：王泽弘（与产思贤副教授共同指导，金华电信）、倪鸿泽（与马青副教授共同指导，宁波选调生）、张燕玲（研究生国奖，与穆攀博士共同指导,浙江宇视）、黄堃（与王铮博士共同指导，硕博连读）、郑策（与王琼博士共同指导）、赵政棋（硕博连读，与郝鹏翼副教授共同指导）、胡至盈（与王琼博士共同指导，招银科技）、应楷（研究生国奖，达摩院，与王铮博士共同指导）、林仲豪（研究生国奖，浙商银行）、王奕欣（与穆攀博士共同指导，美团）、汪琰（与王琼博士共同指导，杭州润江汇科技）
2022级硕士生：严涵婷（研究生国奖，硕博连读），潘建成(清华联合培养,研究生国奖，两次！保加利亚INSAIT访问学生)，何波贤（浅海科技），蒋悦（浙江省妇保），张晗奕（萧山农商银行），方靖（杭州移动）
2021级硕士生：欧阳鹏翔(硕博连读，复旦联合培养)，孙泓迪（百度），赵董孝元（研究生国奖，瓴羊科技），蔡志鹏（浙商银行），潘莹莹（省直属事业单位）
2020级硕士生：郑安琪（研究生国奖，浙商银行），孙丰（研究生国奖，快手），张晓青（浙江水利水电学院），李雅丽（宁波大学）
2019级硕士生：顾婷菲（浙商银行），曾超（研究生国奖，校优秀硕士论文，Zoom），张敏靖（研究生国奖，武康交警），黄诚（研究生国奖，硕博连读）
2018级硕士生：李宏凯（研究生国奖，腾讯/网易），黄远（浙商银行）
2017级硕士生：陈健（研究生国奖，浙商银行）
2016级硕士生：黄玲（研究生国奖，校优秀硕士论文，浙工大研究生国际化培养资助美国交流半年，法国UTC读博，CSC全额奖学金，NUS博后在读）
2015级硕士生：陈佳楠（研究生国奖，法国驻华使馆访学资助法国访学一个月，法国INSA Rennes读博，CSC全额奖学金）
2014级硕士生：邹祎杰（爱立信）

2026级博士生：黄堃（硕博连读）、王骋誉（本硕博一体化）
2025级博士生：严涵婷 (硕博连读)
2024级博士生：姜子浔（本硕博一体化）、欧阳鹏翔（硕博连读，复旦大学联合培养）
2023级博士生：张诗琪（本硕博一体化，英国布里斯托大学联合培养）
2022级博士生：陶健（硕博连读，本校博后）
2021级博士生：黄诚（研究生国奖，英国布里斯托大学联合培养，台州学院教师）
2013级博士生：马青（浙江工业大学教师）

指导过的本科生：

2022级：卢美伊（保研浙江大学）
2021级：林曾荣（保研同济大学）、周彬加（保研浙江大学）
2020级：许函宁（保研复旦大学）、杜智颖（保研复旦大学）、姜子浔（保送本校本硕博一体化）

2019级：张诗琪（保送本校本硕博一体化）

2018级：查志伟（保研复旦大学），徐逸伦（保研浙江大学）

2017级：周鹏飞（本科生国奖，保研中科院计算所）

2016级：朱赟（保研浙江大学），郑安琪（保研本校），许柏舟（ UC San Diego 读研)

2015级：路彭悦（本科生国奖，保研华东师范大学）

2014级：傅宇倩（本科生国奖，保研复旦大学）

做为指导教师指导过以下课外科技：

2023年第十八届“挑战杯”全国大学生课外学术科技作品竞赛一等奖

2022年浙江省新苗人才计划立项并顺利结题

2020年第十一届中国大学生服务外包创新创业大赛二等奖

2019年国家级大学生创新创业训练计划立项并顺利结题

2019年浙江省新苗人才计划立项并顺利结题

2016年浙江省大学生服务外包创新应用大赛省二等奖

2015年国家级大学生创新创业训练计划立项并顺利结题

科研项目

10. 浙江省自然科学基金杰出青年基金延续资助项目，多源数据驱动的灾害天气智能预报， 2025.1~2027.12，主持
9. 国家自然科学基金区域创新发展联合基金重点项目，白内障显微手术机器人影像处理与自主导航关键问题研究，2021.1-2024.12，主持。
8. 浙江省自然科学基金杰出青年基金项目，多媒体内容理解与检索，2021.1~2023.12，主持。
7. 国家自然科学基金面上项目，面向图像检索的无监督深度特征学习方法研究， 2020.01~2023.12，主持。
6. 中国气象服务协会气象科技创新平台面上项目，多卫星降水联合智能反演技术研究，2023年5月-2024年4月，主持
5. 国家自然科学基金辽宁联合基金重点项目，多元医学影像数据智能分析交换技术与系统应用，2020.01~2023.12，合作单位负责人。
4. 国家重点研发计划政府间国际科技创新合作重点专项，基于气象卫星影像的典型灾害天气智能解译关键技术研究与应用示范，2020.01~2022.12，合作单位负责人。
3．国家自然科学基金青年科学基金项目，基于内容的图像检索中语义特征表示及语义融合,2016.01~2018.12,主持。
2. 浙江省自然科学基金面上项目，面向语义图像检索的深度卷积神经网络优化研究， 2018.01~2020.12，主持。
1. 浙江省自然科学基金面上项目，基于显著性模型的语义图像检索，2015.01~2017.12，主持。

科研成果

文章与专利列表
期
期刊文章：
[1]        C. Bai, H. Yan, P. Mu*, C. Huang, J. Zhang, S. Shu, “Estimating tropical cyclone maximum wind speed and radius using a multimodal hybrid guided network,” Journal of Geophysical Research: Atmospheres, 131, e2025JD044721, 2026
[2]        J. Tao, S. Chan*, C. Bai, X. Zhang, “GVLTrack: Global Vision-Language Tracking with Multi-Stage Modal Fusion,” ACM Trans. Multimedia Comput. Commun. 22, 5, Article 138, 2026
[3]        Zhang, Z., Li, G., Zhang, H. et al. A Novel Dataset and Lightweight Distillation Baseline for Highlight Transparent Object Detection. Int J Comput Vis134, 157, 2026
[4]        H. Li, D. Li, J. Zhang, Y. Zheng, C. Bai, “3D-MolGL: A multimodal framework for integrating 3D molecular graphs into language models,” Expert Systems with Applications, 312, 2026,
[5]        S. Chan, J. Zheng, S. Li, P. Mu and C. Bai*, Adaptive Feature Fusion based on Double Head Framework for Small Object Detection in Highway Scenarios, IEEE Transactions on Artificial Intelligence, doi: 10.1109/TAI.2026.3664780
[6]        Q. Ma, Y. Jiang, C. Bai*, “Boundary mutual information hashing for cross-modal retrieval,” Pattern Recognition, vol. 172, Art no.112658, 2026
[7]        Q. Wang, Y. Wang, H. Sun, Y. Feng, Z. Dong* and C. Bai*, Refocal Loss in Transformer for Long-Tailed Multi-Granularity Cataract Classification, in IEEE Journal of Biomedical and Health Informatics, vol. 30, no. 1, pp. 617-630, Jan. 2026
[8]        Z. Zhao, C. Bai, P. Hao*, “ GraphVSum: graph guided multimodal video summarization, ”Multimedia Systems vol. 32, Art no.38, 2026
[9]        C. Huang, P. Mu, J. Zhang, and C. Bai*, Benchmark dataset and deep learning method for global tropical cyclone forecasting, Nature Communications, vol. 16, p. 5923, 2025.
[10]     Z. Wang, B. He, C. Wang, B. Xu, and C. Bai*, Precipitation retrieval integrating multiple satellite observations: A dataset and a framework, IEEE Trans. Geoscience Remote Sensing, vol. 63, pp. 1-15, 2025, Art no. 4107015.
[11]     J. Chen, J. Wang, S. Chan*, and C. Bai, DBFA-TSNet: A Three-Stage Building Extraction Network Based on Dual-Branch Fusion and Adaptive Enhancement, IEEE Transactions on Geoscience and Remote Sensing, vol. 63, 2025, pp. 1–13.
[12]     P. Ouyang, Q. Ma*, and C. Bai, Sparse Information Perception Network for Remote Sensing Cross-Modal Retrieval, IEEE Transactions on Geoscience and Remote Sensing, vol. 63, 2025, Art. no. 5521715.
[13]     X. Wang, Y. Zhu, Y. Cui, X. Huang, D. Guo, P. Mu, and C. Bai, Lightweight Multi-Stage Aggregation Transformer for Robust Medical Image Segmentation, Medical Image Analysis, 2025, Art. no. 103569
[14]     J. Tao, S. Chan, Z. Shi, C. Bai, and S. Chen, FocTrack: Focus Attention for Visual Tracking, Pattern Recognition, vol. 160, 2025, Art. no. 111128.
[15]     L. Wang, Z. Wang, W. Hu, and C. Bai, RainHZLet: Hybrid High-Low Frequency and Cross-Scale Network for Precipitation Nowcasting, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2025.
[16]     P. Hao, C. Wu, and C. Bai*, HHGraphSum: Hierarchical Heterogeneous Graph Learning for Extractive Document Summarization, Displays, vol. 86, 2025, Art. no. 102884.
[17]     S. Chan, J. Cui, Y. Wu, H. Wang, and C. Bai, Multi-Scale Feature Correspondence and Restriction Mechanism for Visible X-Ray Baggage Re-Identification, Multimedia Systems, vol. 30, no. 6, 2024, Art. no. 315
[18]     S. Chan, X. Zeng, X. Wang, J. Hu, and C. Bai, Auxiliary Feature Fusion and Noise Suppression for HOI Detection, ACM Transactions on Multimedia Computing, Communications and Applications, vol. 20, no. 10, 2024
[19]     H. Zhang, Q. Miao, S. Li, C. Wang, S. Chan, J. Hu, and C. Bai, An Efficient and Real-Time Steel Surface Defect Detection Method Based on Single-Stage Detection Algorithm, Multimedia Tools and Applications, 2024
[20]     S. Chan, W. Wang, Z. Shao, Z. Wang, and C. Bai, Region Mining and Refined Query Improved HOI Detection in Transformer, IEEE Transactions on Emerging Topics in Computational Intelligence, 2024
[21]     Q. Wang, H. Sun, Y. Feng, Z. Dong*, and C. Bai, MGCNet: Multi-Granularity Cataract Classification Using Denoising Diffusion Probabilistic Model, Displays, vol. 83, 2024, Art. no. 102716
[22]     P. Hao, S. Liu, and C. Bai*, Intent-Aware Graph-Level Embedding Learning Based Recommendation, Journal of Computer Science and Technology, vol. 39, no.5, 2024, pp.1138–1152.
[23]     Q. Ma, J. Pan, and C. Bai*, Direction-Oriented Visual–Semantic Embedding Model for Remote Sensing Image–Text Retrieval, IEEE Transactions on Geoscience and Remote Sensing, vol.62, 2024, Art. no.4704014.
[24]     C. Bai, C. Zeng, Q. Ma, and J. Zhang*, Graph Convolutional Network Discrete Hashing for Cross-Modal Retrieval, IEEE Transactions on Neural Networks and Learning Systems, vol.35, no.4, 2024, pp.4756–4767.
[25]     P. Zhou, K. Ying, Z. Wang, D. Guo, and C. Bai*, Self-Supervised Enhancement for Named Entity Disambiguation via Multimodal Graph Convolution, IEEE Transactions on Neural Networks and Learning Systems, vol.35, no.1,2024, pp.231–245.
[26]     S. Chan, W. Meng, C. Bai*, J. Hu*, and S. Chen, Diverse-Feature Collaborative Progressive Learning for Visible-Infrared Person Re-Identification, IEEE Transactions on Industrial Informatics, vol.20, no.5,2024, pp.7754–7763.
[27]     S. Chan, M. Yu, Z. Chen*, J. Mao, and C. Bai, Regional Contextual Information Modeling for Small Object Detection on Highways, IEEE Transactions on Instrumentation and Measurement, vol.72,2023, Art. no.2531613.
[28]     D. Zhao, Q. Wang*, J. Zhang, and C. Bai, Mine Diversified Contents of Multispectral Cloud Images Along With Geographical Information for Multilabel Classification, IEEE Transactions on Geoscience and Remote Sensing, vol.61,2023, pp.1–15.
[29]     C. Bai, Z. Cai, X. Yin*, and J. Zhang, LSDSSIMR: Large-Scale Dust Storm Database Based on Satellite Images and Meteorological Reanalysis Data, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.16,2023, pp.10121–10131.
[30]     X. Zhang, C. Bai*, K. Kpalma, OMCBIR: Offline mobile content-based image retrieval with lightweight CNN optimization, Displays, Vol. 76, 2023, 102355
[31]     A. Zheng, S. Zheng, C. Bai, D. Chen*. Triple-level relationship enhanced transformer for image captioning. Multimedia Systems, Vol.29, pp. 1955–1966, 2023
[32]     P. Hao, Y. Li, C. Bai*. Meta-relationship for course recommendation in MOOCs. Multimedia Systems, Vol. 29, 235–246, 2023
[33]     C. Bai, M. Zhang, J. Zhang*, J. Zheng, and S. Chen, LSCIDMR: Large-Scale Satellite Cloud Image Database for Meteorological Research, IEEE Transactions on Cybernetics, vol.52, no.11, pp.12538–12550, 2022.
[34]     C. Zhu, X. Wang, SY. Chen, Zhongzhao Teng, C. Baiet al.Complex carotid artery segmentation in multi-contrast MR sequences by improved optimal surface graph cuts based on flow line learning. Medical & Biological Engineering & Computing 60, 2693–2706 (2022)
[35]     X. Wang, Y. Yuan, D. Guo, Y. Cui, M. Xia, Z. Wang, C. Bai and S. Chen, “SSA-Net: Spatial self-attention network for COVID-19 pneumonia infection   segmentation with semi-supervised few-shot learning” , Medical Image Analysis 79: 102459, 2022
[36]     S. Chan, J. Tao, X. Zhou, C. Bai, and X. Zhang, Siamese Implicit Region Proposal Network With Compound Attention for Visual Tracking, IEEE Transactions on Image Processing, vol.31,2022, pp.1882–1894.
[37]     X. Wang, L. Wang, Y. Sheng, C. Zhu, N. Jiang, C. Bai, M. Xia, Z. Shao, Z. Gu, and X. Huang, Automatic and Accurate Segmentation of Peripherally Inserted Central Catheter (PICC) From Chest X-Rays Using Multi-Stage Attention-Guided Learning, Neurocomputing, vol.482,2022, pp.82–97.
[38]     C. Bai, D. Zhao, M. Zhang and J. Zhang, Multimodal Information Fusion for Weather Systems and Clouds Identification From Satellite Images, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 15, pp. 7333-7345, 2022
[39]     P Zhou, C Bai*, Jie Xia, SY Chen, “CMRDF: A Real-Time Food Alerting System Based on Multimodal Data”, IEEE Internet of Things Journal, vol. 9, no. 9, pp. 6335-6349, 2022
[40]     Y. Li, Y. Chen, K. Zhu, C. Bai and J. Zhang, An effective federated learning verification strategy and its applications for fault diagnosis in industrial IOT systems, IEEE Internet of Things Journal, vol. 9, no. 18, pp. 16835-16849, 15 Sept.15, 2022
[41]     C Bai, F Sun, J Zhang*, Y Song, SY Chen, “Rainformer: Features Extraction Balanced Network for Radar-Based Precipitation Nowcasting”, IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2022, Art no. 4023305, doi: 10.1109/LGRS.2022.3162882.
[42]     C Huang, C Bai*, S Chan, J Zhang, “MMSTN: A Multi‐Modal Spatial‐Temporal Network for Tropical Cyclone Short‐Term Prediction”, Geophysical Research Letters, 49 (4), e2021GL096898, 2022
[43]     Z Jia, S Yang, J Zhang, Y Zhang, Z Yang, K Xue, and C Bai. 2022. PRSOT: Precipitation Retrieval from Satellite Observations Based on Transformer Atmosphere 13, no. 12: 2048
[44]     C Bai, A Zheng, Y Huang, X Pan, N Chen, “Boosting Convolutional Image Captioning with Semantic Content and Visual Relationship”, Displays, Vol. 70, 2021, 102069
[45]     L Huang, C Bai*, Y Lu, S Zhang, SY Chen, “Unsupervised adversarial image retrieval”, MULTIMEDIA SYSTEMS, 28(2),pp. 673-685, 2022
[46]     J Zhang, Y Yang, Z Jia, C Bai*, “Superresolution imaging with a deep multipath network for the reconstruction of satellite cloud images”. Earth and Space Science, 8, e2020EA001559.
[47]     Y Feng, J Zheng, M Qin, C Bai, and J Zhang. 2021. 3D Octave and 2D Vanilla Mixed Convolutional Neural Network for Hyperspectral Image Classification with Limited Samples Remote Sensing 13, no. 21: 4407.
[48]     C. Bai, H LI, J. Zhang*, L. Huang, L. Zhang, “Unsupervised Adversarial Instance-level Image Retrieval, IEEE Transaction On Multimedia, vol. 23, pp. 2199-2207, 2021
[49]     顾婷菲，郝鹏翼，白琮，柳宁，“结合多通道注意力的糖尿病性视网膜病变分级”，中国图象图形学报，2021, 26 (7)
[50]     曾超，白琮*，马青，“基于对抗投影学习的跨模态哈希检索”，计算机辅助设计与图形学学报, 2021, 32(6)
[51]     X. Wang, L. Wang, X. Zhong, C. Bai, X. Huang, et al., PaI-Net: A Modified U-Net of Reducing Semantic Gap for Surgical Instrument Segmentation, IET Image Processing, vol.15, pp.2959–2969,2021.
[52]     J. Zheng, Y. Feng, C. Bai*, and J. Zhang, Hyperspectral image classification using mixed convolutions and covariance pooling, IEEE Trans. Geoscience Remote Sensing, vol. 59, no. 1, pp. 522-534, Jan. 2021（ESI High Cited Paper）
[53]     陈健，白琮*，马青，郝鹏翼，陈胜勇，“面向细粒度草图检索的对抗训练三元组网络”, 软件学报, 2020, 31(7), 1933-1942
[54]     张敏靖，白琮*，张敬林，郑建炜，“基于对抗和迁移学习的灾害天气卫星云图分类”，北京航空航天大学学报,2021,47(3)
[55]     黄远，白琮*，李宏凯，张敬林，陈胜勇，“基于条件生成对抗的图像描述生成方法”, 计算机辅助设计与图形学学报, 2020, 32(6)
[56]     C Bai, J Chen, Q Ma*, P Hao, SY Chen, “Cross-domain representation learning by domain-migration generative adversarial network for sketch based image retrieval”，Journal of Visual Communication and Image Representation, vol.71, 2020, 102835
[57]     P Hao, X Gao, Z Li, J Zhang*, F Wu, C Bai, “Multi-branch Fusion Network for Myocardial Infarction Screening from 12-lead ECG Images”, Computer Methods and Programs in Biomedicine, Volume 184, 105286, 2020
[58]     J Ren, Z Liu, X Zhou, C Bai, G Sun, “Co-saliency detection via integration of multi-layer convolutional features and inter-image propagation”, Neurocomputing, (371): 137-146, 2020
[59]     P Hao, S Chokuwa, X Xie, F Wu, J Wu, C Bai*, Skeletal bone age assessments for young children based on regression convolutional neural networks, Mathematical Biosciences and Engineering, 16(6): 6454-6466, 2019
[60]     Q. Ma, C. Bai*, J. Zhang, Z. Liu, and S. Chen, Supervised Learning Based Discrete Hashing for Image Retrieval, Pattern Recognition, vol.92,2019, pp.156–164
[61]     J. Zheng, K. Lou, X. Yang*, C. Bai*, and J. Tang, Weighted mixed-norm regularized regression for robust face identification, IEEE Trans. Neural Netw. Learn. Syst., vol. 30, no. 12, pp. 3788-3802, Dec. 2019.
[62]     C Bai, L Huang, X Pan, J Zheng, S Chen, Optimization of deep convolutional neural network for large scale image retrieval, Neurocomputing 303, 60-67, 2018
[63]     白琮，黄玲，陈佳楠，潘翔，陈胜勇， “面向大规模图像分类的深度卷积神经网络优化”, 软件学报, 2018,29(4):1029-1038.
[64]     C Bai, J Chen, L Huang, K Kpalma, S Chen, Saliency-based multi-feature modeling for semantic image retrieval, Journal of visual communication and image representation, vol. 50, 199-204, 2018
[65]     J Ren, Z Liu, X Zhou, G Sun, C Bai, Saliency integration driven by similar images, Journal of Visual Communication and Image Representation, vol. 50, 227-236, 2018
[66]     L Ye, Z Liu, L Li, L Shen, C Bai, Y Wang, Salient object segmentation via effective integration of saliency and objectness, IEEE Transactions on Multimedia 19 (8), 1742-1756, 2017
[67]     J Zhang, C Bai, JF Nezan, JG Cousin, Joint motion model for local stereo video-matching method, Optical Engineering 54 (12), 123108, 2015
[68]     C. Bai, J. Zhang, Z. Liu, and W. L. Zhao, K-means based histogram using multiresolution feature vectors for color texture database retrieval, Multimedia Tools Appl., vol. 74, no. 4, pp. 1469-1488, 2015.
[69]     W. Zou, C. Bai, K. Kpalma, and J. Ronsin, Online glocal transfer for automatic figure-ground segmentation, IEEE Trans. Image Process., vol. 23, no. 5, pp. 2109-2121, May 2014.
[70]     C. Bai, W. Zou, K. Kpalma, and J. Ronsin, Efficient colour texture image retrieval by combination of colour and texture features in wavelet domain, Electron. Lett., vol. 48, no. 23, pp. 1463-1465, 2012.

会议论文:
[1]        Q. Chen, Z. Zhang, H. Liu, J. Zhang*, C. Bai, “KFTD: Koopman-Fourier Time-Differentiable Network for Continuous Ocean Spatiotemporal Forecasting,” Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1, pp. 94-103, 2026
[2]        J. Shen, Q. Chen, R. Wang, C. Xu, J. Zhang*, C. Bai, F. Zhang, “KAN-FIF: Spline-Parameterized Lightweight Physics-based Tropical Cyclone Estimation on Meteorological Satellite,” Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1, pp. 2424-2433, 2026
[3]        J. Chen, S. Lu and C. Bai*, SAICR: Symmetric Alignment and Intra-Class Contrastive Refinement for Referring Image Segmentation, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2026, pp. 9677-9681
[4]        H. Yan, P. Mu, S. Zhang, Y. Zhu, J. Zhang, C. Bai*, “IDOL: Meeting Diverse Distribution Shifts with Prior Physics for Tropical Cyclone Multi-Task Estimation,” Advances in Neural Information Processing Systems (NeurIPS) 38, 2025
[5]        P. Ouyang, Q.Ma*, Z.Wang, C. Bai, “PMPGuard: Catching Pseudo-Matched Pairs in Remote Sensing Image–Text Retrieval,” Proceedings of the AAAI Conference on Artificial Intelligence, 40(10), 8197–8205, 2026.
[6]        P. Mu, Y. Zhu, S. Zhang, H. Yan, J. Zhang, C. Bai*, “iTCTSL: Interpretable Tropical Cyclone Track and Intensity Forecasting via Task Sensitive Learning,”Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings, 2026, pp. 1587-1596
[7]        C. Zheng, Q. Wang*, C. Bai, “Amodal-KAN: The First Look at Kolmogorov-Arnold Network for Amodal Instance Segmentation,” Proceedings of the 7th ACM International Conference on Multimedia in Asia (MMAsia '25). Association for Computing Machinery, New York, NY, USA, Article 113, 1–7
[8]        C. Huang, P. Mu, C. Bai*, and P. A. Watson, TCP-Diffusion: A multi-modal diffusion model for global tropical cyclone precipitation forecasting with change awareness, in Proc. Int. Conf. Mach. Learn. (ICML), vol. 267, pp. 25634–25653, 2025.
[9]        L. Luo, P. Mu*, and C. Bai, Physics-coupled frequency dynamic adaptation network for domain generalized underwater object detection, in Proc. ACM Int. Conf. Multimed. (MM), 2025, pp. 2284–2293.
[10]     Z. Wang, K. Ying, B. Xu, C. Wang, and C. Bai*, From swath to full-disc: Advancing precipitation retrieval with multimodal knowledge expansion, in Proc. ACM SIGKDD Conf. Knowl. Discov. Data Min. (KDD), Aug. 3–7, 2025.
[11]     Z. Lin, Z. Wang*, T. Qian, P. Mu, S. Chan, and C. Bai, NeighborRetr: Balancing hub centrality in cross-modal retrieval, in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 9263–9273, 2025.
[12]     S. Zhang, P. Mu, C. Huang, J. Zhang, and C. Bai*, TC-Diffuser: Bi-condition multi-modal diffusion for tropical cyclone forecasting, in Proc. AAAI Conf. Artif. Intell., vol. 39, no. 1, pp. 1120–1128, Apr. 2025.
[13]     C. Bai, Z. Lin, J. Zhang, and S. Chen, Dust-Mamba: An efficient dust storm detection network with multiple data sources, in Proc. AAAI Conf. Artif. Intell., vol. 39, no. 27, pp. 27813–27821, Apr. 2025.
[14]     Z. Zhang, Q. Chen, M. Xiong, S. Ding, Z. Su, X. Yao, C. Bai,et al., Zero-shot learning in industrial scenarios: New large-scale benchmark, challenges and baseline, in Proc. AAAI Conf. Artif. Intell., vol. 39, no. 10, pp. 10357–10366, Apr. 2025.
[15]     Y. Zhang, L. Luo, P. Mu*, and C. Bai, Prompt-UIE: A unified prompt-driven framework for underwater image enhancement, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), pp. 1–5, 2025.
[16]     Z. Wang, H. Zhang, and C. Bai*, PiCNet: Physics-infused convolution network for radar-based precipitation nowcasting, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), pp. 1–5, 2025.
[17]     Z. Wang, K. Huang, Z. Lin, and C. Bai*, Event-driven hybrid and cross-stage guide for video corpus moment retrieval, in Proc. Int. Conf. Multimed. Retr. (ICMR), pp. 1367–1376, 2025.
[18]     S. Wang, H. Peng, F. Wu, and C. Bai, Live on the hump: Self knowledge distillation via virtual teacher-students mutual learning, in Proc. ACM Int. Conf. Multimed. (MM), pp. 3028–3036, 2024.
[19]     H. Yan, P. Mu*, C. Huang, J. Zhang, and C. Bai, Phy-CoCo: Physical constraint-based correlation learning for tropical cyclone intensity and size estimation, in Proc. Eur. Conf. Artif. Intell. (ECAI), 2024, Art. no. 744.
[20]     P. Ouyang, J. Chen, Q. Ma*, Z. Wang, and C. Bai, Distinguishing visually similar images: Triplet contrastive learning framework for image-text retrieval, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Niagara Falls, ON, Canada, 2024, pp. 1–6.
[21]     J. Pan, Q. Ma, and C. Bai*, A prior instruction representation framework for remote sensing image-text retrieval, in Proc. ACM Int. Conf. Multimed. (MM), Ottawa, ON, Canada, Oct. 29–Nov. 3, 2023, pp. 1–10.
[22]     P. Mu, Z. Du, J. Liu*, and C. Bai, Little strokes fell great oaks: Boosting the hierarchical features for multi-exposure image fusion, in Proc. ACM Int. Conf. Multimed. (MM), Ottawa, ON, Canada, Oct. 29–Nov. 3, 2023, pp. 1–10.
[23]     P. Mu, H. Xu, Z. Liu, Z. Wang, S. Chan, and C. Bai*, A generalized physical-knowledge-guided dynamic model for underwater image enhancement, in Proc. ACM Int. Conf. Multimed. (MM), Ottawa, ON, Canada, Oct. 29–Nov. 3, 2023, pp. 1–10.
[24]     J. Pan, Q. Ma*, and C. Bai, Reducing semantic confusion: Scene-aware aggregation network for remote sensing cross-modal retrieval, in Proc. Int. Conf. Multimed. Retr. (ICMR), Thessaloniki, Greece, Jun. 12–15, 2023, pp. 1–10.
[25]     H. Cheng, C. Bai*, S. Chan, J. Zhang, and Y. Wu, MGTCF: Multi-generator tropical cyclone forecasting with heterogeneous meteorological data, in Proc. AAAI Conf. Artif. Intell., vol. 37, no. 4, pp. 5096–5104, 2023.
[26]     X. Wang, M. Shao, D. Guo, Y. Cui, X. Huang, M. Xia, C. Bai,Multi-stage aggregation transformer for medical image segmentation, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), 2023, pp. 1–5.
[27]     S. Chan, W. Wang, Z. Shao, C. Bai, SGPT: The secondary path guides the primary path in transformers for HOI detection, in Proc. IEEE Int. Conf. Robot. Autom. (ICRA), 2023, pp. 1–7.
[28]     D. Cai, P. Mu*, S. Chan, Z. Shao and C. Bai, Towards General and Fast Video Derain via Knowledge Distillation, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Brisbane, Australia, 2023, pp. 1949–1954.
[29]     P. Mu, J. Fang, H. Qian, and C. Bai*, Transmission and color-guided network for underwater image enhancement, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Brisbane, Australia, 2023, pp. 1337–1342.
[30]     Z. Liu, P. Mu*, H. Xu and C. Bai, Histogram-guided Video Colorization Structure with Spatial-Temporal Connection, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Brisbane, Australia, 2023, pp. 2579–2584.
[31]     S. Chan, J. Cui, Y. Wu, H. Wang* and C. Bai, Visible-Xray Cross-Modality Package Re-Identification, 2023 IEEE International Conference on Multimedia and Expo (ICME), Brisbane, Australia, 2023, pp. 2579-2584
[32]     P Mu, H Qian, and C Bai*. Structure-Inferred Bi-level Model for Underwater Image Enhancement. in Proc. ACM Int. Conf. Multimed. (MM), Lisboa, Portugal, 2022, pp. 2286–2295.
[33]     J. Chen, L. Zhang, Q. Wang, C. Bai and K. Kpalma, Intra-Modal Constraint Loss for Image-Text Retrieval, in Proc. IEEE Int. Conf. Image Process. (ICIP), 2022, pp. 4023–4027.
[34]     F. Sun, C. Bai*, Y. Song and J. Zhang, MMINR: Multi-frame-to-Multi-frame Inference with Noise Resistance for Precipitation Nowcasting with Radar, in Proc. Int. Conf. Pattern Recognit. (ICPR), 2022, pp. 97–103.
[35]     Z Zha, P Zhou, C Bai*, “Exploring Implicit and Explicit Relations with the Dual Relation-Aware Network for Image Captioning”. in Proc. Int. Conf. Multimed. Model., 2022, pp. 97–108.
[36]     T Han, P Hao, C Bai, “Structural and Temporal Learning for Dropout Prediction in MOOCs”, in Proc. Int. Conf. Knowl. Sci., Eng. Manage. (KSEM), 2022, pp. 1–15.
[37]     J. Dang, Y. Liu, W. Shuai, C. Bai and S. Tian, Edge-preserving Image Smoothing via Counting-weighted Total Variation, in Proc. IEEE Int. Workshop Multimed. Signal Process. (MMSP), 2022, pp. 1–6.
[38]     K. Ying, Z. Wang, C. Bai and P. Zhou, ISDA: Position-Aware Instance Segmentation with Deformable Attention, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), 2022, pp. 2619–2623
[39]     D Wu, P Hao, Y Zheng, T Han, C Bai, “Classmates Enhanced Diversity-Self-Attention Network for Dropout Prediction in MOOCs”, in Proc. Int. Conf. Neural Inf. Process. (ICONIP), 2021, pp. 609–620.
[40]     B Ye, S Mao, P Hao, W Chen, C Bai. Community Enhanced Course Concept Recommendation in MOOCs with Multiple Entities. in Proc. Int. Conf. Knowl. Sci., Eng. Manage. (KSEM), 2021, pp. 1–12. 3
[41]     P Hao, X Xie, T Han, and C Bai. Overlap classification mechanism for skeletal bone age assessment. in Proc. ACM Int. Conf. Multimed. Asia (MMAsia), 2021, pp. 1–7.
[42]     C Huang, S Chan, C Bai*, W Ding, J Zhang*. Tropical Cyclones Tracking Based on Satellite Cloud Images: Database and Comprehensive Study. in Proc. Int. Conf. Multimed. Model., 2021, pp. 1–12.
[43]     P Zhou, C Bai*, K Ying, J Xia and Lixin Huang, “ RWMF: A Real-World Multimodal Foodlog Database”, in Proc. Int. Conf. Pattern Recognit. (ICPR), 2021, pp. 962–968.
[44]     J Chen, L Zhang, C Bai* and K Kpalma, Review of Recent Deep Learning Based Methods for Image-Text Retrieval, in Proc. IEEE Conf. Multimed. Inf. Process. Retr. (MIPR), 2020, pp. 167–172.
[45]     C Bai, C Zeng, Q Ma, J Zhang, and SY Chen, “Deep Adversarial Discrete Hashing for Cross-Modal Retrieval”, in Proc. Int. Conf. Multimed. Retr. (ICMR), 2020, pp. 525–531.
[46]     H Li, C Bai*, L Huang, Y Jiang, S Chen, “Instance Image Retrieval with Generative Adversarial Training”, in Proc. Int. Conf. Multimed. Model., 2020, pp. 381–392.
[47]     P Hao, E Manhando, T Ye, C Bai*, “Video Summarization based on Sparse Subspace Clustering with Automatically Estimated Number of Clusters”, in Proc. ACM Multimed. Asia, 2019, pp. 1–6.
[48]     Y Fu, C Wang, Y Fu, YX Wang, C Bai, X Xue, YG Jiang, “Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent”, in Proc. ACM Int. Conf. Multimed. (MM), 2019, pp. 411–419.
[49]     W Zhang, Q Zhang, J Cheng, C Bai, P Hao, “End-to-End Panoptic Segmentation with Pixel-Level Non-Overlapping Embedding”, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), 2019, pp. 976–981.
[50]    P Zhou, C Bai*, Jie Xia, ZJUTCVR Team at ImageCLEFlifelog2019 Lifelog Moment Retrieval Task, in Proc. Conf. Labs Evaluation Forum (CLEF), 2019, pp. 1–8.
[51]    L Huang, C Bai*, Y Lu, S Chen, Q Tian. Adversarial Learning for Content-Based Image Retrieval, in Proc. IEEE Conf. Multimed. Inf. Process. Retr. (MIPR), 2019, pp. 1–6.
[52]    C Bai, J Chen, Q Ma, Z Liu, S Chen. Deep Residual Net Based Compact Feature Representation for Image Retrieval, in Proc. Pacific Rim Conf. Multimed. (PCM), 2018, pp. 737–747.
[53]    P Hao, Y Chen, S Chokuwa, F Wu, C Bai*. Skeletal Bone Age Assessment Based on Deep Convolutional Neural Networks, in Proc. Pacific Rim Conf. Multimed. (PCM), 2018, pp. 408–417.
[54]    J Chen, C Bai*, L Huang, Z Liu, S Chen, Visual Saliency Fusion Based Multi-feature for Semantic Image Retrieval, in Proc. CCF Chin. Conf. Comput. Vis., 2017, pp. 126–136.
[55]    P Hao, E Manhando, C Bai*, Y Huang, Hierarchical Tree Representation Based Face Clustering for Video Retrieval, in Proc. Pacific Rim Conf. Multimed. (PCM), 2017, pp. 347–357.
[56]    JW Zheng, P Yang, S Fang, C Bai*, Block Cluster Based Dictionary Learning for Image De-noising and De-blurring, in Proc. Pacific Rim Conf. Multimed. (PCM), 2017, pp. 818–826.
[57]    C Bai, J Chen, J Zhang, K Kpalma, J Ronsin. Sparse representation based histogram in color texture retrieval, in Proc. Pacific Rim Conf. Multimed. (PCM), 2016, pp. 55–64.
[58]    H Song, Z Liu, H Du, G Sun, C Bai, Saliency detection for RGBD images, in Proc. Int. Conf. Internet Multimed. Comput. Service, 2015, pp. 72–77.
[59]    W Lu, C Bai, K Kpalma, J Ronsin. Multi-object tracking using sparse representation, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), 2013, pp. 2312–2316.
[60]    K Kpalma, C Bai, MMezouar, KBelloulata, N Taleb, LBelhallouche, D Boukerroui. A new histogram-based descriptor for images retrieval from databases, in Advances Intell. Anal. Med. Data Decision Support Syst., 2013, pp. 101–112.
[61]    C Bai, K Kpalma, J Ronsin. Color textured image retrieval by combining texture and color features, in Proc. Eur. Signal Process. Conf. (EUSIPCO), 2012, pp. 170–174.
[62]    C Bai, K Kpalma, J Ronsin. Analysis of histogram descriptor for image retrieval in DCT domain, in Intell. Interactive Multimed. Syst. Services, 2011, pp. 227–235.

授权发明专利：
1.   201910467500.9: 一种基于条件生成对抗网络的图像描述生成方法(第一发明人)
2.   201810154813.4: 一种无监督对抗训练的基于内容的图像检索方法(第一发明人)
3.   201910467500.9: 一种基于条件生成对抗网络的图像描述生成方法(第一发明人)
4.   201811538815.X: 一种基于深度哈希学习优化的图像检索方法（第二发明人）
5.   201710411668.9：一种面向图像分类的深度卷积神经网络优化方法（第一发明人）
6.       201710411660.2:一种基于深度学习的快速图像分类方法(第一发明人)
7.       200810200309.X: 在彩色视频图像中提取车辆的方法 (第一发明人)
200810200310.2: 复杂场景中单个人脸的实时跟踪方法 (第二发明人)

获奖与荣誉

综合获奖情况：
2026.05    浙江工业大学校级优秀博士学位论文（指导教师）
2026.05    浙江工业大学校级优秀硕士学位论文（指导教师）
2026.01    浙江工业大学计算机学院十佳教师
2024.01    浙江工业大学计算机学院优秀教师
2023.01    浙江工业大学计算机学院优秀教师
2022.06    浙江工业大学校级优秀硕士学位论文指导教师
2021.01    浙江工业大学研究生“我心目中的好导师”
2020.10    浙江工业大学毕业设计优秀指导教师
2020.01    浙江工业大学计算机学院优秀教师
2020.01    浙江工业大学本科教学优课优酬奖
2019.06    浙江工业大学校级优秀硕士学位论文指导教师
2019.01    浙江工业大学计算机学院十佳党员
2019.01    浙江工业大学本科教学优课优酬奖
2018.10 浙江工业大学优秀班主任
2018.01    浙江工业大学本科教学优课优酬奖
2017.10    浙江工业大学毕业设计优秀指导教师
2017.01    浙江工业大学计算机学院十佳党员
2016.01    浙江工业大学计算机学院优秀教师
2015.06    浙江工业大学计算机学院青年教师讲课比赛十佳
2009.09 国家留学基金委国家公派留学生奖学金 （UT-INSA项目）
2008.12     上海市优秀学生
2005.02     山东农业大学优秀班主任
2007.05    上海大学社区优秀学生
2007.07    上海大学优秀研究生党务工作者
2007.12    上海大学优秀学生

学术获奖：
2024.10 中国计算机学会技术发明二等奖
2023.12      中国图象图形学学会科技进步二等奖
2023.08 中国多媒体大会最佳论文奖
2020.09 中国多媒体大会最佳墙报奖
2019.10 IFTC 最佳论文奖
2019.08 中国多媒体大会优秀论文奖
2018.09 Pacific Rim Conference on Multimedia （CCF C类会议）最佳审稿人奖
2017.08      中国多媒体大会最佳Poster奖

个人简介

白琮，博士，浙江工业大学计算机科学与技术学院，教授，博士生导师。学院党委委员，院人才办主任，计算机视觉研究所执行所长。斯坦福大学全球前2%顶尖科学家入选者。研究方向为计算机视觉与智慧气象。主持国自然联合重点、面上、青年、科技部重点研发计划课题等省部级以上项目10项；在国内外知名学术期刊或会议上发表论文100余篇，包括Nature Communications、NeurIPS、ICML、AAAI、KDD、ACM MM、IEEE TGRS, IEEE TNNLS, IEEE TCBY, IEEE TMM, IEEE GRSL、GRL等领域内的权威期刊或会议等，其中ESI高被引论文1篇,《软件学报》2019年和2020年高影响力论文各1篇。曾5次获得国际/国内知名学术会议的最佳/优秀论文奖。担任Elsevier旗下SCI期刊《Displays》的编委会委员，IEEE 《Journal of Selected Topics in Applied Earth Observations and Remote Sensing》副编辑。

研究工作的数据及源代码请见：https://github.com/Zjut-MultimediaPlus

欢迎本校的优秀本科生进组进行科研实习！希望你四六级已过，绩点3.8以上。
请先通过邮件或电话（659417）进行联系。

欢迎27年保研学生随时邮件联系。
27年的考研学生，请出分后再联系。
招收:硕士生、博士生
招聘: 博士后，青年教师。

教育经历：
2009.9~2013.3 法国雷恩国立应用科学学院（INSA Rennes）信号与图像处理专业    工学博士
2006.9~2009.3 上海大学                                电路与系统            工学硕士
1999.9~2003.7 山东大学                               电子信息工程          工学学士

学术兼职：
教育部高等学校电子信息类专业教学指导委员会协作委员
中国气象学会气象人工智能专委会副秘书长
中国计算机学会高级会员、多媒体专委会委员
中国图形图像学会会员、多媒体专委会委员
Displays （An Elsevier journal indexed by SCI） Editorial Board Member
程序委员会委员：ICML 2025, KDD 2025，ECAI 2024,2025 IJCAI 2024, ACMMM Asia 2019， ACM MM 2018，2022，2023，2024，2025 MIPR 2020, 2018, PCM 2018
权威期刊审稿人：IEEE TNNLS，IEEE TMM， IEEE TAI,IEEE TBD,IEEE TGRS,IEEE TKDE等

育人成果

实验室传帮带氛围好。能保证充分的指导。
实验室不要求打卡，看实际科研进展！
毕业要求完成之后，鼓励有就业意向的学生去企业实习。

2025级硕士生：陈睿、黄天乐、杨梁（与产思贤副教授共同指导）、吴文强（与穆攀副教授共同指导）、洪升（与王铮博士共同指导）、张司扬（与马青副教授共同指导）、林佳杰（与王铮博士共同指导）、洪超（与穆攀副教授共同指导）、方正（与王琼博士共同指导）
2024级硕士生：朱宇超（与穆攀副教授共同指导）、卢珂宇（与马青副教授共同指导）、卢思宇、骆霖轩（研究生国奖，与穆攀副教授共同指导）、汪漾（与王琼博士共同指导）、李想（与穆攀副教授共同指导）、严明、陈浩然（与王铮博士共同指导）、洪滔（与王铮博士共同指导）
2023级硕士生：王泽弘（与产思贤副教授共同指导，金华电信）、倪鸿泽（与马青副教授共同指导，宁波选调生）、张燕玲（研究生国奖，与穆攀博士共同指导,浙江宇视）、黄堃（与王铮博士共同指导，硕博连读）、郑策（与王琼博士共同指导）、赵政棋（硕博连读，与郝鹏翼副教授共同指导）、胡至盈（与王琼博士共同指导，招银科技）、应楷（研究生国奖，达摩院，与王铮博士共同指导）、林仲豪（研究生国奖，浙商银行）、王奕欣（与穆攀博士共同指导，美团）、汪琰（与王琼博士共同指导，杭州润江汇科技）
2022级硕士生：严涵婷（研究生国奖，硕博连读），潘建成(清华联合培养,研究生国奖，两次！保加利亚INSAIT访问学生)，何波贤（浅海科技），蒋悦（浙江省妇保），张晗奕（萧山农商银行），方靖（杭州移动）
2021级硕士生：欧阳鹏翔(硕博连读，复旦联合培养)，孙泓迪（百度），赵董孝元（研究生国奖，瓴羊科技），蔡志鹏（浙商银行），潘莹莹（省直属事业单位）
2020级硕士生：郑安琪（研究生国奖，浙商银行），孙丰（研究生国奖，快手），张晓青（浙江水利水电学院），李雅丽（宁波大学）
2019级硕士生：顾婷菲（浙商银行），曾超（研究生国奖，校优秀硕士论文，Zoom），张敏靖（研究生国奖，武康交警），黄诚（研究生国奖，硕博连读）
2018级硕士生：李宏凯（研究生国奖，腾讯/网易），黄远（浙商银行）
2017级硕士生：陈健（研究生国奖，浙商银行）
2016级硕士生：黄玲（研究生国奖，校优秀硕士论文，浙工大研究生国际化培养资助美国交流半年，法国UTC读博，CSC全额奖学金，NUS博后在读）
2015级硕士生：陈佳楠（研究生国奖，法国驻华使馆访学资助法国访学一个月，法国INSA Rennes读博，CSC全额奖学金）
2014级硕士生：邹祎杰（爱立信）

2026级博士生：黄堃（硕博连读）、王骋誉（本硕博一体化）
2025级博士生：严涵婷 (硕博连读)
2024级博士生：姜子浔（本硕博一体化）、欧阳鹏翔（硕博连读，复旦大学联合培养）
2023级博士生：张诗琪（本硕博一体化，英国布里斯托大学联合培养）
2022级博士生：陶健（硕博连读，本校博后）
2021级博士生：黄诚（研究生国奖，英国布里斯托大学联合培养，台州学院教师）
2013级博士生：马青（浙江工业大学教师）

指导过的本科生：

2022级：卢美伊（保研浙江大学）
2021级：林曾荣（保研同济大学）、周彬加（保研浙江大学）
2020级：许函宁（保研复旦大学）、杜智颖（保研复旦大学）、姜子浔（保送本校本硕博一体化）

2019级：张诗琪（保送本校本硕博一体化）

2018级：查志伟（保研复旦大学），徐逸伦（保研浙江大学）

2017级：周鹏飞（本科生国奖，保研中科院计算所）

2016级：朱赟（保研浙江大学），郑安琪（保研本校），许柏舟（ UC San Diego 读研)

2015级：路彭悦（本科生国奖，保研华东师范大学）

2014级：傅宇倩（本科生国奖，保研复旦大学）

做为指导教师指导过以下课外科技：

2023年第十八届“挑战杯”全国大学生课外学术科技作品竞赛一等奖

2022年浙江省新苗人才计划立项并顺利结题

2020年第十一届中国大学生服务外包创新创业大赛二等奖

2019年国家级大学生创新创业训练计划立项并顺利结题

2019年浙江省新苗人才计划立项并顺利结题

2016年浙江省大学生服务外包创新应用大赛省二等奖

2015年国家级大学生创新创业训练计划立项并顺利结题

科研项目

10. 浙江省自然科学基金杰出青年基金延续资助项目，多源数据驱动的灾害天气智能预报， 2025.1~2027.12，主持
9. 国家自然科学基金区域创新发展联合基金重点项目，白内障显微手术机器人影像处理与自主导航关键问题研究，2021.1-2024.12，主持。
8. 浙江省自然科学基金杰出青年基金项目，多媒体内容理解与检索，2021.1~2023.12，主持。
7. 国家自然科学基金面上项目，面向图像检索的无监督深度特征学习方法研究， 2020.01~2023.12，主持。
6. 中国气象服务协会气象科技创新平台面上项目，多卫星降水联合智能反演技术研究，2023年5月-2024年4月，主持
5. 国家自然科学基金辽宁联合基金重点项目，多元医学影像数据智能分析交换技术与系统应用，2020.01~2023.12，合作单位负责人。
4. 国家重点研发计划政府间国际科技创新合作重点专项，基于气象卫星影像的典型灾害天气智能解译关键技术研究与应用示范，2020.01~2022.12，合作单位负责人。
3．国家自然科学基金青年科学基金项目，基于内容的图像检索中语义特征表示及语义融合,2016.01~2018.12,主持。
2. 浙江省自然科学基金面上项目，面向语义图像检索的深度卷积神经网络优化研究， 2018.01~2020.12，主持。
1. 浙江省自然科学基金面上项目，基于显著性模型的语义图像检索，2015.01~2017.12，主持。

科研成果

文章与专利列表
期
期刊文章：
[1]        C. Bai, H. Yan, P. Mu*, C. Huang, J. Zhang, S. Shu, “Estimating tropical cyclone maximum wind speed and radius using a multimodal hybrid guided network,” Journal of Geophysical Research: Atmospheres, 131, e2025JD044721, 2026
[2]        J. Tao, S. Chan*, C. Bai, X. Zhang, “GVLTrack: Global Vision-Language Tracking with Multi-Stage Modal Fusion,” ACM Trans. Multimedia Comput. Commun. 22, 5, Article 138, 2026
[3]        Zhang, Z., Li, G., Zhang, H. et al. A Novel Dataset and Lightweight Distillation Baseline for Highlight Transparent Object Detection. Int J Comput Vis134, 157, 2026
[4]        H. Li, D. Li, J. Zhang, Y. Zheng, C. Bai, “3D-MolGL: A multimodal framework for integrating 3D molecular graphs into language models,” Expert Systems with Applications, 312, 2026,
[5]        S. Chan, J. Zheng, S. Li, P. Mu and C. Bai*, Adaptive Feature Fusion based on Double Head Framework for Small Object Detection in Highway Scenarios, IEEE Transactions on Artificial Intelligence, doi: 10.1109/TAI.2026.3664780
[6]        Q. Ma, Y. Jiang, C. Bai*, “Boundary mutual information hashing for cross-modal retrieval,” Pattern Recognition, vol. 172, Art no.112658, 2026
[7]        Q. Wang, Y. Wang, H. Sun, Y. Feng, Z. Dong* and C. Bai*, Refocal Loss in Transformer for Long-Tailed Multi-Granularity Cataract Classification, in IEEE Journal of Biomedical and Health Informatics, vol. 30, no. 1, pp. 617-630, Jan. 2026
[8]        Z. Zhao, C. Bai, P. Hao*, “ GraphVSum: graph guided multimodal video summarization, ”Multimedia Systems vol. 32, Art no.38, 2026
[9]        C. Huang, P. Mu, J. Zhang, and C. Bai*, Benchmark dataset and deep learning method for global tropical cyclone forecasting, Nature Communications, vol. 16, p. 5923, 2025.
[10]     Z. Wang, B. He, C. Wang, B. Xu, and C. Bai*, Precipitation retrieval integrating multiple satellite observations: A dataset and a framework, IEEE Trans. Geoscience Remote Sensing, vol. 63, pp. 1-15, 2025, Art no. 4107015.
[11]     J. Chen, J. Wang, S. Chan*, and C. Bai, DBFA-TSNet: A Three-Stage Building Extraction Network Based on Dual-Branch Fusion and Adaptive Enhancement, IEEE Transactions on Geoscience and Remote Sensing, vol. 63, 2025, pp. 1–13.
[12]     P. Ouyang, Q. Ma*, and C. Bai, Sparse Information Perception Network for Remote Sensing Cross-Modal Retrieval, IEEE Transactions on Geoscience and Remote Sensing, vol. 63, 2025, Art. no. 5521715.
[13]     X. Wang, Y. Zhu, Y. Cui, X. Huang, D. Guo, P. Mu, and C. Bai, Lightweight Multi-Stage Aggregation Transformer for Robust Medical Image Segmentation, Medical Image Analysis, 2025, Art. no. 103569
[14]     J. Tao, S. Chan, Z. Shi, C. Bai, and S. Chen, FocTrack: Focus Attention for Visual Tracking, Pattern Recognition, vol. 160, 2025, Art. no. 111128.
[15]     L. Wang, Z. Wang, W. Hu, and C. Bai, RainHZLet: Hybrid High-Low Frequency and Cross-Scale Network for Precipitation Nowcasting, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2025.
[16]     P. Hao, C. Wu, and C. Bai*, HHGraphSum: Hierarchical Heterogeneous Graph Learning for Extractive Document Summarization, Displays, vol. 86, 2025, Art. no. 102884.
[17]     S. Chan, J. Cui, Y. Wu, H. Wang, and C. Bai, Multi-Scale Feature Correspondence and Restriction Mechanism for Visible X-Ray Baggage Re-Identification, Multimedia Systems, vol. 30, no. 6, 2024, Art. no. 315
[18]     S. Chan, X. Zeng, X. Wang, J. Hu, and C. Bai, Auxiliary Feature Fusion and Noise Suppression for HOI Detection, ACM Transactions on Multimedia Computing, Communications and Applications, vol. 20, no. 10, 2024
[19]     H. Zhang, Q. Miao, S. Li, C. Wang, S. Chan, J. Hu, and C. Bai, An Efficient and Real-Time Steel Surface Defect Detection Method Based on Single-Stage Detection Algorithm, Multimedia Tools and Applications, 2024
[20]     S. Chan, W. Wang, Z. Shao, Z. Wang, and C. Bai, Region Mining and Refined Query Improved HOI Detection in Transformer, IEEE Transactions on Emerging Topics in Computational Intelligence, 2024
[21]     Q. Wang, H. Sun, Y. Feng, Z. Dong*, and C. Bai, MGCNet: Multi-Granularity Cataract Classification Using Denoising Diffusion Probabilistic Model, Displays, vol. 83, 2024, Art. no. 102716
[22]     P. Hao, S. Liu, and C. Bai*, Intent-Aware Graph-Level Embedding Learning Based Recommendation, Journal of Computer Science and Technology, vol. 39, no.5, 2024, pp.1138–1152.
[23]     Q. Ma, J. Pan, and C. Bai*, Direction-Oriented Visual–Semantic Embedding Model for Remote Sensing Image–Text Retrieval, IEEE Transactions on Geoscience and Remote Sensing, vol.62, 2024, Art. no.4704014.
[24]     C. Bai, C. Zeng, Q. Ma, and J. Zhang*, Graph Convolutional Network Discrete Hashing for Cross-Modal Retrieval, IEEE Transactions on Neural Networks and Learning Systems, vol.35, no.4, 2024, pp.4756–4767.
[25]     P. Zhou, K. Ying, Z. Wang, D. Guo, and C. Bai*, Self-Supervised Enhancement for Named Entity Disambiguation via Multimodal Graph Convolution, IEEE Transactions on Neural Networks and Learning Systems, vol.35, no.1,2024, pp.231–245.
[26]     S. Chan, W. Meng, C. Bai*, J. Hu*, and S. Chen, Diverse-Feature Collaborative Progressive Learning for Visible-Infrared Person Re-Identification, IEEE Transactions on Industrial Informatics, vol.20, no.5,2024, pp.7754–7763.
[27]     S. Chan, M. Yu, Z. Chen*, J. Mao, and C. Bai, Regional Contextual Information Modeling for Small Object Detection on Highways, IEEE Transactions on Instrumentation and Measurement, vol.72,2023, Art. no.2531613.
[28]     D. Zhao, Q. Wang*, J. Zhang, and C. Bai, Mine Diversified Contents of Multispectral Cloud Images Along With Geographical Information for Multilabel Classification, IEEE Transactions on Geoscience and Remote Sensing, vol.61,2023, pp.1–15.
[29]     C. Bai, Z. Cai, X. Yin*, and J. Zhang, LSDSSIMR: Large-Scale Dust Storm Database Based on Satellite Images and Meteorological Reanalysis Data, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol.16,2023, pp.10121–10131.
[30]     X. Zhang, C. Bai*, K. Kpalma, OMCBIR: Offline mobile content-based image retrieval with lightweight CNN optimization, Displays, Vol. 76, 2023, 102355
[31]     A. Zheng, S. Zheng, C. Bai, D. Chen*. Triple-level relationship enhanced transformer for image captioning. Multimedia Systems, Vol.29, pp. 1955–1966, 2023
[32]     P. Hao, Y. Li, C. Bai*. Meta-relationship for course recommendation in MOOCs. Multimedia Systems, Vol. 29, 235–246, 2023
[33]     C. Bai, M. Zhang, J. Zhang*, J. Zheng, and S. Chen, LSCIDMR: Large-Scale Satellite Cloud Image Database for Meteorological Research, IEEE Transactions on Cybernetics, vol.52, no.11, pp.12538–12550, 2022.
[34]     C. Zhu, X. Wang, SY. Chen, Zhongzhao Teng, C. Baiet al.Complex carotid artery segmentation in multi-contrast MR sequences by improved optimal surface graph cuts based on flow line learning. Medical & Biological Engineering & Computing 60, 2693–2706 (2022)
[35]     X. Wang, Y. Yuan, D. Guo, Y. Cui, M. Xia, Z. Wang, C. Bai and S. Chen, “SSA-Net: Spatial self-attention network for COVID-19 pneumonia infection   segmentation with semi-supervised few-shot learning” , Medical Image Analysis 79: 102459, 2022
[36]     S. Chan, J. Tao, X. Zhou, C. Bai, and X. Zhang, Siamese Implicit Region Proposal Network With Compound Attention for Visual Tracking, IEEE Transactions on Image Processing, vol.31,2022, pp.1882–1894.
[37]     X. Wang, L. Wang, Y. Sheng, C. Zhu, N. Jiang, C. Bai, M. Xia, Z. Shao, Z. Gu, and X. Huang, Automatic and Accurate Segmentation of Peripherally Inserted Central Catheter (PICC) From Chest X-Rays Using Multi-Stage Attention-Guided Learning, Neurocomputing, vol.482,2022, pp.82–97.
[38]     C. Bai, D. Zhao, M. Zhang and J. Zhang, Multimodal Information Fusion for Weather Systems and Clouds Identification From Satellite Images, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 15, pp. 7333-7345, 2022
[39]     P Zhou, C Bai*, Jie Xia, SY Chen, “CMRDF: A Real-Time Food Alerting System Based on Multimodal Data”, IEEE Internet of Things Journal, vol. 9, no. 9, pp. 6335-6349, 2022
[40]     Y. Li, Y. Chen, K. Zhu, C. Bai and J. Zhang, An effective federated learning verification strategy and its applications for fault diagnosis in industrial IOT systems, IEEE Internet of Things Journal, vol. 9, no. 18, pp. 16835-16849, 15 Sept.15, 2022
[41]     C Bai, F Sun, J Zhang*, Y Song, SY Chen, “Rainformer: Features Extraction Balanced Network for Radar-Based Precipitation Nowcasting”, IEEE Geoscience and Remote Sensing Letters, vol. 19, pp. 1-5, 2022, Art no. 4023305, doi: 10.1109/LGRS.2022.3162882.
[42]     C Huang, C Bai*, S Chan, J Zhang, “MMSTN: A Multi‐Modal Spatial‐Temporal Network for Tropical Cyclone Short‐Term Prediction”, Geophysical Research Letters, 49 (4), e2021GL096898, 2022
[43]     Z Jia, S Yang, J Zhang, Y Zhang, Z Yang, K Xue, and C Bai. 2022. PRSOT: Precipitation Retrieval from Satellite Observations Based on Transformer Atmosphere 13, no. 12: 2048
[44]     C Bai, A Zheng, Y Huang, X Pan, N Chen, “Boosting Convolutional Image Captioning with Semantic Content and Visual Relationship”, Displays, Vol. 70, 2021, 102069
[45]     L Huang, C Bai*, Y Lu, S Zhang, SY Chen, “Unsupervised adversarial image retrieval”, MULTIMEDIA SYSTEMS, 28(2),pp. 673-685, 2022
[46]     J Zhang, Y Yang, Z Jia, C Bai*, “Superresolution imaging with a deep multipath network for the reconstruction of satellite cloud images”. Earth and Space Science, 8, e2020EA001559.
[47]     Y Feng, J Zheng, M Qin, C Bai, and J Zhang. 2021. 3D Octave and 2D Vanilla Mixed Convolutional Neural Network for Hyperspectral Image Classification with Limited Samples Remote Sensing 13, no. 21: 4407.
[48]     C. Bai, H LI, J. Zhang*, L. Huang, L. Zhang, “Unsupervised Adversarial Instance-level Image Retrieval, IEEE Transaction On Multimedia, vol. 23, pp. 2199-2207, 2021
[49]     顾婷菲，郝鹏翼，白琮，柳宁，“结合多通道注意力的糖尿病性视网膜病变分级”，中国图象图形学报，2021, 26 (7)
[50]     曾超，白琮*，马青，“基于对抗投影学习的跨模态哈希检索”，计算机辅助设计与图形学学报, 2021, 32(6)
[51]     X. Wang, L. Wang, X. Zhong, C. Bai, X. Huang, et al., PaI-Net: A Modified U-Net of Reducing Semantic Gap for Surgical Instrument Segmentation, IET Image Processing, vol.15, pp.2959–2969,2021.
[52]     J. Zheng, Y. Feng, C. Bai*, and J. Zhang, Hyperspectral image classification using mixed convolutions and covariance pooling, IEEE Trans. Geoscience Remote Sensing, vol. 59, no. 1, pp. 522-534, Jan. 2021（ESI High Cited Paper）
[53]     陈健，白琮*，马青，郝鹏翼，陈胜勇，“面向细粒度草图检索的对抗训练三元组网络”, 软件学报, 2020, 31(7), 1933-1942
[54]     张敏靖，白琮*，张敬林，郑建炜，“基于对抗和迁移学习的灾害天气卫星云图分类”，北京航空航天大学学报,2021,47(3)
[55]     黄远，白琮*，李宏凯，张敬林，陈胜勇，“基于条件生成对抗的图像描述生成方法”, 计算机辅助设计与图形学学报, 2020, 32(6)
[56]     C Bai, J Chen, Q Ma*, P Hao, SY Chen, “Cross-domain representation learning by domain-migration generative adversarial network for sketch based image retrieval”，Journal of Visual Communication and Image Representation, vol.71, 2020, 102835
[57]     P Hao, X Gao, Z Li, J Zhang*, F Wu, C Bai, “Multi-branch Fusion Network for Myocardial Infarction Screening from 12-lead ECG Images”, Computer Methods and Programs in Biomedicine, Volume 184, 105286, 2020
[58]     J Ren, Z Liu, X Zhou, C Bai, G Sun, “Co-saliency detection via integration of multi-layer convolutional features and inter-image propagation”, Neurocomputing, (371): 137-146, 2020
[59]     P Hao, S Chokuwa, X Xie, F Wu, J Wu, C Bai*, Skeletal bone age assessments for young children based on regression convolutional neural networks, Mathematical Biosciences and Engineering, 16(6): 6454-6466, 2019
[60]     Q. Ma, C. Bai*, J. Zhang, Z. Liu, and S. Chen, Supervised Learning Based Discrete Hashing for Image Retrieval, Pattern Recognition, vol.92,2019, pp.156–164
[61]     J. Zheng, K. Lou, X. Yang*, C. Bai*, and J. Tang, Weighted mixed-norm regularized regression for robust face identification, IEEE Trans. Neural Netw. Learn. Syst., vol. 30, no. 12, pp. 3788-3802, Dec. 2019.
[62]     C Bai, L Huang, X Pan, J Zheng, S Chen, Optimization of deep convolutional neural network for large scale image retrieval, Neurocomputing 303, 60-67, 2018
[63]     白琮，黄玲，陈佳楠，潘翔，陈胜勇， “面向大规模图像分类的深度卷积神经网络优化”, 软件学报, 2018,29(4):1029-1038.
[64]     C Bai, J Chen, L Huang, K Kpalma, S Chen, Saliency-based multi-feature modeling for semantic image retrieval, Journal of visual communication and image representation, vol. 50, 199-204, 2018
[65]     J Ren, Z Liu, X Zhou, G Sun, C Bai, Saliency integration driven by similar images, Journal of Visual Communication and Image Representation, vol. 50, 227-236, 2018
[66]     L Ye, Z Liu, L Li, L Shen, C Bai, Y Wang, Salient object segmentation via effective integration of saliency and objectness, IEEE Transactions on Multimedia 19 (8), 1742-1756, 2017
[67]     J Zhang, C Bai, JF Nezan, JG Cousin, Joint motion model for local stereo video-matching method, Optical Engineering 54 (12), 123108, 2015
[68]     C. Bai, J. Zhang, Z. Liu, and W. L. Zhao, K-means based histogram using multiresolution feature vectors for color texture database retrieval, Multimedia Tools Appl., vol. 74, no. 4, pp. 1469-1488, 2015.
[69]     W. Zou, C. Bai, K. Kpalma, and J. Ronsin, Online glocal transfer for automatic figure-ground segmentation, IEEE Trans. Image Process., vol. 23, no. 5, pp. 2109-2121, May 2014.
[70]     C. Bai, W. Zou, K. Kpalma, and J. Ronsin, Efficient colour texture image retrieval by combination of colour and texture features in wavelet domain, Electron. Lett., vol. 48, no. 23, pp. 1463-1465, 2012.

会议论文:
[1]        Q. Chen, Z. Zhang, H. Liu, J. Zhang*, C. Bai, “KFTD: Koopman-Fourier Time-Differentiable Network for Continuous Ocean Spatiotemporal Forecasting,” Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1, pp. 94-103, 2026
[2]        J. Shen, Q. Chen, R. Wang, C. Xu, J. Zhang*, C. Bai, F. Zhang, “KAN-FIF: Spline-Parameterized Lightweight Physics-based Tropical Cyclone Estimation on Meteorological Satellite,” Proceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 1, pp. 2424-2433, 2026
[3]        J. Chen, S. Lu and C. Bai*, SAICR: Symmetric Alignment and Intra-Class Contrastive Refinement for Referring Image Segmentation, IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, Spain, 2026, pp. 9677-9681
[4]        H. Yan, P. Mu, S. Zhang, Y. Zhu, J. Zhang, C. Bai*, “IDOL: Meeting Diverse Distribution Shifts with Prior Physics for Tropical Cyclone Multi-Task Estimation,” Advances in Neural Information Processing Systems (NeurIPS) 38, 2025
[5]        P. Ouyang, Q.Ma*, Z.Wang, C. Bai, “PMPGuard: Catching Pseudo-Matched Pairs in Remote Sensing Image–Text Retrieval,” Proceedings of the AAAI Conference on Artificial Intelligence, 40(10), 8197–8205, 2026.
[6]        P. Mu, Y. Zhu, S. Zhang, H. Yan, J. Zhang, C. Bai*, “iTCTSL: Interpretable Tropical Cyclone Track and Intensity Forecasting via Task Sensitive Learning,”Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Findings, 2026, pp. 1587-1596
[7]        C. Zheng, Q. Wang*, C. Bai, “Amodal-KAN: The First Look at Kolmogorov-Arnold Network for Amodal Instance Segmentation,” Proceedings of the 7th ACM International Conference on Multimedia in Asia (MMAsia '25). Association for Computing Machinery, New York, NY, USA, Article 113, 1–7
[8]        C. Huang, P. Mu, C. Bai*, and P. A. Watson, TCP-Diffusion: A multi-modal diffusion model for global tropical cyclone precipitation forecasting with change awareness, in Proc. Int. Conf. Mach. Learn. (ICML), vol. 267, pp. 25634–25653, 2025.
[9]        L. Luo, P. Mu*, and C. Bai, Physics-coupled frequency dynamic adaptation network for domain generalized underwater object detection, in Proc. ACM Int. Conf. Multimed. (MM), 2025, pp. 2284–2293.
[10]     Z. Wang, K. Ying, B. Xu, C. Wang, and C. Bai*, From swath to full-disc: Advancing precipitation retrieval with multimodal knowledge expansion, in Proc. ACM SIGKDD Conf. Knowl. Discov. Data Min. (KDD), Aug. 3–7, 2025.
[11]     Z. Lin, Z. Wang*, T. Qian, P. Mu, S. Chan, and C. Bai, NeighborRetr: Balancing hub centrality in cross-modal retrieval, in Proc. IEEE/CVF Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 9263–9273, 2025.
[12]     S. Zhang, P. Mu, C. Huang, J. Zhang, and C. Bai*, TC-Diffuser: Bi-condition multi-modal diffusion for tropical cyclone forecasting, in Proc. AAAI Conf. Artif. Intell., vol. 39, no. 1, pp. 1120–1128, Apr. 2025.
[13]     C. Bai, Z. Lin, J. Zhang, and S. Chen, Dust-Mamba: An efficient dust storm detection network with multiple data sources, in Proc. AAAI Conf. Artif. Intell., vol. 39, no. 27, pp. 27813–27821, Apr. 2025.
[14]     Z. Zhang, Q. Chen, M. Xiong, S. Ding, Z. Su, X. Yao, C. Bai,et al., Zero-shot learning in industrial scenarios: New large-scale benchmark, challenges and baseline, in Proc. AAAI Conf. Artif. Intell., vol. 39, no. 10, pp. 10357–10366, Apr. 2025.
[15]     Y. Zhang, L. Luo, P. Mu*, and C. Bai, Prompt-UIE: A unified prompt-driven framework for underwater image enhancement, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), pp. 1–5, 2025.
[16]     Z. Wang, H. Zhang, and C. Bai*, PiCNet: Physics-infused convolution network for radar-based precipitation nowcasting, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), pp. 1–5, 2025.
[17]     Z. Wang, K. Huang, Z. Lin, and C. Bai*, Event-driven hybrid and cross-stage guide for video corpus moment retrieval, in Proc. Int. Conf. Multimed. Retr. (ICMR), pp. 1367–1376, 2025.
[18]     S. Wang, H. Peng, F. Wu, and C. Bai, Live on the hump: Self knowledge distillation via virtual teacher-students mutual learning, in Proc. ACM Int. Conf. Multimed. (MM), pp. 3028–3036, 2024.
[19]     H. Yan, P. Mu*, C. Huang, J. Zhang, and C. Bai, Phy-CoCo: Physical constraint-based correlation learning for tropical cyclone intensity and size estimation, in Proc. Eur. Conf. Artif. Intell. (ECAI), 2024, Art. no. 744.
[20]     P. Ouyang, J. Chen, Q. Ma*, Z. Wang, and C. Bai, Distinguishing visually similar images: Triplet contrastive learning framework for image-text retrieval, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Niagara Falls, ON, Canada, 2024, pp. 1–6.
[21]     J. Pan, Q. Ma, and C. Bai*, A prior instruction representation framework for remote sensing image-text retrieval, in Proc. ACM Int. Conf. Multimed. (MM), Ottawa, ON, Canada, Oct. 29–Nov. 3, 2023, pp. 1–10.
[22]     P. Mu, Z. Du, J. Liu*, and C. Bai, Little strokes fell great oaks: Boosting the hierarchical features for multi-exposure image fusion, in Proc. ACM Int. Conf. Multimed. (MM), Ottawa, ON, Canada, Oct. 29–Nov. 3, 2023, pp. 1–10.
[23]     P. Mu, H. Xu, Z. Liu, Z. Wang, S. Chan, and C. Bai*, A generalized physical-knowledge-guided dynamic model for underwater image enhancement, in Proc. ACM Int. Conf. Multimed. (MM), Ottawa, ON, Canada, Oct. 29–Nov. 3, 2023, pp. 1–10.
[24]     J. Pan, Q. Ma*, and C. Bai, Reducing semantic confusion: Scene-aware aggregation network for remote sensing cross-modal retrieval, in Proc. Int. Conf. Multimed. Retr. (ICMR), Thessaloniki, Greece, Jun. 12–15, 2023, pp. 1–10.
[25]     H. Cheng, C. Bai*, S. Chan, J. Zhang, and Y. Wu, MGTCF: Multi-generator tropical cyclone forecasting with heterogeneous meteorological data, in Proc. AAAI Conf. Artif. Intell., vol. 37, no. 4, pp. 5096–5104, 2023.
[26]     X. Wang, M. Shao, D. Guo, Y. Cui, X. Huang, M. Xia, C. Bai,Multi-stage aggregation transformer for medical image segmentation, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), 2023, pp. 1–5.
[27]     S. Chan, W. Wang, Z. Shao, C. Bai, SGPT: The secondary path guides the primary path in transformers for HOI detection, in Proc. IEEE Int. Conf. Robot. Autom. (ICRA), 2023, pp. 1–7.
[28]     D. Cai, P. Mu*, S. Chan, Z. Shao and C. Bai, Towards General and Fast Video Derain via Knowledge Distillation, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Brisbane, Australia, 2023, pp. 1949–1954.
[29]     P. Mu, J. Fang, H. Qian, and C. Bai*, Transmission and color-guided network for underwater image enhancement, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Brisbane, Australia, 2023, pp. 1337–1342.
[30]     Z. Liu, P. Mu*, H. Xu and C. Bai, Histogram-guided Video Colorization Structure with Spatial-Temporal Connection, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), Brisbane, Australia, 2023, pp. 2579–2584.
[31]     S. Chan, J. Cui, Y. Wu, H. Wang* and C. Bai, Visible-Xray Cross-Modality Package Re-Identification, 2023 IEEE International Conference on Multimedia and Expo (ICME), Brisbane, Australia, 2023, pp. 2579-2584
[32]     P Mu, H Qian, and C Bai*. Structure-Inferred Bi-level Model for Underwater Image Enhancement. in Proc. ACM Int. Conf. Multimed. (MM), Lisboa, Portugal, 2022, pp. 2286–2295.
[33]     J. Chen, L. Zhang, Q. Wang, C. Bai and K. Kpalma, Intra-Modal Constraint Loss for Image-Text Retrieval, in Proc. IEEE Int. Conf. Image Process. (ICIP), 2022, pp. 4023–4027.
[34]     F. Sun, C. Bai*, Y. Song and J. Zhang, MMINR: Multi-frame-to-Multi-frame Inference with Noise Resistance for Precipitation Nowcasting with Radar, in Proc. Int. Conf. Pattern Recognit. (ICPR), 2022, pp. 97–103.
[35]     Z Zha, P Zhou, C Bai*, “Exploring Implicit and Explicit Relations with the Dual Relation-Aware Network for Image Captioning”. in Proc. Int. Conf. Multimed. Model., 2022, pp. 97–108.
[36]     T Han, P Hao, C Bai, “Structural and Temporal Learning for Dropout Prediction in MOOCs”, in Proc. Int. Conf. Knowl. Sci., Eng. Manage. (KSEM), 2022, pp. 1–15.
[37]     J. Dang, Y. Liu, W. Shuai, C. Bai and S. Tian, Edge-preserving Image Smoothing via Counting-weighted Total Variation, in Proc. IEEE Int. Workshop Multimed. Signal Process. (MMSP), 2022, pp. 1–6.
[38]     K. Ying, Z. Wang, C. Bai and P. Zhou, ISDA: Position-Aware Instance Segmentation with Deformable Attention, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), 2022, pp. 2619–2623
[39]     D Wu, P Hao, Y Zheng, T Han, C Bai, “Classmates Enhanced Diversity-Self-Attention Network for Dropout Prediction in MOOCs”, in Proc. Int. Conf. Neural Inf. Process. (ICONIP), 2021, pp. 609–620.
[40]     B Ye, S Mao, P Hao, W Chen, C Bai. Community Enhanced Course Concept Recommendation in MOOCs with Multiple Entities. in Proc. Int. Conf. Knowl. Sci., Eng. Manage. (KSEM), 2021, pp. 1–12. 3
[41]     P Hao, X Xie, T Han, and C Bai. Overlap classification mechanism for skeletal bone age assessment. in Proc. ACM Int. Conf. Multimed. Asia (MMAsia), 2021, pp. 1–7.
[42]     C Huang, S Chan, C Bai*, W Ding, J Zhang*. Tropical Cyclones Tracking Based on Satellite Cloud Images: Database and Comprehensive Study. in Proc. Int. Conf. Multimed. Model., 2021, pp. 1–12.
[43]     P Zhou, C Bai*, K Ying, J Xia and Lixin Huang, “ RWMF: A Real-World Multimodal Foodlog Database”, in Proc. Int. Conf. Pattern Recognit. (ICPR), 2021, pp. 962–968.
[44]     J Chen, L Zhang, C Bai* and K Kpalma, Review of Recent Deep Learning Based Methods for Image-Text Retrieval, in Proc. IEEE Conf. Multimed. Inf. Process. Retr. (MIPR), 2020, pp. 167–172.
[45]     C Bai, C Zeng, Q Ma, J Zhang, and SY Chen, “Deep Adversarial Discrete Hashing for Cross-Modal Retrieval”, in Proc. Int. Conf. Multimed. Retr. (ICMR), 2020, pp. 525–531.
[46]     H Li, C Bai*, L Huang, Y Jiang, S Chen, “Instance Image Retrieval with Generative Adversarial Training”, in Proc. Int. Conf. Multimed. Model., 2020, pp. 381–392.
[47]     P Hao, E Manhando, T Ye, C Bai*, “Video Summarization based on Sparse Subspace Clustering with Automatically Estimated Number of Clusters”, in Proc. ACM Multimed. Asia, 2019, pp. 1–6.
[48]     Y Fu, C Wang, Y Fu, YX Wang, C Bai, X Xue, YG Jiang, “Embodied One-Shot Video Recognition: Learning from Actions of a Virtual Embodied Agent”, in Proc. ACM Int. Conf. Multimed. (MM), 2019, pp. 411–419.
[49]     W Zhang, Q Zhang, J Cheng, C Bai, P Hao, “End-to-End Panoptic Segmentation with Pixel-Level Non-Overlapping Embedding”, in Proc. IEEE Int. Conf. Multimed. Expo (ICME), 2019, pp. 976–981.
[50]    P Zhou, C Bai*, Jie Xia, ZJUTCVR Team at ImageCLEFlifelog2019 Lifelog Moment Retrieval Task, in Proc. Conf. Labs Evaluation Forum (CLEF), 2019, pp. 1–8.
[51]    L Huang, C Bai*, Y Lu, S Chen, Q Tian. Adversarial Learning for Content-Based Image Retrieval, in Proc. IEEE Conf. Multimed. Inf. Process. Retr. (MIPR), 2019, pp. 1–6.
[52]    C Bai, J Chen, Q Ma, Z Liu, S Chen. Deep Residual Net Based Compact Feature Representation for Image Retrieval, in Proc. Pacific Rim Conf. Multimed. (PCM), 2018, pp. 737–747.
[53]    P Hao, Y Chen, S Chokuwa, F Wu, C Bai*. Skeletal Bone Age Assessment Based on Deep Convolutional Neural Networks, in Proc. Pacific Rim Conf. Multimed. (PCM), 2018, pp. 408–417.
[54]    J Chen, C Bai*, L Huang, Z Liu, S Chen, Visual Saliency Fusion Based Multi-feature for Semantic Image Retrieval, in Proc. CCF Chin. Conf. Comput. Vis., 2017, pp. 126–136.
[55]    P Hao, E Manhando, C Bai*, Y Huang, Hierarchical Tree Representation Based Face Clustering for Video Retrieval, in Proc. Pacific Rim Conf. Multimed. (PCM), 2017, pp. 347–357.
[56]    JW Zheng, P Yang, S Fang, C Bai*, Block Cluster Based Dictionary Learning for Image De-noising and De-blurring, in Proc. Pacific Rim Conf. Multimed. (PCM), 2017, pp. 818–826.
[57]    C Bai, J Chen, J Zhang, K Kpalma, J Ronsin. Sparse representation based histogram in color texture retrieval, in Proc. Pacific Rim Conf. Multimed. (PCM), 2016, pp. 55–64.
[58]    H Song, Z Liu, H Du, G Sun, C Bai, Saliency detection for RGBD images, in Proc. Int. Conf. Internet Multimed. Comput. Service, 2015, pp. 72–77.
[59]    W Lu, C Bai, K Kpalma, J Ronsin. Multi-object tracking using sparse representation, in Proc. IEEE Int. Conf. Acoust., Speech Signal Process. (ICASSP), 2013, pp. 2312–2316.
[60]    K Kpalma, C Bai, MMezouar, KBelloulata, N Taleb, LBelhallouche, D Boukerroui. A new histogram-based descriptor for images retrieval from databases, in Advances Intell. Anal. Med. Data Decision Support Syst., 2013, pp. 101–112.
[61]    C Bai, K Kpalma, J Ronsin. Color textured image retrieval by combining texture and color features, in Proc. Eur. Signal Process. Conf. (EUSIPCO), 2012, pp. 170–174.
[62]    C Bai, K Kpalma, J Ronsin. Analysis of histogram descriptor for image retrieval in DCT domain, in Intell. Interactive Multimed. Syst. Services, 2011, pp. 227–235.

授权发明专利：
1.   201910467500.9: 一种基于条件生成对抗网络的图像描述生成方法(第一发明人)
2.   201810154813.4: 一种无监督对抗训练的基于内容的图像检索方法(第一发明人)
3.   201910467500.9: 一种基于条件生成对抗网络的图像描述生成方法(第一发明人)
4.   201811538815.X: 一种基于深度哈希学习优化的图像检索方法（第二发明人）
5.   201710411668.9：一种面向图像分类的深度卷积神经网络优化方法（第一发明人）
6.       201710411660.2:一种基于深度学习的快速图像分类方法(第一发明人)
7.       200810200309.X: 在彩色视频图像中提取车辆的方法 (第一发明人)
200810200310.2: 复杂场景中单个人脸的实时跟踪方法 (第二发明人)

获奖与荣誉

综合获奖情况：
2026.05    浙江工业大学校级优秀博士学位论文（指导教师）
2026.05    浙江工业大学校级优秀硕士学位论文（指导教师）
2026.01    浙江工业大学计算机学院十佳教师
2024.01    浙江工业大学计算机学院优秀教师
2023.01    浙江工业大学计算机学院优秀教师
2022.06    浙江工业大学校级优秀硕士学位论文指导教师
2021.01    浙江工业大学研究生“我心目中的好导师”
2020.10    浙江工业大学毕业设计优秀指导教师
2020.01    浙江工业大学计算机学院优秀教师
2020.01    浙江工业大学本科教学优课优酬奖
2019.06    浙江工业大学校级优秀硕士学位论文指导教师
2019.01    浙江工业大学计算机学院十佳党员
2019.01    浙江工业大学本科教学优课优酬奖
2018.10 浙江工业大学优秀班主任
2018.01    浙江工业大学本科教学优课优酬奖
2017.10    浙江工业大学毕业设计优秀指导教师
2017.01    浙江工业大学计算机学院十佳党员
2016.01    浙江工业大学计算机学院优秀教师
2015.06    浙江工业大学计算机学院青年教师讲课比赛十佳
2009.09 国家留学基金委国家公派留学生奖学金 （UT-INSA项目）
2008.12     上海市优秀学生
2005.02     山东农业大学优秀班主任
2007.05    上海大学社区优秀学生
2007.07    上海大学优秀研究生党务工作者
2007.12    上海大学优秀学生

学术获奖：
2024.10 中国计算机学会技术发明二等奖
2023.12      中国图象图形学学会科技进步二等奖
2023.08 中国多媒体大会最佳论文奖
2020.09 中国多媒体大会最佳墙报奖
2019.10 IFTC 最佳论文奖
2019.08 中国多媒体大会优秀论文奖
2018.09 Pacific Rim Conference on Multimedia （CCF C类会议）最佳审稿人奖
2017.08      中国多媒体大会最佳Poster奖

教师个人主页

导航

白琮

个人简介

育人成果

科研项目

科研成果

获奖与荣誉

个人简介

育人成果

科研项目

科研成果

获奖与荣誉

链接