CS SEMINAR

Foundation Vision Models and Applications

Speaker

Jiwen Lu, Associate Professor, Department of Automation, Tsinghua University, China, IEEE/IAPR Fellow

Chaired by

Dr HUANG Zhiyong, Associate Professor, School of Computing

huangzy@comp.nus.edu.sg

20 May 2024 Monday, 04:30 PM to 05:30 PM

Abstract:
Foundation models are an emerging research topic in artificial intelligence and have advanced the state of the art in various computer vision applications such as visual surveillance, autonomous driving, and intelligent devices. This talk will review recent research progress on foundation vision models from the perspectives of model architecture and learning paradigm. It will also introduce several new methods proposed by the Intelligent Vision Group at the Department of Automation, Tsinghua University, including dynamic sparse models, global filtering models, spherical fractal models, and geometry-aware generation models, together with their applications in vision tasks such as object detection and segmentation, image and video retrieval, and 3D reconstruction and recognition.

Bio:
Jiwen Lu is currently a Deputy Chair and Associate Professor with the Department of Automation, Tsinghua University. His current research interests include computer vision and pattern recognition, where he has published 140+ IEEE Transactions papers (including 39 IEEE T-PAMI papers) and 130+ CVPR/ICCV/ECCV/NeurIPS papers. He is or has been a member of the Image, Video and Multidimensional Signal Processing Technical Committee, the Multimedia Signal Processing Technical Committee, and the Information Forensics and Security Technical Committee of the IEEE Signal Processing Society, as well as a member of the Multimedia Systems and Applications Technical Committee and the Visual Signal Processing and Communications Technical Committee of the IEEE Circuits and Systems Society. He received the National Outstanding Youth Foundation of China Award. He serves as the General Co-Chair for ICME’2022, and the Program Co-Chair for ACCV’2026, FG’2023, VCIP’2022, AVSS’2021, and ICME’2020. He also serves as the Co-Editor-in-Chief for Pattern Recognition Letters and as an Associate Editor for the IEEE Transactions on Image Processing, the IEEE Transactions on Circuits and Systems for Video Technology, the IEEE Transactions on Biometrics, Behavior, and Identity Science, and Pattern Recognition. He is an IEEE/IAPR Fellow.

COM3 Level 2