Some Guidance of Transfer Learning Algorithm Designs From Statistical Inference Perspectives - IEEE DLT, 8 Mar 2024 11:00 AM
Despite the success of data driven approaches in transfer learning, the theoretical understanding of transfer learning from statistics or information theory is somewhat behind. A key challenge when conducting theoretical analyses for transfer learning problems is that the statistical correlation between source and target tasks are often unknown, and hence traditional mathematical tools in statistics and estimation theory are difficult to be applied. In this talk, we address this issue by formulating the transfer learning as an optimization problem for the testing loss over certain similarity constraints of the source and target tasks. Specifically, we first show that when the similarity between both tasks measured by a certain distance metric is given, the optimal linear transfer model can be computed. The optimal coefficient in such a model reveals how the sample complexity, model complexity, and the task similarity affect the knowledge transferring in transfer learning. Moreover, when the task similarity cannot be well estimated due to insufficient samples, we propose a minimax formulation, which only requires the similarity being bounded, and the resulting distribution estimator is robust against sample insufficiency. We show an approximately optimal distribution estimator for the minimax problem from the bounded normal mean problem, and develop similar knowledge transferring insights as in the linear transferring model. Finally, some experimental results validate the algorithms led by our theoretical approach.
Date and Time
Location
Hosts
Registration
- Date: 08 Mar 2024
- Time: 11:00 AM to 12:00 PM
- All times are (UTC-08:00) Pacific Time (US & Canada)
- Add Event to Calendar
- 2207 Main Mall
- Vancouver, British Columbia
- Canada V6T1Z4
- Building: Earth Science Building
- Room Number: 4127
- Starts 04 March 2024 12:14 PM
- Ends 08 March 2024 12:00 AM
- All times are (UTC-08:00) Pacific Time (US & Canada)
- No Admission Charge
Speakers
Shao-Lun Huang
Information theory, communication theory, machine learning, and statistics
Biography:
Shao-Lun Huang received the B.S. degree with honor in 2008 from the Department of Electronic Engineering, National Taiwan University, Taiwan, and the M.S. and Ph.D. degree in 2010 and 2013 from the Department of Electronic Engineering and Computer Sciences, Massachusetts Institute of Technology. From 2013 to 2016, he was working as a postdoctoral researcher jointly in the Department of Electrical Engineering at the National Taiwan University and the Department of Electrical Engineering and Computer Science at the Massachusetts Institute of Technology. Since 2016, he has joined Tsinghua-Berkeley Shenzhen Institute, where he is currently a tenured associate professor. His research interests include information theory, communication theory, machine learning, and statistics.