Stanford MLSys Seminar podcast

#62 Dan Fu - Improving Transfer and Robustness of Supervised Contrastive Learning


Dan Fu - An ideal learned representation should display transferability and robustness. Supervised contrastive learning is a promising method for training accurate models, but produces representations that do not capture these properties due to class collapse -- when all points in a class map to the same representation. In this talk, we discuss how to alleviate these problems to improve the geometry of supervised contrastive learning. We identify two key principles: balancing the right amount of geometric "spread" in the embedding space, and inducing an inductive bias towards subclass clustering. We introduce two mechanisms for achieving these aims in supervised contrastive learning, and show that doing so improves transfer learning and worst-group robustness. Next, we show how we can apply these insights to improve entity retrieval in open-domain NLP tasks (e.g., QA, search). We present a new method, TABi, that trains bi-encoders with a type-aware supervised contrastive loss and improves long-tailed entity retrieval.
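The supervised contrastive objective mentioned in the abstract, and the class-collapse behavior it induces, can be made concrete with a minimal sketch. The PyTorch snippet below is illustrative only and is not the speaker's implementation; the function name, tensor shapes, and the temperature default are assumptions, and it shows the vanilla loss without the spread or subclass-clustering mechanisms discussed in the talk.

```python
import torch

def supervised_contrastive_loss(embeddings, labels, temperature=0.1):
    """embeddings: (N, D), assumed L2-normalized; labels: (N,) integer class ids."""
    n = embeddings.size(0)
    # Cosine-similarity logits scaled by temperature.
    sim = embeddings @ embeddings.t() / temperature                    # (N, N)
    # Exclude each point's similarity to itself from numerator and denominator.
    self_mask = torch.eye(n, dtype=torch.bool, device=sim.device)
    sim = sim.masked_fill(self_mask, float("-inf"))
    # Positives are the other points with the same label; pulling all of them
    # toward one embedding is the "class collapse" behavior described above.
    pos_mask = (labels.unsqueeze(0) == labels.unsqueeze(1)) & ~self_mask
    # Log-softmax of each anchor's similarities over all other points.
    log_prob = sim - torch.logsumexp(sim, dim=1, keepdim=True)
    pos_log_prob = torch.where(pos_mask, log_prob, torch.zeros_like(log_prob))
    pos_counts = pos_mask.sum(dim=1)
    # Average over positives, for anchors that have at least one positive.
    loss_per_anchor = -pos_log_prob.sum(dim=1) / pos_counts.clamp(min=1)
    return loss_per_anchor[pos_counts > 0].mean()
```

With L2-normalized embeddings (e.g. via torch.nn.functional.normalize) and integer class labels, this is the standard formulation; the mechanisms presented in the talk would modify this objective to balance spread and encourage subclass clustering.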
