Latent Space: The AI Engineer Podcast podcast

Information Theory for Language Models: Jack Morris

02/07/2025

Latent Space: The AI Engineer Podcast

0:00

1:18:13

Our last AI PhD grad student feature was Shunyu Yao, who happened to focus on Language Agents for his thesis and immediately went to work on them for OpenAI. Our pick this year is Jack Morris, who bucks the “hot” trends by -not- working on agents, benchmarks, or VS Code forks, but is rather known for his work on the information theoretic understanding of LLMs, starting from embedding models and latent space representations (always close to our heart). Jack is an unusual combination of doing underrated research but somehow still being to explain them well to a mass audience, so we felt this was a good opportunity to do a different kind of episode going through the greatest hits of a high profile AI PhD, and relate them to questions from AI Engineering. Papers and References made AI grad school: https://x.com/jxmnop/status/1933884519557353716A new type of information theory: https://x.com/jxmnop/status/1904238408899101014EmbeddingsText Embeddings Reveal (Almost) As Much As Text: https://arxiv.org/abs/2310.06816Contextual document embeddings https://arxiv.org/abs/2410.02525Harnessing the Universal Geometry of Embeddings: https://arxiv.org/abs/2505.12540Language modelsGPT-style language models memorize 3.6 bits per param: https://x.com/jxmnop/status/1929903028372459909Approximating Language Model Training Data from Weights: https://arxiv.org/abs/2506.15553https://x.com/jxmnop/status/1936044666371146076LLM Inversion"There Are No New Ideas In AI.... Only New Datasets"https://x.com/jxmnop/status/1910087098570338756https://blog.jxmo.io/p/there-are-no-new-ideas-in-ai-onlymisc reference: https://junyanz.github.io/CycleGAN/ — for others hiring AI PhDs, Jack also wanted to shout out his coauthor Zach Nussbaum, his coauthor on Nomic Embed: Training a Reproducible Long Context Text Embedder.

Mais episódios de "Latent Space: The AI Engineer Podcast"

Mais episódios

Descobre o mundo dos podcasts com a app gratuita GetPodcast.

Subscreve os teus podcasts preferidos, ouve episódios offline e obtém recomendações fantásticas.

Information Theory for Language Models: Jack Morris

Latent Space: The AI Engineer Podcast

Mais episódios de "Latent Space: The AI Engineer Podcast"

🕰️ The Oral History of Windsurf (ft. Varun Mohan, Scott Wu, Jeff Wang, Kevin Hou, Anshul R)

AI is Eating Search

The Future of Notebooks - with Akshay Agrawal of Marimo

Cline: the open source coding agent that doesn't cut costs

Personalized AI Language Education — with Andrew Hsu, Speak

AI Video Is Eating The World — Olivia and Justine Moore, a16z

Information Theory for Language Models: Jack Morris

Scaling Test Time Compute to Multi-Agent Civilizations — Noam Brown, OpenAI

The Shape of Compute (Chris Lattner of Modular)

The Utility of Interpretability — Emmanuel Amiesen