Last Week in AI podcast

#206 - Llama 4, Nova Act, xAI buys X, PaperBench


Our 206th episode with a summary and discussion of last week's big AI news! Recorded on 04/07/2025

Try out the Astrocade demo here! https://www.astrocade.com/

Hosted by Andrey Kurenkov and Jeremie Harris. Feel free to email us your questions and feedback at [email protected] and/or [email protected]

Read our text newsletter and comment on the podcast at https://lastweekin.ai/.

Join our Discord here! https://discord.gg/nTyezGSKwP

In this episode:

  • Meta releases Llama 4, a series of large language models with up to 2 trillion parameters across different configurations, sparking debate over performance and release timing.
  • Amazon's AGI Lab debuts Nova Act, an AI agent for web browser control, reporting benchmark results competitive with OpenAI's and Anthropic's best agents.
  • OpenAI's new image generation capabilities and its ongoing financing, notably a $40 billion funding round led by SoftBank, point to strategic shifts in the company’s operations.

Timestamps + Links:

  • (00:00:00) Intro / Banter

Tools & Apps

Applications & Business

Research & Advancements

Policy & Safety

  • (00:58:28) Taking a responsible path to AGI
  • (01:02:32) This A.I. Forecast Predicts Storms Ahead
  • (01:06:24) The Secrets and Misdirection Behind Sam Altman’s Firing From OpenAI
  • OpenAI's new image generation capabilities represent significant advancements in AI tools, showcasing impressive benchmarks and multimodal functionalities.
  • OpenAI is finalizing a historic $40 billion funding round led by SoftBank, while Sam Altman shifts his focus to technical direction and COO Brad Lightcap takes on more operational responsibilities.
  • Anthropic unveils interpretability research that introduces cross-layer transcoders and offers insight into model reasoning, with applications to Claude 3.5.
  • Challenging new benchmarks such as ARC-AGI-2 and complex Sudoku variations aim to push the boundaries of reasoning and problem-solving in AI models.
