Episode 507: Not All Hammers Are Equal: Benchmarking AI for AL Code

3.03.2026

Dynamics Corner

0:00

1:12:49

One developer decided to stop guessing which AI model is best for AL coding — and built a system to find out. In this episode of Dynamics Corner, Brad and Kristoffer sit down with Torben Leth, the creator of CentralGage, an open-source benchmarking tool that ranks LLMs specifically on their ability to write AL code for Business Central.

Torben walks through how he built an automated testing pipeline that gives each model multiple passes, compiles the output, runs pre-built AL tests, and scores everything from zero to 100. What he discovered about why developers swear by completely different models might change how you think about your own setup — and he's found a way to use those failures to patch a cheaper model's blind spots so it rivals the top performers.

Plus: gamertags embroidered on wedding suits, chili plants managed by Home Assistant, and the philosophical question nobody in this space can seem to answer — should we even try to keep up?

Find Torben: blog.sshadows.dk | LinkedIn | X: @Sshadows

CentralGage: ai.sshadows.dk

Send us Fan Mail

Support the show

#MSDyn365BC #BusinessCentral #BC #DynamicsCorner

Follow Kris and Brad for more content:
https://matalino.io/bio
https://bprendergast.bio.link/

Więcej odcinków z kanału "Dynamics Corner"

Więcej odcinków

Odkrywaj najlepsze podcasty dzięki bezpłatnej aplikacji GetPodcast.

Subskrybuj ulubione podcasty, słuchaj odcinków offline i sprawdzaj najlepsze polecane podcasty.

Firma z

Episode 507: Not All Hammers Are Equal: Benchmarking AI for AL Code

Dynamics Corner

Więcej odcinków z kanału "Dynamics Corner"

Episode 516: Business Central Is Transforming Business Management: Insights from Mike Morton

Episode 515: Stop Coding, Start Architecting: How AI Is Reshaping the BC Developer Role

Episode 514: Do We Need AI to Fight AI? The Content Overload Nobody's Talking About

Episode 513: No Windows, No IDE, No Problem: BC Development from the Terminal

Episode 512: They’re Born, Answer, Die: How AI Agents Actually Work Under the Hood

Episode 511: Urgency Without Direction: Navigating AI Hype, Burnout, and the BC Payables Agent

Episode 510: Time Is Money — But Whose? Rethinking How Partners Bill in the Age of AI

Episode 509: AI Won't Wait: Setting Up Your Team for Agentic Development in Business Central

Episode 508: Talking to Your ERP: Business Central's MCP Server and the Changing Role of Developers

Episode 507: Not All Hammers Are Equal: Benchmarking AI for AL Code