LlamaCast podcast

Number Cookbook

2024-11-08
📓 Number Cookbook: Number Understanding of Language Models and How to Improve It

This research paper examines the numerical understanding and processing abilities (NUPA) of large language models (LLMs). The authors create a benchmark to test LLMs on four numerical representations (integers, floating-point numbers, fractions, and scientific notation) across 17 tasks grouped into four ability categories. They find that, despite strong problem-solving capabilities, LLMs struggle with basic numerical operations. The paper evaluates methods to enhance NUPA during pretraining and finetuning, such as specialized tokenizers, positional encodings, and data formats, and notes the limitations of chain-of-thought techniques for numerical tasks. The authors call for further research to improve LLMs' fundamental numerical capabilities.
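As a toy illustration of what benchmarking the four representations might look like (an assumption for illustration only, not the paper's actual benchmark, tasks, or grading code), the sketch below grades exact-match answers to one addition task per representation; `model_answers` is a hypothetical stand-in for an LLM's replies:

```python
from fractions import Fraction

# One addition task in each of the four representations the summary lists.
# (Hypothetical examples; not taken from the paper's benchmark.)
tasks = {
    "integer":    ("123 + 456", "579"),
    "float":      ("1.5 + 2.25", "3.75"),
    "fraction":   ("1/3 + 1/6", str(Fraction(1, 3) + Fraction(1, 6))),  # "1/2"
    "scientific": ("1e3 + 2e3", "3e3"),
}

# A real evaluation would prompt an LLM here; these stand-in answers
# are placeholders for model output.
model_answers = {
    "integer":    "579",
    "float":      "3.75",
    "fraction":   "1/2",
    "scientific": "3e3",
}

def grade(rep):
    """Exact string match between the model's answer and the gold answer."""
    return model_answers[rep].strip() == tasks[rep][1]

score = sum(grade(rep) for rep in tasks) / len(tasks)
print(f"accuracy: {score:.0%}")  # accuracy: 100%
```

Exact-match grading like this is deliberately strict: it surfaces the basic-operation failures the paper highlights, which softer metrics (e.g. partial digit credit) can mask.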

📎 Link to paper
