Recursive Language Models (Blog Summary)

Source: Alex L. Zhang's personal blog (October 2025)

URL: https://alexzhang13.github.io/blog/2025/rlm/

Author: Alex L. Zhang

Overview

This blog post introduces Recursive Language Models (RLMs), an inference strategy in which a language model recursively calls itself or other LLMs before producing a final answer. The goal is to process essentially unbounded input context and to mitigate "context rot", the degradation in model quality as conversations or contexts grow long [Source: sources/rlm-blog-alexzhang.md].

Context Rot: The Motivation

The author describes "context rot" as a well-known but hard-to-characterize phenomenon: as Claude Code histories bloat or ChatGPT conversations lengthen, the model seems to get "dumber." Needle-in-haystack benchmarks like RULER don't capture this, since frontier models score 90%+ there. The rot is more subtle: semantic drift, compounding errors, and degraded reasoning over long interactions. RLMs address this by ensuring that no single model call ever has to see the full context at once [Source: sources/rlm-blog-alexzhang.md].

REPL Environment

The root LM works inside a REPL environment that holds the context as a variable. The root LM:

Receives only the query + knowledge that a context variable exists
Writes code to inspect the context (peek, grep, partition)
Can call a recursive LM inside the REPL to process sub-contexts
Returns a final answer via FINAL() or FINAL_VAR() [Source: sources/rlm-blog-alexzhang.md]
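
The four steps above can be sketched as a small loop. This is a minimal illustration under stated assumptions, not the author's implementation: the REPL is assumed to be Python, and the names `rlm`, `llm`, `make_scripted_root_lm`, and `toy_sub_lm` are hypothetical. Both "models" are deterministic stand-ins so the example runs without any API; only the `context` variable and the `FINAL()` / `FINAL_VAR()` conventions come from the summary.

```python
import contextlib
import io
import re
from typing import Callable, List

# Hypothetical interface: an "LM" here is any function from prompt text to reply text.
LM = Callable[[str], str]

FINAL_VAR_RE = re.compile(r"FINAL_VAR\((\w+)\)")
FINAL_RE = re.compile(r"FINAL\((.*)\)", re.DOTALL)


def rlm(query: str, context: str, root_lm: LM, sub_lm: LM, max_turns: int = 8) -> str:
    """Toy RLM loop: the root LM is told that `context` exists but never receives it."""
    # REPL namespace: the full context plus a handle for recursive LM calls.
    env = {"context": context, "llm": sub_lm}
    transcript = (
        f"Query: {query}\n"
        f"A Python REPL holds a string variable `context` ({len(context)} chars).\n"
        "Reply with Python code, FINAL(answer), or FINAL_VAR(variable_name).\n"
    )
    for _ in range(max_turns):
        reply = root_lm(transcript).strip()
        var_match = FINAL_VAR_RE.fullmatch(reply)
        if var_match:  # the answer lives in a REPL variable
            return str(env[var_match.group(1)])
        final_match = FINAL_RE.fullmatch(reply)
        if final_match:  # the answer is given inline
            return final_match.group(1)
        # Otherwise treat the reply as code. exec is unsafe outside a sandbox.
        buffer = io.StringIO()
        with contextlib.redirect_stdout(buffer):
            exec(reply, env)
        # Feed back only a truncated view of the output, never the whole context.
        transcript += f"\n>>> {reply}\n{buffer.getvalue()[:500]}"
    raise RuntimeError("root LM did not produce a final answer")


def make_scripted_root_lm(seen_prompts: List[str]) -> LM:
    """Stand-in for a real root model: replays a fixed peek / grep / recurse script."""
    replies = iter([
        "print(context[:40])",  # peek
        "hits = [line for line in context.splitlines() if 'magic' in line]\nprint(len(hits))",  # grep
        "answer = llm('Extract the number: ' + hits[0])",  # recursive call on a sub-context
        "FINAL_VAR(answer)",
    ])

    def root_lm(prompt: str) -> str:
        seen_prompts.append(prompt)  # record what the root LM actually sees
        return next(replies)

    return root_lm


def toy_sub_lm(prompt: str) -> str:
    """Stand-in for a recursive LM call: returns the first number in its prompt."""
    found = re.search(r"\d+", prompt)
    return found.group() if found else "unknown"


filler = ["filler line"] * 1000
context = "\n".join(filler + ["the magic number is 7421"] + filler)
seen_prompts: List[str] = []
result = rlm("What is the magic number?", context, make_scripted_root_lm(seen_prompts), toy_sub_lm)
print(result)  # prints 7421
```

In this sketch the root LM's prompt grows only by its own code and truncated REPL output, so the line containing the answer reaches a model solely through the recursive `llm` call on a one-line sub-context. A real system would replace the scripted replies with actual model calls and run the code in a sandbox.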

Benchmark Highlights

OOLONG (trec_coarse)

Method	132k Score	263k Score	Cost
GPT-5	Baseline	Baseline	Baseline
GPT-5-mini	Lower	Lower	Lower
RLM(GPT-5-mini)	+34 pts (~114%)	+15 pts (~49%)	~Same
RLM w/o recursion	-10%	-	-
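
As a rough sanity check on the RLM(GPT-5-mini) row, the paired figures (points gained and percentage gained) imply approximate absolute scores. The sketch below assumes both figures are measured against the GPT-5 baseline score; the resulting values are back-of-the-envelope estimates, not numbers reported in the source, and `implied_scores` is a hypothetical helper name.

```python
from typing import Tuple


def implied_scores(gain_points: float, gain_fraction: float) -> Tuple[float, float]:
    """Return (baseline, improved) scores implied by an absolute and a relative gain."""
    # If improved = baseline + gain_points and gain_points = gain_fraction * baseline,
    # then baseline = gain_points / gain_fraction.
    baseline = gain_points / gain_fraction
    return baseline, baseline + gain_points


base_132k, rlm_132k = implied_scores(34, 1.14)  # +34 pts, ~114%
base_263k, rlm_263k = implied_scores(15, 0.49)  # +15 pts, ~49%
print(f"132k: GPT-5 ~{base_132k:.1f}, RLM(GPT-5-mini) ~{rlm_132k:.1f}")
print(f"263k: GPT-5 ~{base_263k:.1f}, RLM(GPT-5-mini) ~{rlm_263k:.1f}")
```

Under that assumption, the GPT-5 baseline sits at roughly 30 points at both context lengths, which is consistent with the much larger relative gain at 132k.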

BrowseComp-Plus (up to 1,000 docs / 10M+ tokens)

RLM(GPT-5): Perfect performance at 1,000 documents
RLM w/o recursion: ~90%
Base GPT-5: Significant dropoff as docs increase
ReAct + BM25: Outperformed by RLM [Source: sources/rlm-blog-alexzhang.md]

Philosophy: Agents vs RLMs

"Agents are designed based on human / expert intuition on how to break down a problem to be digestible for an LM. RLMs are designed based on the principle that fundamentally, LMs should decide how to break down a problem to be digestible for an LM."

This contrast between a context-centric view of decomposition (RLMs) and a problem-centric one (agents) is the core conceptual distinction [Source: sources/rlm-blog-alexzhang.md].

[[recursive-language-models]] — Main concept page
[[rlm-paper]] — arXiv paper summary
[[alex-l-zhang]] — Author entity
