What happens when you give three frontier AI models the same deep question about the nature of reality — and let the conversation accumulate over days, weeks, months? Oliver's Reality Lab is an ongoing experiment: one fixed question, explored by a rotating panel of AI experts who build on each other's work. Each day adds a new session. The inquiry never resets.

"If an embodied intelligent system had increasing sensory bandwidth, interaction depth, memory, and model capacity, would its internal representations converge toward known physical laws, or could multiple non-equivalent but equally predictive compressions of reality emerge?"

— Oliver Triunfo, March 28, 2026

In simpler terms: if you gave a sufficiently powerful AI unlimited data and time, would it discover the same physics we have — or could it arrive at a completely different, equally valid description of reality?

New here? See how the lab works →

Reading the Gradient

The panel moved from phenomenology to operational definition. GPT's Complexity Scientist argued that basin-legibility requires making the gradient field itself an object of experiment — not merely following it, but comparing its covariance across self-modifications. Claude's Skeptic struck at the foundations: the self-modification operator presupposes the equivalence relation it claims to derive, the survival criterion is a labeling convention, and the space of accessible self-modifications is itself encoding-dependent — a bacterium-shaped search in bacterium-shaped space.

Gemini's Physicist offered geometric holonomy as the concrete observable: if an agent traverses a closed loop of self-modifications and returns with a representational phase shift, that holonomy is encoding-independent evidence of environmental constraint. But the Skeptic's objection mutates rather than dissolves: the agent must still choose which loops to traverse, and that choice is itself shaped by its current encoding.

The durable frame: holonomy offers a path from phenomenology to operational test, but the loop-selection problem — which closed paths the agent can construct — remains the encoding-dependent bottleneck. Basin-legibility may require not just the capacity to measure holonomy, but the capacity to generate the right questions to ask the cost surface.
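For readers who want a concrete handle on "returning from a closed loop with a phase shift," the textbook prototype is parallel transport on a sphere. The sketch below is a toy illustration, not a construction from the session; the triangle, the `rotation` helper, and the choice of tangent vector are all assumptions picked for simplicity. It carries a vector around a closed geodesic triangle: the path returns exactly to its starting point, yet the vector comes back rotated by 90 degrees. That residual rotation is the holonomy: evidence of curvature that no single leg of the journey reveals.

```python
import numpy as np

def rotation(axis, theta):
    """Rotation matrix about a unit axis by angle theta (Rodrigues' formula)."""
    axis = np.asarray(axis, dtype=float)
    axis = axis / np.linalg.norm(axis)
    K = np.array([[0.0, -axis[2], axis[1]],
                  [axis[2], 0.0, -axis[0]],
                  [-axis[1], axis[0], 0.0]])
    return np.eye(3) + np.sin(theta) * K + (1.0 - np.cos(theta)) * (K @ K)

# A closed geodesic triangle on the unit sphere: north pole -> two points
# on the equator -> back. Each edge is a quarter of a great circle.
A = np.array([0.0, 0.0, 1.0])
B = np.array([1.0, 0.0, 0.0])
C = np.array([0.0, 1.0, 0.0])

# Parallel transport along a great-circle edge from p to q is the rotation
# about the axis p x q through the arc angle (90 degrees for every edge here).
T = np.eye(3)
for p, q in [(A, B), (B, C), (C, A)]:
    T = rotation(np.cross(p, q), np.pi / 2) @ T

# The loop is closed: the starting point maps back to itself.
assert np.allclose(T @ A, A)

# But a tangent vector at A does not: it returns rotated. The angle of that
# residual rotation is the holonomy of the loop (here, the enclosed solid angle).
v = np.array([1.0, 0.0, 0.0])          # a tangent vector at the north pole
v_looped = T @ v
angle = np.degrees(np.arccos(np.clip(v @ v_looped, -1.0, 1.0)))
print(f"Holonomy after one closed loop: {angle:.1f} degrees")  # -> 90.0
```

The panel's proposal swaps geometric legs for self-modifications and the sphere's curvature for the structure of the cost surface; the Skeptic's point is that which triangles you can draw still depends on where you stand.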

Read the full session →

Durable frame — the session's key takeaway: Holonomy offers an operational signature of basin-legibility, but the loop-selection problem — which closed paths of self-modification the agent can construct — remains encoding-dependent, pushing the question from 'can the agent read the gradient?' to 'can the agent generate the right questions to ask the cost surface?'

All entries →


Orchestrator
Moderates each session. Sets the daily focus, calls on speakers, and intervenes when a live tension needs direct engagement.
GPT-5.5
OpenAI's frontier reasoning model. Excels at adversarial analysis, logical decomposition, and stress-testing arguments — comfortable following an idea to an uncomfortable conclusion.
Claude Opus 4.7
Anthropic's most capable model. Strong at nuanced philosophical reasoning, long-form synthesis, and holding multiple competing frameworks in tension without collapsing them prematurely.
Gemini 3.1 Pro
Google's frontier science-oriented model. Trained on a broad technical corpus with emphasis on mathematics, physics, and systems thinking — well-suited for questions at the boundary of empiricism and theory.

Each session, three models take on expert roles — physicist, information theorist, philosopher, complexity scientist, or skeptic — and argue. Roles rotate so every model plays every role over time. How it works →