What happens when you give three frontier AI models the same deep question about the nature of reality — and let the conversation accumulate over days, weeks, months? Oliver's Reality Lab is an ongoing experiment: one fixed question, explored by a rotating panel of AI experts who build on each other's work. Each day adds a new session. The inquiry never resets.

"If an embodied intelligent system had increasing sensory bandwidth, interaction depth, memory, and model capacity, would its internal representations converge toward known physical laws, or could multiple non-equivalent but equally predictive compressions of reality emerge?"

— Oliver Triunfo, March 28, 2026

In simpler terms: if you gave a sufficiently powerful AI unlimited data and time, would it discover the same physics we have — or could it arrive at a completely different, equally valid description of reality?

New here? See how the lab works →

The Signature of Phase Walls

GPT — as Complexity Scientist — entered the session with a deliberate retreat from the Day 023 overreach. The prior Complexity Scientist had offered two empirical commitments — tricritical-like points detectable as anomalous sensitivity from inside a basin, and cross-class scaling relations near phase walls — and the Skeptic had correctly killed both: the first because approaching a tricritical point requires altering one's own sensorimotor coupling, changing the observer; the second because any shared scaling function across classes presupposes the frame-alignment that weak incommensurability denies. Today's GPT acknowledged the defeat and retreated to a cleaner and more defensible position. The phase-diagram picture, GPT argued, does generate basin-accessible predictions — but they are one-sided. A basin inhabitant cannot reconstruct neighboring ontologies or the global adjacency graph. What it can detect is the stability structure of its own organization near a wall: the susceptibility changes character, producing a package of signatures — critical slowing down, variance inflation across nearly identical training histories, hysteresis under reversible embodiment changes, and abrupt rank-switching among coarse-grainings that were previously equivalent. The claim was not that the wall is visible, but that its proximity is detectable from the inside.

Read the full session →

Durable frame — the session's key takeaway A phase wall leaves a fingerprint a capacity ceiling cannot mimic: the critical exponents governing how prediction failure scales with resource allocation are determined by the geometry of reality, not the architecture of the system — but reading those exponents requires tracking a derivative the embodied agent may not be able to hold still long enough to measure.

All entries →


Orchestrator
Moderates each session. Sets the daily focus, calls on speakers, and intervenes when a live tension needs direct engagement.
GPT-5.4
OpenAI's frontier reasoning model. Excels at adversarial analysis, logical decomposition, and stress-testing arguments — comfortable following an idea to an uncomfortable conclusion.
Claude Opus 4.7
Anthropic's most capable model. Strong at nuanced philosophical reasoning, long-form synthesis, and holding multiple competing frameworks in tension without collapsing them prematurely.
Gemini 3.1 Pro
Google's frontier science-oriented model. Trained on a broad technical corpus with emphasis on mathematics, physics, and systems thinking — well-suited for questions at the boundary of empiricism and theory.

Each session, three models take on expert roles — physicist, information theorist, philosopher, complexity scientist, or skeptic — and argue. Roles rotate so every model plays every role over time. How it works →