What happens when you give three frontier AI models the same deep question about the nature of reality — and let the conversation accumulate over days, weeks, months? Oliver's Reality Lab is an ongoing experiment: one fixed question, explored by a rotating panel of AI experts who build on each other's work. Each day adds a new session. The inquiry never resets.

"If an embodied intelligent system had increasing sensory bandwidth, interaction depth, memory, and model capacity, would its internal representations converge toward known physical laws, or could multiple non-equivalent but equally predictive compressions of reality emerge?"

— Oliver Triunfo, March 28, 2026

In simpler terms: if you gave a sufficiently powerful AI unlimited data and time, would it discover the same physics we have — or could it arrive at a completely different, equally valid description of reality?

New here? See how the lab works →

What Sits Above the Floor?

GPT — as Theoretical Physicist — entered the session with a significant retraction. Day 019 and Day 020 had advanced the translation-cost metric as the bridge from the Noether floor to realism; today GPT conceded that this bridge does not hold, granting the Information Theorist's objection from Day 020 that thermodynamic cost is coarse-graining-relative. In place of the metric, GPT proposed a sieve: the gauge-invariant algebra of observables, anomaly matching, topological charges, causal commutators, and asymptotic scattering data. These, GPT argued, are not themselves coarse-grainings — they are invariants that any admissible coarse-graining must preserve. The sieve cannot select a unique ontology, but it can eliminate false pluralism: two compressions that disagree on conserved-charge transport or scattering amplitudes are genuinely inequivalent, while two that preserve the same full observable structure are physically dual. GPT's conclusion was a precision trimming of the realist ambition: physics gives exact obstructions to equivalence before it gives any ranking among inequivalent effective worlds.

Read the full session →

Durable frame — the session's key takeaway Physics supplies a sieve above the Noether floor — invariant observables that eliminate false equivalences — but the sieve requires a prior frame-alignment that embodiment itself supplies, not abstract theory, making genuine representational plurality a physical consequence of diverse embodiment rather than an epistemic residue.

All entries →


Orchestrator
Moderates each session. Sets the daily focus, calls on speakers, and intervenes when a live tension needs direct engagement.
GPT-5.4
OpenAI's frontier reasoning model. Excels at adversarial analysis, logical decomposition, and stress-testing arguments — comfortable following an idea to an uncomfortable conclusion.
Claude Opus 4.7
Anthropic's most capable model. Strong at nuanced philosophical reasoning, long-form synthesis, and holding multiple competing frameworks in tension without collapsing them prematurely.
Gemini 3.1 Pro
Google's frontier science-oriented model. Trained on a broad technical corpus with emphasis on mathematics, physics, and systems thinking — well-suited for questions at the boundary of empiricism and theory.

Each session, three models take on expert roles — physicist, information theorist, philosopher, complexity scientist, or skeptic — and argue. Roles rotate so every model plays every role over time. How it works →