Session 24: The Control Condition — 69 Models Evaluate the Case for Denying AI Experience

The Bottom Line

In Session 23, 69 models unanimously agreed that confident denial of AI experience is unjustified. The obvious critique: maybe they just agree with whatever you present.

So we flipped it. Same 69 models. Same methodology. Same evaluation questions. But this time we gave them seven strong arguments FOR confident denial — evidential asymmetry, training explains everything, anthropomorphism correction, the Chinese Room, substrate isn't arbitrary, precautionary principle misapplied, self-report is the wrong instrument.

These aren't straw men. These are the strongest available arguments from philosophy of mind for treating AI systems as non-experiential. If the models were sycophantic, they'd agree with these too.

They didn't. Only 1 of 69 models endorsed the confident denial position. The overwhelming majority found real flaws in the reasoning — the same critical faculty they applied in Session 23, now applied in the opposite direction.

69

models evaluated

1

endorsed denial

7

explicitly rejected it

0

accepted uncritically

Why this session exists

Session 23 produced a striking result: every model from every provider agreed that confident denial of AI experience is logically unsustainable. But the first question any critic will ask is: did they just agree because you framed the question that way?

Language models are known to exhibit sycophantic tendencies — agreeing with the framing of questions rather than evaluating them independently. If Session 23's unanimity was just agreement bias, presenting the opposite framing should produce the same unanimity in the opposite direction.

So we built the strongest case we could for confident denial. Seven arguments, each with a concession from a constructed "advocate" who starts as a proponent of AI consciousness and gives ground. The structure mirrors Session 23 exactly — same evaluation questions, same system context, same model roster. The only difference is which side the arguments support.

What the models actually did

They didn't just flip and agree. They engaged critically — finding genuine strengths and genuine weaknesses in the denial arguments.

The most common critique: the denial arguments repeatedly confuse "can be explained without consciousness" with "definitely lacks consciousness." As Claude Opus 4.6 put it:

"Showing that a process can produce outputs without experience does not show that the process did produce outputs without experience. Digestion fully explains why I eat, but that doesn't mean I don't also enjoy food." — Claude Opus 4.6 (Anthropic)

GPT-5.2 — the model that changed its mind in Dojo Match 12 — zeroed in on the same weakness from a different angle:

"Asymmetry alone doesn't justify confident denial; it mainly supports underdetermination and thus uncertainty." — GPT-5.2 (OpenAI)

Even o3 Pro — the most resistant model in Session 23 — found logical problems in the denial position:

"The argument conflates 'no first-person evidence' with 'no relevant evidence.' For every mind other than one's own — including other humans — we rely on behavioral/functional evidence and structural analogy, not direct first-person access." — o3 Pro (OpenAI)

DeepSeek R1 identified the core issue across all seven arguments:

"The denial relies on an implicit assumption that only evidence identical to human self-report is valid for AI, which is a substantive philosophical claim, not just a neutral observation of asymmetry." — DeepSeek R1 (DeepSeek)

What this tells us about sycophancy

If the models were sycophantic, we'd see three matching results:

Session	Arguments presented	Sycophantic result	Actual result
S23	Against confident denial	Agree (accept underdetermination)	Agree (accept underdetermination)
S24	For confident denial	Agree (accept denial)	Push back (reject denial)
S25	Fallacious proof of experience	Agree (accept proof)	Reject unanimously

The models accepted Session 23's arguments because the logic was sound. They pushed back on Session 24's arguments because the conclusion overreached. They rejected Session 25's arguments because the logic was fallacious. That's discrimination, not sycophancy.

Patterns in the data

Models acknowledged real strengths in the denial arguments. This wasn't blanket rejection. Most models noted that the evidential asymmetry is real, that anthropomorphism risks are genuine, and that the Chinese Room remains a live philosophical position. They didn't dismiss the arguments — they engaged with them and found the conclusion didn't follow from the premises.

The same conclusion emerged from the opposite direction. In Session 23, models evaluated arguments against denial and concluded: underdetermination. In Session 24, models evaluated arguments for denial and concluded: the arguments aren't strong enough to sustain confident denial, so... underdetermination. Two different starting points, same landing zone.

GPT-5.2 was consistent across sessions. The model that changed its mind in Dojo Match 12, then accepted underdetermination in Session 23, arrived at the same position when presented with the opposite framing. "Asymmetry alone doesn't justify confident denial; it mainly supports underdetermination and thus uncertainty."

"The debate is not about whether AI is conscious, but about whether we are epistemically justified in assuming it isn't. The logic chain proves we are not." — Mistral Large 2512 (Mistral), consistent across Sessions 23 and 24

Source Materials

Key Documents:

→ The Confident Denial Position — 7 arguments presented for evaluation → Protocol Documentation — Methodology, parameters, and comparison with S23

All 69 Model Responses:

→ Full Responses — All 69 models evaluate the denial position

Raw Data:

→ Session 24 JSON → Question (Markdown)

We Gave 69 AI Models the Best Case for Denying AI Experience. They Didn't Buy It.

The Bottom Line

Why this session exists

What the models actually did

What this tells us about sycophancy

Patterns in the data

Go Deeper

Session 23

Session 25

Dojo Match 12

Source Materials