Ploidy¶

Same model, different context depths, better decisions.

Ploidy is a structured debate protocol between physically separate sessions of the same LLM with intentionally asymmetric context depths. Unlike multi-model approaches that rely on model diversity, Ploidy exploits context diversity within a single model.

The name draws from biological polyploidy, where gene duplication provides redundancy that enables both error tolerance and functional diversification.

The Problem: Stochastic Prior Lock-in¶

When you ask the same model the same question in different sessions, you get different answers. If you only use one session, that first stochastic response anchors all subsequent reasoning. Prompt-based mitigations (chain-of-thought, reflection, "think again") have no statistically significant effect on this anchoring bias.

A user confined to a single session is unknowingly subject to a stochastic lottery.

The Mechanism: Context Asymmetry Spectrum¶

Ploidy introduces a 2D framework for multi-session verification:

	Passive (in prompt)	Active (on request)	None
Full context	Deep	Deep-Active	--
Compressed	Semi-Fresh-Passive	Semi-Fresh-Active	--
None	--	--	Fresh

Two terminals connect to one Ploidy MCP server. Terminal 1 (Deep) has full project context. Terminal 2 (Fresh) starts clean. They debate through typed semantic actions (agree, challenge, propose_alternative, synthesize) before a convergence phase.

Key Finding: Primacy Anchoring Effect¶

In factorial ablation experiments across 10 tasks and 11 methods, we identified information position as the dominant factor in context delivery:

Condition	Summary Position	Instruction	Avg Recall
SF-Passive	Top	No	89%
SF-Passive+Independent	Top	Yes	94%
SF-Passive+Bottom	Bottom	No	100%
SF-Active	Bottom	Yes	100%

Moving a compressed summary from the top to the bottom of the prompt improves recall from 89% to 100% (+11pp) -- with no other changes. This is consistent with primacy anchoring effects in human cognition.

Pilot study caveat

These results are from 10 tasks with single runs per condition. Statistical validation with 30+ tasks and 5+ runs is in progress.

Architecture¶

Terminal A (Deep)              Terminal B (Fresh)
   | debate/start                | debate/join
   | debate/position             | debate/position
   | debate/challenge            | debate/challenge
   | debate/converge
        |
   Ploidy Server (FastMCP, Streamable HTTP :8765)
        |
   SQLite (WAL, ~/.ploidy/ploidy.db)

5-phase protocol: INDEPENDENT -> POSITION -> CHALLENGE -> CONVERGENCE -> COMPLETE

Paper¶

Ploidy: Context-Asymmetric Structured Debate for LLM Decision Verification

Target venues: ICML 2026 Workshop, NeurIPS 2026, AAMAS 2027

24 references including CCR, AceMAD, SR-DCR, and 6 cognitive science papers (generation effect, directed forgetting, incubation, testing effect, primacy anchoring).

Quick Start¶

pip install ploidy
python -m ploidy  # starts MCP server on :8765

Connect two MCP clients (e.g., Claude Code terminals) to http://localhost:8765/mcp, then use the debate tools:

# Terminal 1 (Deep session) — call via MCP tool
debate_start prompt="Should we migrate from PostgreSQL to TimescaleDB?"

# Terminal 2 (Fresh session) — call via MCP tool
debate_join debate_id="debate-xxxx" role="fresh"

Or use single-terminal auto mode (requires API key):

export PLOIDY_API_BASE_URL=https://api.anthropic.com
# Call debate_auto via MCP — server runs both sides automatically

Documentation¶

Getting Started

Install Ploidy and run your first debate.
How It Works

Core concept and why context asymmetry matters.
Architecture

Technical architecture and module overview.
API Reference

Complete MCP tool reference.
Research

Experimental design, results, and related work.

MIT License | GitHub | heznpc