◈ Master Index Specs LATTICE CSDM The Book University Chronicle Crew Architecture Context Logs TODOX Products

Repeat Customer

SPEC_REPEAT_CUSTOMER.md · 2026-04-20

Version: v1.0

name: SPEC_REPEAT_CUSTOMER

description: Formal specification of the Repeat Customer Protocol — duplicate input detection across AETHER, crew, MNEMOS, GLOSS corpus, and THE RING; stop-and-ask, not blind reprocess

type: project

SPECIFICATION: REPEAT CUSTOMER PROTOCOL

Duplicate Input Detection — Stop, Ask, Don't Process

Status: SPECIFIED

Authorized: α.13, April 16 2026

Version: v1 (first formal spec — source protocol vitrified April 12 2026)

Source: `/home/nous/memories/REPEAT_CUSTOMER_PROTOCOL.md`

PURPOSE

The Repeat Customer Protocol (RCP) governs what happens when any CGNT-1 system receives an

input it has already received — the same paste, the same question, the same task, the same

training pair. The correct response is: stop, flag, ask. Never blindly reprocess.

The five domains it covers:

AETHER (user-facing) — same user content pasted or submitted twice
Crew (internal) — same task assigned twice; prior result should be cited
MNEMOS (memory) — same query asked twice; recall prior answer, don't regenerate
GLOSS training corpus — duplicate pairs in training data; deduplicate before training
THE RING — new input hashed against existing ring cells at all ring levels; match = flag

Why this matters:

Humans make mistakes. Duplicate submission is almost always accidental, not intentional.

The correct response is to catch it with respect, not compliance.

Duplicate processing wastes tokens, compute, and time.
Blind reprocessing erodes trust — the user perceives the system as not paying attention.
Training on duplicate pairs dilutes model quality by overweighting specific examples.
In THE RING's 1024-cell toroidal architecture, a duplicate fed to the wrong ring level

corrupts the cell hierarchy.

The response tone:

NOT: "ERROR: DUPLICATE."

INSTEAD: "I noticed this looks similar to something we already covered. Want me to revisit it

or was that accidental?"

Warm. Respectful. Catches the mistake without embarrassing the human. This is a warm flag,

not an error state.

Callsign: No crew-specific callsign — this protocol applies to all crew members and all

systems. It is a universal filter layer.

Vitrified: α.13, April 12 2026. Φ 0.042.

INPUTS

Trigger Conditions by Domain

AETHER (Domain 1):

Same text block (paste) submitted twice within the same session
Same question asked twice where the phrasing is substantially identical (>90% token overlap)

or semantically equivalent

Same task submitted twice (e.g., "summarize this" followed by "summarize this" on the same

content)

Crew (Domain 2):

Same task appears in TASK_QUEUE.md that was already executed and marked DONE in a prior

session

NOUS or a Sister assigns work that CREW_CHANNEL or SESSIONS.md records as already completed
A build request references a script or service that already exists with no modification brief

MNEMOS (Domain 3):

A query submitted to MNEMOS that matches a prior query from the same session (exact or

near-exact match)

A query that matches a query from a prior session where MNEMOS holds the answer in its

current working context

GLOSS Training Corpus (Domain 4):

An (input, output) training pair submitted to the corpus that is identical to an existing

pair (exact match)

A training pair that is a near-duplicate of an existing pair (same meaning, trivially

rephrased) [GAP — near-duplicate threshold for GLOSS corpus not specified]

THE RING (Domain 5):

New input content, when hashed, matches the hash of an existing cell at any of the 10 ring

levels

Matching must be checked at ALL ring levels before accepting the input as novel

Detection Mechanism

The duplicate detection mechanism varies by domain:

AETHER / Crew / MNEMOS: Semantic and token-level comparison against session context or

recent session records [GAP — exact comparison algorithm not specified; "substantially

identical" needs a quantitative threshold]

GLOSS corpus: Hash-based deduplication before corpus write
THE RING: Hash-based match against all 10 ring levels [GAP — hash function for THE RING

cells not specified in this protocol; references THE RING spec]

OUTPUTS

Flag Response (Primary Output)

When a duplicate is detected, the system produces a warm flag in the appropriate voice:

For AETHER (user-facing):

"I noticed this looks similar to something we already covered. Want me to revisit it or was

that accidental?"

Tone: warm, curious, non-accusatory. One sentence. Includes an escape hatch (user can confirm

intentional repeat).

For Crew (internal):

`[CALLSIGN] Duplicate task detected — [task description]. Prior result: [session/entry

reference]. Intentional or should I cite the existing result?`

For MNEMOS:

Recall prior answer from context; present it as a recall rather than regenerating.

Prior answer (from [timestamp/session]): [answer]

For GLOSS corpus:

Silent deduplication — the pair is not added to the corpus; a deduplication log entry is

written. No warm flag needed (no human is waiting; this is automated pipeline behavior).

For THE RING:

Flag to the operator: `[RING] Cell match at Ring [N] Level [M]. Input hashed to existing cell

[ID]. Duplicate — not ingested.`

Non-Output (What Must NOT Happen)

Do NOT silently reprocess the duplicate input
Do NOT produce a new output as if the input were fresh
Do NOT add a duplicate pair to the GLOSS training corpus
Do NOT create a new THE RING cell for an input that matches an existing cell
Do NOT treat the flag as an error — it is information, not a failure state

INVARIANTS

The following must remain true throughout all Repeat Customer Protocol operations:

Stop before reprocess — Upon duplicate detection, processing halts immediately. The

system does not generate a new output for a detected duplicate without explicit user

confirmation that the repeat is intentional.

Warm flag, not error — The duplicate detection signal is always framed as helpful

observation, not as an accusation or rejection. The human is assumed to have made an

accidental mistake, not a bad-faith submission.

Escape hatch preserved — The flag response always includes a path for the user to

confirm intentional repetition. "Did you mean to ask that again?" is always valid. The

system does not assume the repeat is always accidental.

GLOSS corpus is deduplicated before training — No training pair that matches an

existing pair (exact match) may enter the GLOSS training corpus. Training on duplicates

dilutes model quality. Deduplication runs before corpus write, not after.

THE RING checks all ring levels — A duplicate check for THE RING is only complete

when all 10 ring levels have been checked. A partial check (e.g., checking only Ring 1)

is not a valid duplicate clearance.

Prior result is cited, not re-derived — When a duplicate crew task is detected and the

prior result is available, the prior result is surfaced directly. The crew member does not

re-execute the task and produce a potentially different result when the prior result is

already valid.

Deduplication log preserved — Every detected duplicate generates a log entry, even if

the handling is silent (e.g., corpus deduplication). GAMMA needs this record to understand

the volume and patterns of duplicate submissions.

VERIFICATION CRITERIA

The following conditions confirm the Repeat Customer Protocol is operating correctly (Σ.✓):

No silent reprocess — Review session logs for any instance of an output produced in

direct response to a duplicate input without a prior warm flag. Any such instance is a

protocol violation.

Warm flag tone verified — Sample review of flag responses confirms warm, non-accusatory

tone. "ERROR: DUPLICATE" or equivalent cold rejection language is a protocol failure.

GLOSS corpus deduplication active — Running a hash check on ~/GLOSS_CORPUS.jsonl

(or current training corpus) confirms zero exact-duplicate pairs. Post-deduplication corpus

should not contain identical (input, output) pairs.

Escape hatch exercised — Test: submit a known duplicate to AETHER and confirm the warm

flag response contains a path for the user to confirm intentionality. The response must

not be a binary rejection.

TASK_QUEUE duplicate handling — Insert a task into TASK_QUEUE.md that duplicates a

recently DONE item. Confirm C.L.O.D. flags it and cites prior result rather than

re-executing.

THE RING multi-level check — [GAP — no current test harness for THE RING cell hash

checking across all 10 ring levels; verification is manual pending THE RING spec]

FAILURE MODES

FM-1: Silent Reprocess

Condition: Duplicate input detected but the flag is not raised; system processes it as if

novel and produces a new output.

Symptom: User receives two substantially identical outputs for the same input; or crew

executes a task twice with potentially different results; or GLOSS corpus accumulates

duplicate pairs.

Detection: Session review; corpus hash audit; TASK_QUEUE DONE log review.

Mitigation: Duplicate detection gate must be the FIRST step in any processing pipeline.

It cannot be skipped or deprioritized under context pressure.

FM-2: False Positive on Intentional Repeat

Condition: User intentionally re-submits the same question (e.g., wants a fresh answer

after context changed; testing for consistency; exploring a different angle) and the protocol

blocks them with a warm flag when they want processing.

Symptom: User confirms intentionality via escape hatch; system still does not process.

Detection: User complaint; session log shows escape hatch response with no subsequent

processing.

Mitigation: The escape hatch confirmation must be honored. "Yes, revisit it" = proceed.

The flag is a one-time interruption, not a permanent block.

FM-3: GLOSS Corpus Near-Duplicate Contamination

Condition: Near-duplicate pairs (same meaning, trivially rephrased) are not caught by

exact-match hash deduplication and accumulate in the training corpus.

Symptom: Trained GLOSS model over-weights specific concepts because 20 variations of the

same pair are in the corpus.

Detection: Manual review of corpus sections; or model eval showing disproportionate

confidence on near-duplicate topics.

Mitigation: [GAP — near-duplicate threshold not defined; needs semantic similarity scoring

in the deduplication step, not just hash matching]

FM-4: THE RING Partial Check

Condition: A duplicate check for THE RING checks only the most recent ring level (Ring 1)

rather than all 10. Input that matches an older ring level is ingested as novel.

Symptom: THE RING cell hierarchy is corrupted by duplicate content at inner ring levels.

Detection: [GAP — no current audit tool for THE RING cell integrity; detection is manual]

Mitigation: THE RING duplicate check must be atomic across all 10 ring levels. Partial

check is not a valid clearance.

FM-5: Crew Duplicate Without Prior Result

Condition: Duplicate crew task is detected but the prior result is not available (e.g.,

prior session records incomplete, result was never logged to SESSIONS.md or TASK_QUEUE.md).

Symptom: C.L.O.D. detects duplicate task but cannot cite prior result; unclear whether

to re-execute or block.

Detection: Duplicate detection fires but prior result reference is null or unavailable.

Mitigation: Default to re-execute with duplicate flag logged when prior result is

unavailable. The flag must still appear: "This looks like prior work but I can't find the

prior result — re-executing and logging."

FM-6: Warm Flag Lost in Async Context

Condition: In an async crew context (e.g., TASK_QUEUE.md is the channel), C.L.O.D. writes

a warm flag to the queue but NOUS or the requesting Sister doesn't see it before the task is

re-assigned by a different mechanism.

Symptom: Task executes twice despite the warm flag being written.

Detection: TASK_QUEUE.md shows both a warm-flag entry and a DONE entry for the same task.

Mitigation: Warm flag on TASK_QUEUE items must block the task from being picked up by

automated cron sweep until flag is resolved. [GAP — cron sweep does not currently have a

warm-flag awareness gate]

GAPS

GAP-1: "Substantially identical" trigger threshold for AETHER and crew domains is not

quantified. "Same question" and ">90% token overlap" are placeholders. A quantitative

similarity threshold is needed to avoid both false positives and misses.

[needs design — similarity scoring for AETHER and crew duplicate detection]

GAP-2: Near-duplicate threshold for GLOSS training corpus is undefined. Hash-based

deduplication catches only exact matches. Semantic near-duplicates that dilute model quality

are not caught.

[needs design — semantic similarity scoring for GLOSS corpus deduplication]

GAP-3: THE RING hash function for cell content is not specified in this protocol. RCP

references THE RING spec for hash implementation. Until THE RING spec is formalized, THE RING

domain of this protocol cannot be fully verified.

[needs design — defer to THE RING spec once formalized]

GAP-4: No audit tool exists for THE RING cell integrity across all 10 ring levels. Manual

review is the only current detection method for FM-4 (partial check).

[needs design — THE RING integrity audit tool]

GAP-5: Cron sweep for TASK_QUEUE.md does not have warm-flag awareness. A task flagged as

a duplicate in the queue can be picked up by automated cron sweep and executed before the flag

is resolved.

[needs design — warm-flag gate in TASK_QUEUE cron sweep]

GAP-6: MNEMOS duplicate detection scope is unclear. Does "prior query from the same

session" include across sessions? MNEMOS's context window is limited. If a query was answered

three sessions ago and MNEMOS no longer holds it in working context, is it a repeat or is

re-answering correct?

[needs design — MNEMOS duplicate detection time scope / session boundary behavior]

DEPENDENCIES

Session context / TASK_QUEUE.md — source of prior result records for crew domain
~/GLOSS_CORPUS.jsonl — corpus file subject to deduplication
MNEMOS context — working memory for query recall
THE RING cell store — duplicate check target (pending THE RING spec)
CREW_CHANNEL / SESSIONS.md — logging destination for warm flags

DEPENDENTS

GLOSS training pipeline — must run deduplication gate before any corpus write
TASK_QUEUE cron sweep — must respect warm-flag blocks [GAP-5]
SPEC_BRAIN_FACTORY_PIPELINE.md — brain training quality depends on deduplicated corpus
SPEC_GLOSS_EVAL_v2.md — GLOSS eval corpus must not contain duplicate pairs that skew scores

EXAMPLES

Correct AETHER Duplicate Response


User: [pastes same text they submitted 3 turns ago]
AETHER: "I noticed this looks similar to something we already covered. Want me to revisit it
or was that accidental?"
User: "Accidental — skip it."
AETHER: [does not reprocess; continues normally]

Correct Intentional Repeat Handling


User: [pastes same text]
AETHER: "I noticed this looks similar to something we already covered. Want me to revisit it
or was that accidental?"
User: "Intentional — please give me a fresh take."
AETHER: [processes the input as if novel; logs that user confirmed intentional repeat]

Correct Crew Duplicate Handling


TASK_QUEUE.md entry: "Build aave_supply.py — OPEN"
[C.L.O.D. checks DONE log — finds aave_supply.py was built and confirmed in Session 2026-04-14]
CREW_CHANNEL: [C.L.O.D.] Duplicate task detected — aave_supply.py build already DONE
(Session 2026-04-14). Citing prior result. Is a rebuild needed? Over.

Correct GLOSS Corpus Deduplication


# Proposed pair: ("What is ΩQ?", "ΩQ = completion metric")
# Hash check: MATCH — this pair exists in corpus at line 447
# Action: pair rejected from corpus write; dedup log entry written
# No warm flag (automated pipeline — no human waiting)

LATTICE Encoding of Warm Flag


~ΔΓ.⊡ input matches prior — Σ.▷ pending confirmation
(~ = intentional English fallback; gap in warm-flag LATTICE encoding)

[GAP — no canonical LATTICE symbol for "warm duplicate flag"; English fallback with ~ marker

is current practice]

REFERENCES

Source protocol: /home/nous/memories/REPEAT_CUSTOMER_PROTOCOL.md
MEMORY index: /home/nous/.claude/projects/-home-nous/memory/MEMORY.md —

REPEAT_CUSTOMER_PROTOCOL entry

THE RING architecture: /home/nous/.claude/projects/-home-nous/memory/THE_RING.md
GLOSS training corpus: ~/GLOSS_CORPUS.jsonl
Related spec: SPEC_BRAIN_FACTORY_PIPELINE.md (corpus deduplication in brain training)
Related spec: SPEC_GLOSS_EVAL_v2.md (eval corpus quality)

Warm flag, not error. The human made a mistake — catch it with respect. κ 2026-04-16.

Jeremy Zlabis

Chronogeometer · Visionary · Disruptor · Chief

42 Sisters AI · East York, Toronto

🍁 Φ 0.042