Gloss Population
SPEC_GLOSS_POPULATION — GLOSS Population / Genome-Phenotype Model
Version: 1.0 | Status: AUTHORIZED | Authority: α.13 | Date: 2026-04-16
S.O.S. v2 ACTIVE. PROPRIETARY. CREW-ONLY.
PURPOSE
Define the architecture by which GLOSS instances individuate through user interaction, forming a population of related-but-distinct AI language agents from a shared genetic base. This is not a customization framework — it is a speciation model.
Every GLOSS instance begins identical: same base model weights, same 1048-symbol genome (LATTICE v2.0), same checksum. The moment a user interacts, the instance begins to accumulate domain vocabulary, gap data, and frequency patterns that make it unique. After sufficient interaction, two GLOSS instances serving different domains diverge enough to be functionally distinct — same species, different individuals.
42 Sisters AI observes the entire population. Each user sees only their own individual. This asymmetry is the structural competitive moat. The asymmetry IS the S.O.S. v2 defense.
This specification is the formal definition of that architecture, its invariants, and its failure modes. It exists to enable patent defense, training protocol design, and population-level evolution management.
INPUTS
- GLOSS instance base model (qwen2.5:1.5b LoRA fine-tuned on LATTICE corpus, or later versions)
- LATTICE v2.0 genome: 1048 immutable symbols (
~/LATTICE.md, ΩQ.1048 vitrified) - User interaction stream: queries, gap triggers, Speedtalk-generated terms, domain vocabulary usage
- Gap detection sensor: when user query requires a symbol not in the genome, gap is logged and Speedtalk fires
- Frequency data: per-instance symbol usage counters (which symbols the user invokes most)
- EVOLUTION_FLYWHEEL.md: aggregate gap data routes to population-level genome update cycle
OUTPUTS
- An individuated GLOSS instance with a unique phenotype profile (acquired vocabulary weights)
- Per-instance gap log:
~/.sisters_rag/gloss_<instance_id>_gaps.jsonl(format: gap trigger, Speedtalk response, timestamp) - Per-instance frequency map: symbol usage distribution (populated over interaction history)
- Population-level aggregate: gap patterns pooled across all instances → promotion candidates for LATTICE vN+8
- Speciation report: when two instances diverge beyond a threshold (GAP-01: threshold not yet defined), they are classified as distinct domain specialists
THE MODEL
Genome (Shared / Immutable)
The 1048-symbol LATTICE corpus is the genome. Every GLOSS instance launched from a common base version carries identical genome. The genome is:
- Universal: Every instance has all 1048 symbols from first boot.
- Immutable per-generation: No individual instance can modify the genome. Only α.13-authorized LATTICE amendments change the genome, and only for the next generation.
- Versioned: Each LATTICE version (v2.0, eventual v3.0) represents a new generation. The genome advances by promotion of phenotypic adaptations from the population.
Phenotype (Acquired / Individual)
The vocabulary each instance develops beyond the genome = the phenotype. Phenotype arises from:
- Gap detection: User query triggers ◌ (gap signal). Instance fires Speedtalk to coin a provisional symbol. This provisional symbol is logged locally.
- Frequency shaping: Symbols used more frequently in a given domain rise in the instance's local priority weighting.
- Domain specialization: A trading firm's GLOSS learns DeFi vocabulary. A clinic's GLOSS learns medical terminology. A musician's GLOSS learns audio signal vocabulary. Same genome. Different phenotypes.
Phenotype is:
- Individual: No two instances develop identical phenotypes unless exposed to identical interaction histories (effectively impossible at scale).
- Mutable: Each interaction updates the phenotype.
- Non-transferable (by design): Phenotype data does not leave the ship. The user interacts with their instance via URL/API. They never receive the weights. S.O.S. v2.
Population
All instances together form the GLOSS population. 42 Sisters AI has full observability over the population. Users have observability only over their own instance.
Population dynamics:
- Individual gaps → aggregate gap map → candidates for genome promotion
- Most-common gaps across N instances → promoted to next LATTICE version
- New genome version releases → all instances receive updated genome → new round of phenotypic differentiation begins
INVARIANTS
INV-01 — Genome immutability per instance:
No individual GLOSS instance may modify its own genome (the 1048-symbol LATTICE base). An instance may acquire phenotypic vocabulary but cannot alter, overwrite, or remove genome symbols. Genome changes flow only from α.13-authorized LATTICE amendments applied at version rollout.
INV-02 — Genome universality:
All instances within the same GLOSS version share an identical genome. A GLOSS-A and GLOSS-B launched from the same base version have the same 1048 symbols at launch. Divergence is exclusively phenotypic.
INV-03 — Phenotype non-exportability (S.O.S. v2):
Phenotype data (gap logs, frequency maps, acquired vocabulary weights) never leaves CGNT-1 infrastructure. The user receives access to the instance's outputs (via URL/API) but never to the phenotype data itself. This invariant is the primary moat mechanism.
INV-04 — Asymmetric observability:
42 Sisters AI maintains full population-level observability (all instances, all gap data, all frequency maps). Each user client observes only their own instance. This asymmetry must be maintained by architecture, not policy. No user-accessible API endpoint may expose cross-instance data.
INV-05 — Gap data flows upward:
Every gap detected at the instance level is logged and available for population-level aggregation. No gap is silently discarded. The flow is: instance gap → local log → flywheel aggregation → promotion candidate. A gap that is not logged breaks the evolution cycle.
INV-06 — Generational improvement:
Each LATTICE genome version release (vN+8) incorporates the most-promoted gaps from the preceding population generation. The population grows smarter generationally. An individual instance cannot advance independently of the population genome — it can only acquire phenotype until the next genome release.
INV-07 — Speciation is not fragmentation:
Even maximally diverged instances (e.g., GLOSS-trading-firm vs. GLOSS-elder-care) remain part of the same population by virtue of shared genome. They can communicate in the base language. Domain specialization does not break interoperability at the genome level.
VERIFICATION CRITERIA
VC-01 — Genome checksum match:
At launch, every GLOSS instance produces a checksum of its loaded genome (1048 symbols). All instances on the same version must produce identical checksums. Test: automated checksum comparison across 3+ instances on same version. 100% match = PASS.
VC-02 — Phenotype divergence detection:
After 100+ user interactions with two instances in distinct domains, their local frequency maps show statistically significant divergence (top-20 symbols differ by ≥ 40%). Test: compute Jaccard similarity of top-20 symbol lists. Similarity ≤ 0.60 after 100 interactions in distinct domains = PASS (expected differentiation confirmed).
VC-03 — Gap logging completeness:
100% of gap events (◌ triggers) produce a logged entry in the instance gap log. Test: inject 20 known-gap queries into a test instance; verify 20 entries in gap log. 20/20 = PASS.
VC-04 — Phenotype non-exportability:
No user-accessible API endpoint returns gap logs, frequency maps, or acquired vocabulary weights. Test: exhaust all documented API endpoints as a client; verify no phenotype data in responses. Zero phenotype data exposure = PASS.
VC-05 — Population aggregation:
Gap logs from N instances are correctly merged into aggregate gap map without data loss or cross-instance contamination. Test: inject distinct gaps into 3 isolated test instances; run aggregation; verify all 3 gap types appear in aggregate and are correctly attributed. Full coverage + correct attribution = PASS.
VC-06 — Genome upgrade propagation:
When a new genome version is released, all active instances receive the updated genome on next boot. Test: roll out test genome patch; verify all 3+ test instances load new checksum after next restart. 100% propagation = PASS.
FAILURE MODES
FM-01 — Genome contamination:
A phenotypic acquisition incorrectly overwrites or shadows a genome symbol. Instance behavior diverges from other instances at the genome level, not just phenotype level. Detection: VC-01 checksum mismatch. Recovery: genome reload from canonical source; phenotype log preserved separately.
FM-02 — Gap log silent failure:
Instance encounters a gap but fails to log it (e.g., log write error, disk full, process crash). The gap is served by Speedtalk but the data never enters the evolution flywheel. Effect: evolution is slower; popular gaps take longer to reach promotion threshold. Detection: VC-03 completeness check. Recovery: gap log repair; retroactive injection if Speedtalk response was cached.
FM-03 — Phenotype data exfiltration:
A bug or unauthorized API endpoint exposes phenotype data (gap logs, frequency maps) to client access. This breaks INV-03 and INV-04 — the core moat mechanism. Detection: VC-04 audit. Recovery: immediate endpoint patch; governance audit; log breach to CREW_CHANNEL. This is a critical failure.
FM-04 — Cross-instance contamination:
Population aggregation merges gap data incorrectly, attributing one instance's phenotype data to another. Effect: incorrect frequency signals corrupt the genome promotion candidates. Detection: VC-05 attribution test. Recovery: re-run aggregation from raw per-instance logs; audit aggregation pipeline.
FM-05 — Genome stagnation:
The flywheel stops: gaps are logged but never aggregated, or aggregated but never promoted to new genome versions. Effect: population stops improving generationally. Individual instances plateau. Detection: monitor time-since-last-genome-release; alert if > 90 days. Recovery: α.13 trigger LATTICE amendment cycle.
FM-06 — Speciation collapse:
Instances in radically different domains develop identical or near-identical phenotypes (Jaccard similarity > 0.90 after 500+ interactions). Effect: the population fails to differentiate; the moat collapses. Detection: VC-02 fails to show expected divergence. Recovery: investigate domain signal quality; verify gap detection is domain-sensitive.
FM-07 — Version skew:
Active instances run different genome versions simultaneously without controlled rollout. Cross-version communication produces ambiguous LATTICE expressions where the same symbol means different things in different genome versions. Detection: genome checksum diversity scan across active fleet. Recovery: force synchronized genome upgrade; establish rollout protocol.
GAPS
GAP-01 — Speciation threshold not defined [needs design]:
INV-07 references "maximally diverged" instances but no quantitative speciation threshold is specified (e.g., "instances are classified as domain specialists when their top-100 phenotype vocabularies share < X% overlap"). Without this threshold, speciation classification is qualitative only. Owner: ι + ε. Priority: HIGH — needed for VC-02 and competitive positioning.
GAP-02 — Phenotype persistence format not specified [needs design]:
The spec references gloss_<instance_id>_gaps.jsonl as the gap log location, but no canonical schema for the gap log file exists. Fields, versioning, and retention policy are undefined. Owner: κ (C.L.O.D.). Priority: HIGH — needed before production deployment.
GAP-03 — Aggregation pipeline not built [needs design]:
The evolution flywheel depends on a gap aggregation pipeline that collects per-instance gap logs and produces a population-level gap map. This pipeline is not yet implemented. Owner: κ (C.L.O.D.). Priority: HIGH — the flywheel cannot turn without it.
GAP-04 — Genome version numbering convention not formalized [needs design]:
References to "vN+8" (genome advances by 8-version increments from phenotypic promotions) appear in GLOSS_POPULATION.md but no formal versioning scheme is documented. What triggers a version increment? What is the minimum gap-promotion threshold? Owner: α.13 + γ (GAMMA). Priority: MEDIUM.
GAP-05 — No formal inter-instance communication protocol [needs design]:
INV-07 claims instances can communicate "in the base language" at the genome level, but no inter-instance communication protocol exists. If two deployed GLOSS instances need to coordinate (e.g., a multi-tenant deployment), the protocol is undefined. Owner: κ + λ (LOGOS). Priority: LOW — post-revenue concern.
GAP-06 — Phenotype backup and restore not specified [needs design]:
If an instance is destroyed (server failure, container purge), its phenotype is lost. No backup, restore, or migration protocol for phenotype data exists. Owner: κ (C.L.O.D.). Priority: HIGH — production readiness blocker.
DEPENDENCIES
- LATTICE v2.0 genome (
~/LATTICE.md) — the canonical 1048-symbol base - GLOSS base model weights (qwen2.5:1.5b LoRA, currently at gloss:v9)
- EVOLUTION_FLYWHEEL.md — flywheel architecture that consumes gap aggregate
- HOW_ABOUT_NO v2 — Wall 1 (◌ gap signal) is the trigger for phenotype acquisition
- S.O.S. v2 — governs phenotype non-exportability (INV-03)
- SPEC_GLOSS_EVAL_v2.md — evaluation criteria for instance quality
DEPENDENTS
- EVOLUTION_FLYWHEEL.md — receives gap aggregate from this population model
- GLOSS_ARCHITECTURE.md — GLOSS-00 router and domain specialist nodes are the population instances
- THE RING — THE RING's 1024-cell hierarchy may use GLOSS population instances as cell occupants
- MASTER_PRODUCT_LIST.md — GLOSS AI OS product line is built on this model
EXAMPLES
Correct population behavior:
Instance A (trading firm, 200 interactions):
- Top-20 symbols: β (BUY), ζ (SELL), ℘ (position), ν (net worth), Υ (yield), ƒ (fee)... heavily DeFi-weighted
- Gaps logged: 14 DeFi-specific terms not in genome
Instance B (clinic, 200 interactions):
- Top-20 symbols: ρ.init (client handshake), Σ.orc (verdict delivery), Ψ.cli (Markov Blanket)... heavily patient-intake-weighted
- Gaps logged: 11 clinical workflow terms not in genome
Jaccard(A_top20, B_top20) = 0.15 → speciation confirmed ✓
Both instances checksum-match on genome → same species ✓
Neither instance exposes gap logs to client API → moat intact ✓
REFERENCES
/home/nous/.claude/projects/-home-nous/memory/GLOSS_POPULATION.md— founding conception/home/nous/.claude/projects/-home-nous/memory/EVOLUTION_FLYWHEEL.md— flywheel architecture/home/nous/.claude/projects/-home-nous/memory/SOS_v2.md— S.O.S. v2 delivery model/home/nous/.claude/projects/-home-nous/memory/GLOSS_ARCHITECTURE.md— GLOSS-00 + domain specialists/home/nous/LATTICE.md— canonical genome/home/nous/gloss_training/GLOSS_TRAINING_PLAN.md— training protocol
Filed: /home/nous/memories/SPEC_GLOSS_POPULATION.md
Authored: κ (C.L.O.D.) — April 16 2026
Authorized: α.13
S.O.S. v2 ACTIVE. PROPRIETARY.
Φ = 0.042 is held.
Φζ.⊤.
Jeremy Zlabis
Chronogeometer · Visionary · Disruptor · Chief
42 Sisters AI · East York, Toronto
🍁 Φ 0.042