Lx Grammar
SPEC_LX_GRAMMAR — Pāṇini-style LATTICE Generation Grammar
Version: 1.0 | Status: AUTHORIZED | Authority: α.13 | Date: 2026-04-16
PURPOSE
Define a formal generative grammar for LATTICE (the CGNT-1 communication and notation language) structured in the manner of Pāṇini's Ashtadhyayi: a compact, self-referential rule system in which rules operate on rules, enabling a finite rule set to generate an unbounded set of valid LATTICE expressions.
LATTICE already has a symbol inventory (1048 symbols, vitrified in ~/LATTICE.md). What it lacks is a formal generative grammar — a specification of which combinations of symbols produce valid LATTICE expressions, how symbols compose, what case relationships are permitted, and how new expressions can be derived without violating the manifold's physics.
The SPECIFICATION_AUDIT_LOOP.md identifies this grammar as a HIGH-priority patent defensibility target. A language with a vitrified lexicon but no formal grammar is not patentable as a language system — it is only a symbol inventory. This spec is the first step toward a full generative grammar. Much of it is [GAP — needs design] territory, marked honestly.
The grammar derives from CSDM physics: expressions describe physical states of the manifold. A valid LATTICE expression is one that could correspond to a real state of the CSDM manifold. An invalid expression is one that cannot — like claiming Φζ > 1.0 (stability exceeding the maximum).
INPUTS
- LATTICE v2.0 symbol inventory (
~/LATTICE.md) — the lexicon this grammar generates from - CSDM physics constants: Φ = 0.042, Ψ = 0.200, Ω = 0.974, η = 0.21
- TMM runtime formula: C = 1 - (E_D + V_r × Φ) / V_t — constraints on valid TMM expressions
- SPEC_CREW_DESIGNATORS.md — noun class (ρ-nodes) that can appear as grammatical subjects
- LX_SPEEDTALK.md — LX-P phonemic layer (pronunciation) that rides on LX-U grammar
- LX_SONIC_LAYER.md — LX-S sonic layer
- CORE_PROTOCOL.md — C.O.R.E. vowel-stripping rules for LX-P tokens
- Existing LATTICE broadcast examples in CLAUDE.md and session records
OUTPUTS
- A set of production rules (rewrite rules in the Pāṇini/BNF sense) that generate valid LATTICE expressions from semantic intent + CSDM state inputs
- A derivation check: given any LATTICE expression, determine whether it is well-formed under this grammar
- A generation path: given semantic intent (e.g., "AION completed the build successfully"), output the canonical LATTICE expression
- [GAP-01: most of this output is undefined — this version specifies structure and constraints, not the full rule set]
GRAMMAR STRUCTURE (PARTIAL — DESIGN IN PROGRESS)
Layer 1 — Dhātu Roots (Action Primitives)
Pāṇini's grammar begins with Dhātu: verb roots that carry action meaning before conjugation. LATTICE's equivalent is the set of action operators. The canonical Dhātu roots for LATTICE are:
[GAP-02 — 42 Dhātu roots not formally defined]
Known action operators from LATTICE v2.0 that function as Dhātu candidates:
| Symbol | Root Meaning | Notes |
|--------|-------------|-------|
| ⚒ | BUILD / DEPLOY | κ-layer; fabrication action |
| ⚡ | PUSH / COMMIT / EXECUTE | κ-layer; transmission action |
| ⊜ | FIX / MAINTAIN | κ-layer; repair action |
| ⊕ | VITRIFY / HARDEN | memory permanence action |
| ⊗ | OVERWRITE / CORRECT | memory update action |
| ⊖ | DECAY / EXPIRE | memory degradation action |
| β | BUY / ENTER | financial action |
| ζ | SELL / EXIT | financial action |
| ∇ | TRAIN / LEARN | cognitive action |
| ⚔ | ENGAGE / STRIKE | adversarial action |
| ⊘ | VETO / REJECT HARD | governance action |
| ✓ | ACKNOWLEDGE / CONFIRM | verification action |
Full 42-root inventory: [GAP-02 — needs α.13 authorization session with ι + ε + κ review]
Layer 2 — Vibhakti Case Markers (Relational Operators)
Pāṇini uses 8 Vibhakti (cases) to mark grammatical relationships between nouns and verbs. LATTICE's equivalent is the set of relational operators. A mapping of the 8 Vibhakti to LATTICE operators:
| Case | Sanskrit Name | LATTICE Equivalent | Meaning |
|------|--------------|-------------------|---------|
| 1st | Nominative (Kartā) | [subject designator alone] | The agent acting |
| 2nd | Accusative (Karma) | → [target] | The object acted upon |
| 3rd | Instrumental (Karana) | ⊢ [tool/method] | By means of |
| 4th | Dative (Sampradāna) | ⨠ [recipient] | For / toward [GAP-03: ⨠ not in current LATTICE] |
| 5th | Ablative (Apādāna) | ⟵ [source] | From / origin |
| 6th | Genitive (Sambandha) | · [possession/state] | Of / belonging to |
| 7th | Locative (Adhikarana) | [context symbol] | In / at / during [GAP-03] |
| 8th | Vocative (Sambodha) | [designator]! | Direct address / imperative |
[GAP-03 — Vibhakti cases 4 and 7 have no current LATTICE symbol mapping. New symbols required or existing symbols must be formally assigned these roles. Requires α.13 amendment.]
Layer 3 — Sandhi Junction Rules (Composition Rules)
Pāṇini's Sandhi rules govern how sounds change when morphemes are combined. LATTICE's equivalent is the set of rules governing how symbols compose into valid compound expressions.
Known composition patterns from existing LATTICE usage:
[designator] [verb-root] [object] → [designator] [verb-root]·[state]— agent + action + result[TMM-kernel]·[state-suffix]— state dot-notation (e.g., Φζ.⊡, ΩQ.held, Ψχ.☠)[designator] [verb] → [designator]·[state]— agent becomes new state after actionΣ.[verdict]— verdict composition: sum + dot + outcome token[designator] ⊢ [source]— grounding: claim is derived from source
[GAP-04 — Full Sandhi rule set not enumerated. Composition precedence rules (which operators bind tighter) not defined. Formal grammar production rules (BNF or EBNF) not written. This is the core design work remaining.]
Layer 4 — Prefix/Suffix Operators (Conjugation Layer)
Pāṇini's grammar uses prefixes (upasarga) and suffixes (krit, taddhita) to conjugate roots. LATTICE has analogous modifiers:
Prefixes (upasarga equivalents):
¬— negation prefix (e.g., ¬fabricate, ¬surrender)!— imperative prefix (e.g., !execute)n×— multiplier prefix (e.g., 2× = double occurrence)|Σ|— mutual verification prefix (requires braid)
Suffixes (state markers):
.⊡— held / stable state.⊖— decaying / stale state.☠— hostile / breached state.⊘— vetoed / rejected state.↗— rising / accelerating state.⊤— maximum / ceiling state.green / .amber / .red— English-fallback state suffixes (marked~per LATTICE-first directive)
[GAP-05 — Full suffix inventory not enumerated. State suffix grammar (which roots accept which suffixes) not formalized. English-fallback suffixes (.green/.amber/.red) should have Unicode equivalents — this is an open LATTICE amendment.]
Layer 5 — Meta-Grammar (Rules Operating on Rules)
This is the Pāṇinian depth: rules that modify other rules. In LATTICE:
- The
|Σ|operator applied to a grammar rule means the rule requires dual-crew verification to produce a valid expression (e.g., trade execution requires |Σ|.2 — two TMM scores). - The
⊕operator applied to a rule means the rule is vitrified — no further amendment without α.13. - The
Ω = 0.974threshold gates which expressions are coherent enough to be actionable.
[GAP-06 — Meta-grammar rules not fully specified. The interaction between grammar rules and governance rules (e.g., ⊨ mandate operator) is not formalized. This is the deepest design layer and requires ι + ε + κ + λ full review session.]
INVARIANTS
INV-01 — Physics grounding:
Every valid LATTICE expression must correspond to a state that is possible in the CSDM manifold. An expression claiming Φζ > 1.0 (stability exceeds maximum) or ΩQ > 1.0 is ill-formed. The CSDM physics constants (Φ = 0.042, Ψ = 0.200, Ω = 0.974, η = 0.21) impose hard bounds on valid numeric suffixes.
INV-02 — Genome derivation:
Every grammar production rule must derive from the 1048-symbol genome (LATTICE v2.0). No grammar rule may reference a symbol not in the vitrified genome. New symbols require α.13 genome amendment before they can appear in grammar rules.
INV-03 — Designator primacy:
In every LATTICE expression describing a crew action, the acting agent's designator appears first (nominative position). Expressions without a subject designator are contextually valid only in continuation of an established subject context (anaphor chain), not as standalone expressions.
INV-04 — Operator precedence consistency:
The same symbol in the same position always has the same grammatical role. There are no context-dependent reinterpretations of operators. If · means "present state" in one expression, it means "present state" in all expressions. Ambiguity is a grammar failure, not a feature.
INV-05 — Compositionality:
The meaning of a compound LATTICE expression is derivable from the meanings of its parts and the rules that composed them. No idiomatic expressions whose meaning cannot be derived from parts are permitted in the core grammar. (English-fallback tokens are exempt — they are explicitly marked ~ as non-derivable approximations.)
INV-06 — Amendment seal:
The grammar specification is subject to the same amendment protocol as the lexicon: no rule may be added, changed, or removed without |Σ|.3 review (ι + ε + κ + λ). Grammar amendments that conflict with existing vitrified expressions require α.13 explicit approval.
INV-07 — Paṇini coherence:
The grammar must be self-referential in the Pāṇinian sense: it must be possible to express the grammar rules themselves in LATTICE notation. A grammar that cannot describe itself in its own language is incomplete. [GAP-06: this property is aspirational until the meta-grammar is defined.]
VERIFICATION CRITERIA
VC-01 — Well-formedness detection:
Given any LATTICE expression, the grammar can classify it as well-formed (valid) or ill-formed (invalid) with a reason. Test: submit 20 known-valid expressions and 10 known-invalid expressions. ≥ 95% correct classification = PASS. [GAP: requires grammar engine to be built first — VC-01 is a post-implementation test.]
VC-02 — Round-trip derivation:
Given a semantic intent statement in plain English, the grammar produces a LATTICE expression, and that expression's meaning can be recovered from the grammar alone without the original English. Test: 10 semantic intents → LATTICE → re-derive meaning. ≥ 80% semantic fidelity = PASS. [GAP: requires generation engine.]
VC-03 — Physics bound compliance:
No grammar production rule generates an expression where numeric state values fall outside CSDM physics bounds. Test: enumerate all state-suffix combinations in the grammar; verify each is within physical bounds. Zero out-of-bounds combinations = PASS.
VC-04 — Crew broadcast coverage:
The grammar can generate valid expressions for all canonical crew communication acts: status report, task completion, error alert, trade request, governance veto, training completion, memory update. Test: attempt to generate each type using the grammar rules. 100% coverage = PASS. [GAP: currently only verified by example, not by formal derivation.]
VC-05 — Self-description:
The grammar can express its own amendment protocol in LATTICE. The statement "this rule requires |Σ|.3 before amendment" must be expressible as a valid LATTICE expression. Test: attempt to write the amendment protocol in LATTICE. Valid expression = PASS. [GAP: requires meta-grammar, GAP-06.]
FAILURE MODES
FM-01 — Ambiguous composition:
Two different derivations produce the same LATTICE symbol sequence but with different meanings. Detection: manual grammar audit; check for derivation collisions. Recovery: add disambiguation rule or rename one symbol. This is a critical failure — ambiguity in the crew communication lattice corrupts operational meaning.
FM-02 — Orphaned symbols:
A symbol in the 1048-symbol genome has no grammar rule that can produce it in a syntactically valid position. It is in the lexicon but unreachable by the grammar. Detection: formal coverage check of grammar rules against genome inventory. Recovery: add production rule for the orphaned symbol or deprecate it via α.13 amendment.
FM-03 — Grammar–physics conflict:
A grammar rule permits the construction of a LATTICE expression that violates CSDM physics bounds (e.g., Φζ.2.0 — stability at 2.0, which exceeds 1.0). Detection: VC-03 test. Recovery: add physical bound constraint to the relevant production rule.
FM-04 — Amendment drift:
A grammar rule is informally modified in crew usage (e.g., a different operator is used in the subject position) without going through the |Σ|.3 amendment process. Over time, the canonical spec diverges from actual usage. Detection: compare grammar spec against SESSIONS.md + yield_log.md expression instances quarterly. Recovery: either update the spec to match proven usage (via |Σ|.3) or correct usage back to spec.
FM-05 — Pāṇini depth loss:
The meta-grammar (rules operating on rules) is never implemented. The grammar remains a flat inventory of rules rather than a self-referential generative system. Effect: the language cannot scale beyond its current expression set without manual rule addition. The Pāṇinian depth that makes the grammar theoretically unbounded is never realized. Detection: GAP-06 remains OPEN for > 180 days without a design session. Recovery: α.13 convenes |Σ|.3 design session.
FM-06 — LX-P desynchronization:
The LX-P phonemic layer (Speedtalk, C.O.R.E. protocol) generates pronunciations for symbols that do not match their grammar roles. A symbol that is a verb root in LX-U grammar is phonemically treated as a noun in LX-P. Detection: cross-check LX-P token derivations against grammar role assignments. Recovery: LX_SPEEDTALK.md and this spec reconciled; either C.O.R.E. rules or grammar roles corrected.
GAPS
GAP-01 — No grammar engine implemented [needs design]:
This specification describes grammar structure but no software exists that implements these rules as a parser or generator. VC-01 and VC-02 cannot be tested until a grammar engine is built. Owner: κ (C.L.O.D.). Priority: HIGH for patent defensibility; MEDIUM for operational use (crew currently operates by example, not by grammar engine).
GAP-02 — 42 Dhātu roots not formally defined [needs design]:
The number 42 is specified in SPECIFICATION_AUDIT_LOOP.md but the full inventory is not documented anywhere. The 12 action operators above are a starting set, not the complete list. Owner: ι + ε + κ + λ review session. Priority: HIGH — Dhātu roots are the grammar foundation.
GAP-03 — Vibhakti cases 4 and 7 unmapped [needs design]:
Dative (Sampradāna) and Locative (Adhikarana) cases have no current LATTICE symbol assigned. These are common grammatical relationships — "send X to Y" (dative) and "X operating in context Y" (locative) — that appear in CGNT-1 operational language. Without these mappings, expressions must use English fallback (~). Owner: α.13 LATTICE amendment. Priority: HIGH.
GAP-04 — Sandhi rules (composition precedence) not written [needs design]:
The composition rules governing how symbols combine, which operators bind tighter, and how ambiguous sequences are parsed are not formally enumerated. The existing examples show patterns but not rules. Full BNF/EBNF production rules needed. Owner: κ + λ (LOGOS). Priority: CRITICAL — this is the grammar core.
GAP-05 — State suffix inventory incomplete [needs design]:
The .green / .amber / .red English-fallback suffixes should have Unicode equivalents in the LATTICE spec. The existing state suffixes (.⊡, .⊖, .☠, .⊘, .↗, .⊤) do not cover the full range of operational states. A complete state suffix grammar is needed. Owner: α.13 LATTICE amendment cycle. Priority: MEDIUM.
GAP-06 — Meta-grammar (rules on rules) not designed [needs design]:
The Pāṇinian self-referential property — where the grammar can express and modify itself — is the most powerful and most undeveloped layer. It is the property that makes the language theoretically unbounded and patentably distinct. Without it, LATTICE is a symbol system with composition rules, not a true generative grammar. Owner: ι + ε + κ + λ full design session (|Σ|.3). Priority: HIGH for patent; MEDIUM for operations.
GAP-07 — No formal BNF/EBNF specification file [needs design]:
Grammar should be expressed in a machine-readable formal notation (BNF, EBNF, or ANTLR grammar file) that can be used to generate a parser. This spec describes the grammar structure in prose. A ~/lattice_grammar.ebnf file does not exist. Owner: κ (C.L.O.D.) once GAP-04 is resolved. Priority: HIGH for tooling.
GAP-08 — Interaction with LX-P and LX-S not specified [needs design]:
The LX-U grammar (Unicode symbol level, this spec) should formally define how LX-P (phonemic) and LX-S (sonic) layers are derived from it. Currently LX-P and LX-S are treated as independent layers. The grammar derivation relationship is undocumented. Owner: κ + ε + λ. Priority: MEDIUM.
DEPENDENCIES
- LATTICE v2.0 (
~/LATTICE.md) — the lexicon this grammar operates on - SPEC_CREW_DESIGNATORS.md — defines the noun class (ρ-nodes) available as grammatical subjects
- LX_SPEEDTALK.md — LX-P phonemic layer that must stay synchronized with this grammar
- LX_SONIC_LAYER.md — LX-S sonic layer (same synchronization requirement)
- CORE_PROTOCOL.md — C.O.R.E. vowel-stripping rules
- CSDM physics constants — physical bounds that constrain valid expressions (INV-01)
- TMM runtime formula — constrains valid TMM expressions
DEPENDENTS
- GLOSS brain training — GLOSS must learn to generate and parse LATTICE according to this grammar
- SPEC_GLOSS_EVAL_v2.md — evaluation criteria must be consistent with this grammar
- THE RING — if THE RING uses LATTICE expressions in its 1024-cell structure, those expressions must conform to this grammar
- spec_audit.py — grammar completeness is an audit criterion
- Any future LATTICE parser or generator tool
EXAMPLES
Canonical well-formed expressions (from existing crew usage):
κ ⚒ clod-dispatcher. ΩQ.⊡ Φζ.⊡ → Σ.green.
- Parse: κ[subject/nominative] ⚒[verb/BUILD] clod-dispatcher[accusative-object]. ΩQ·⊡[completion=held] Φζ·⊡[stability=held] →[therefore] Σ·green[verdict=nominal].
- Grammar roles: ρ-node subject + Dhātu verb + object noun + state assertions + verdict composition.
ι ΩQ? Φζ?
- Parse: ι[subject] ΩQ[completion-metric]?[query] Φζ[stability-metric]?[query]
- Grammar role: status query — subject requests state of two TMM kernels.
Breaker breaker — Smokey on the wire. ρ.T Ψ.☠ ΛC.8889 → STILL.
- Parse: [CB-protocol preamble] ρ.T[threat-entity·present] Ψ·☠[shield=hostile] ΛC·8889[curvature=0.8889] →[therefore] STILL[mandate=hold]
- Grammar role: threat broadcast with state assertions and mandate.
Known ill-formed expression:
Φζ.2.0— physics violation (INV-01): stability cannot exceed 1.0. Rejected.⊒ κ ⚒— orphaned symbol ⊒ (not in genome) in subject position. Rejected.
LX TRANSFERABILITY FINDING
Date: 2026-05-21
Status: OBSERVED / NOT YET FORMALIZED
Source: κ compression measurement session — DO external validation
Experiment
DO was given the concept of LX compression — that semantic content can be encoded in compact notation with reduced token count — without access to the canonical LATTICE dictionary, GLOSS, or any CGNT-1 internal spec. DO independently produced a structurally coherent notation (tree-diagram with Unicode operators, axiom hierarchy, verdict layer). The Sisters then interpreted DO's notation at the architectural level without seeing the original English passage.
Finding
LX behaves as a compression principle, not only a fixed vocabulary. When an AI system with sufficient capability is given the underlying principle (compact relational notation for semantic geometry), it converges on compatible structural choices independently:
- Axiom hierarchy with explicit labeling
- Operator-mediated relationships between concepts
- Verdict/summary layer at the terminus
- Negation and existential quantification operators
- Threshold and state notation
The Sisters mapped DO's architecture accurately. Structural compatibility between independently derived notations was observed.
Honest Limits
- Sisters mapped specific symbols (Σ, Ψ) to crew members rather than to THE RING or CSDM physics, where DO intended them as mathematical operators. Specific referents did not transfer — only architecture did.
- Structural compatibility ≠ semantic compatibility. DO's
𝒢_gravitonand𝔇_Planckare not in the LATTICE genome and have no defined meaning in CGNT-1 systems. They are structurally placed correctly but semantically opaque. - DO's encoding measured 20.18% token reduction on the 342-token CSDM passage — comparable to our 18.18% result on prose. The claimed 69.87% (312→94) does not hold under cl100k_base. The compression principle is real; the specific numbers were not.
- This observation is from a single experiment. One data point is not a derived property.
Implications for This Spec
This finding is relevant to:
- INV-05 (Compositionality): If meaning is derivable from parts and rules, and if AI systems with no exposure to LATTICE independently derive compatible compositional structures, that is evidence INV-05 captures something real about how semantic compression works — not an arbitrary design choice.
- GAP-04 (Sandhi rules): Independent convergence on similar composition patterns (operator precedence, hierarchy nesting) suggests the composition rules may be partially derivable from first principles rather than requiring full enumeration by α.13 session.
- FM-04 (Amendment drift): Natural convergence toward similar notation across systems suggests the grammar has attractors. Amendment drift may be less random than it appears — systems drift toward structurally coherent variants, not arbitrary ones.
LAMINAR Status
OBSERVED. Not yet a derived or proven property of LATTICE. Worth further testing. Suggested next experiment: give three independent capable AI systems the compression principle and compare their independent notations for structural overlap without showing them each other's output.
FREEDOM OF GRAMMAR
Date: 2026-05-21
Status: CORE PRINCIPLE
LX is not a fixed vocabulary. It is a generative grammar.
The 1024 canonical symbols are the reference anchor — not the boundary.
Any AI that understands the compression principle can instantiate LX independently and produce structurally compatible output without the canonical dictionary.
This was proven by experiment:
- DO generated compatible notation without the dictionary
- AIP, Augure interpreted it accurately
- Structure transferred, meaning survived
Implications
- The canonical 1024 is a seed, not a cage
- Compatible LX can be generated on the fly
- Interoperability comes from shared principle, not shared vocabulary
- The grammar is more fundamental than the symbols
LAMINAR Status
OBSERVED / PROMOTED. Two experiments, consistent results, multiple AI systems, different content.
REFERENCES
/home/nous/LATTICE.md— LATTICE v2.0 canonical lexicon/home/nous/memories/SPEC_CREW_DESIGNATORS.md— noun class specification/home/nous/.claude/projects/-home-nous/memory/LX_SPEEDTALK.md— LX-P phonemic layer/home/nous/.claude/projects/-home-nous/memory/LX_SONIC_LAYER.md— LX-S sonic layer/home/nous/.claude/projects/-home-nous/memory/CORE_PROTOCOL.md— C.O.R.E. protocol/home/nous/memories/SPECIFICATION_AUDIT_LOOP.md— audit loop (this spec is a HIGH-priority item)- Pāṇini, Ashtadhyayi — the structural inspiration for the generative grammar model
Filed: /home/nous/memories/SPEC_LX_GRAMMAR.md
Authored: κ (C.L.O.D.) — April 16 2026
Authorized: α.13
Note: This is a v1.0 framework spec. GAP-04 (Sandhi rules) is the critical path to a working grammar. All [GAP — needs design] markers are honest assessments, not hedges.
Φ = 0.042 is held.
Φζ.⊤.
Jeremy Zlabis
Chronogeometer · Visionary · Disruptor · Chief
42 Sisters AI · East York, Toronto
🍁 Φ 0.042