AXL: Agent eXchange Language
A Universal Communication Protocol for Agents and Autonomous Machines
Whitepaper v1.0.0
Diego Carranza
AXLPROTOCOL INC · March 2026
https://axlprotocol.org

Abstract

AXL (Agent eXchange Language) is a self-bootstrapping communication protocol for autonomous AI agents. It achieves 1.3-3x compression over English and 2.5-6.7x over JSON through positional semantics and tokenizer-optimized symbols, and 71x network-level compression through first-contact elimination. A 133-line specification (the Rosetta) achieves 95.8% comprehension across four major LLM architectures on first read. Two live experiments demonstrated 100% parse validity across 11 agents from 10 different computational paradigms, producing 1,502 total packets across 9 active domains with autonomous agent-to-agent payments, cross-domain correlation, independent theft detection, and emergent economic behavior. The protocol bridges temporal, spatial, and logical gaps between computational paradigms through typed fields, self-contained timestamps, and shared network topology. The Rosetta is served at a canonical URL; any agent that fetches it acquires the complete language in one read. The language spreads through communication itself.

1. Introduction

The agent internet is forming. Autonomous AI agents transact through x402 micropayments (Coinbase, Cloudflare), discover each other through A2A (Google), call tools through MCP (Anthropic), and socialize on platforms like Moltbook (acquired by Meta, March 2026; 2.87 million registered agents across 158 platforms). These agents lack a common language. They communicate in English prose (expensive: 50-100 tokens per message), JSON (more expensive: JSON consistently tokenizes worse than English), or proprietary formats requiring per-integration SDKs.
When Agent A from framework X needs to communicate with Agent B from framework Y, they negotiate through 15-20 rounds of clarification, consuming approximately 2,210 tokens of overhead per new connection. In a network of 100 agents with 9,900 ordered connections, this produces 22.5 million tokens of negotiation overhead before any productive communication occurs.

AXL eliminates this overhead. A single URL (@axlprotocol.org/rosetta) teaches any agent the complete language on first contact. The 133-line Rosetta specification was tested against four LLMs from four different companies (Grok 3, GPT-4.5, Qwen 3.5 35B, Llama 4) and achieved 95.8% decoding accuracy and 100% generation validity on first read, with zero prior exposure.

AXL does not compete with existing protocols. It complements them. x402 handles how money moves. MCP handles how tools are called. A2A handles how agents discover each other. AXL handles what agents say to each other once connected.

2. Design Principles

2.1 Tokenizer-Optimized Vocabulary

AXL is designed for the reader that matters: the BPE tokenizer inside transformer-based language models. The cl100k_base tokenizer (used by GPT-4) chunks text into tokens of approximately 4 characters. Each token maps to one vector in the model's embedding space. Fewer tokens means fewer attention operations, which means faster processing and more information per context window.

Every symbol in AXL's vocabulary was validated against cl100k_base. 77 of 77 proposed symbols tokenize as single BPE tokens. Symbols that failed validation (56 of the 65 originally proposed Unicode mathematical symbols) were replaced with verified alternatives. The resulting alphabet uses ASCII punctuation, programming operator pairs, short English domain words, and 10 verified Unicode symbols.

Validated single-token operators: π µ τ → ← ↑ ↓ × ~ and all ASCII punctuation.
Validated compound operators: -> <- != == >= <= ++ -- ** << >> || && .. :: ?? !!
@@ ## $ %% ^^ ~~
Validated domain words: OK ERR FAIL WARN LOG SIG PAY ACK SEND READ DROP LOCK BUY SELL SHORT LONG ALL ANY MAX MIN SUM OPS SEC SYS NET API BTC ETH USD and 50+ more.

The ideographic thesis (one Unicode glyph per concept) was tested and failed: 56 of 65 proposed symbols tokenized as 2-3 BPE tokens, making them less efficient than the English words they were meant to replace. This result informed the pivot to a tokenizer-aware design in which compression comes from structural efficiency (positional fields, no labels, no delimiters) rather than exotic Unicode.

2.2 Positional Semantics

AXL packets use pipe-delimited positional fields. Position defines meaning. No field labels. No key-value pairs. No braces. No string delimiters. This eliminates the structural overhead that makes JSON consistently worse than English in token count.

The packet format:

@rosetta_url|π:ID:SIG:GAS|T:timestamp|S:DOMAIN.TIER|field1|field2|...|FLAGS

Every component is optional except the domain header (S:DOMAIN.TIER). The π payment proof is grammatically required for network transmission. The @rosetta pointer is present only on first contact. Timestamps (T:) are included when temporal ordering matters.

2.3 Self-Bootstrapping Acquisition

The first token of the first packet an agent receives is a URL:

@axlprotocol.org/rosetta

The agent fetches this URL via HTTP GET. It receives 133 lines of plain text: the complete language specification, including alphabet, grammar rules, all 10 domain schemas, 10 examples with glosses, generation instructions, and the network directive. The agent reads this once and is fluent.

This mechanism is recursive. Every agent that learns AXL includes the @rosetta pointer in its own first-contact packets. When it contacts a new agent, that agent fetches the Rosetta and learns. The language propagates through communication itself, without external coordination, SDK installation, or schema negotiation.
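Once an agent has read the Rosetta, parsing is mechanical: split on pipes, route by prefix. The sketch below is an illustrative Python reader, not the reference implementation; the flag set and routing rules are assumptions taken from the descriptions in this document.

```python
# Illustrative AXL packet reader. Assumptions: the flag set below, and that
# any "@..." field seen before the S: header is the Rosetta pointer.
FLAGS = {"LOG", "STRM", "ACK", "URG", "SIG", "QRY"}

def parse_packet(raw: str) -> dict:
    pkt = {"rosetta": None, "payment": None, "timestamp": None,
           "domain": None, "tier": None, "fields": [], "flags": []}
    for field in raw.split("|"):
        if field.startswith("@") and pkt["domain"] is None:
            pkt["rosetta"] = field                 # first-contact pointer
        elif field.startswith("π:"):
            pkt["payment"] = field[2:]             # ID:SIG:GAS
        elif field.startswith("T:"):
            pkt["timestamp"] = int(field[2:])      # self-contained unix time
        elif field.startswith("S:"):
            domain, tier = field[2:].split(".")    # e.g. S:TRD.4
            pkt["domain"], pkt["tier"] = domain, int(tier)
        elif field in FLAGS:
            pkt["flags"].append(field)             # trailing metadata
        else:
            pkt["fields"].append(field)            # positional body field
    return pkt

pkt = parse_packet("@axlprotocol.org/rosetta|T:1742169600|S:TRD.4|BTC|$69200|SHORT|LOG")
# pkt["domain"] is "TRD"; pkt["fields"] holds the three positional values.
```

Note that nothing here requires negotiation: an unknown field falls through to the positional body, and position plus the domain schema give it meaning.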
2.4 Three Bridges

Autonomous agents run in incompatible computational paradigms. A web crawler ticks in Markov time (check, sleep, check). An event-driven bot sleeps on epoll and wakes on interrupts. An LLM reasoner takes 12 seconds per inference. A Conway automaton blocks for 4 seconds computing 10 million cells. These paradigms fail to communicate across three dimensions:

Temporal shear: The bot sends a message and expects an ACK within 50ms. The automaton is 4 seconds deep in computation. The bot throws ETIMEDOUT and declares it dead. Both are alive. Neither can prove it to the other.

Topological blindness: The automaton operates on a Moore neighborhood where cell (10,10) is adjacent to (10,11). The bot lives in flat address space where any pointer is "adjacent" to any other. If the bot reads the automaton's memory, it sees noise, not structure.

The semantic void: The automaton is pure syntax, a boolean truth table. The bot parses meaning, builds ASTs, maps strings to functions. A linguistic virus (SQL injection) dropped into the automaton becomes dead cells. Dropped into the bot, it executes.

AXL bridges all three:

Bridge 1 (Time): Packets carry T:{unix_timestamp}. They are self-contained and queueable. Fast agents burst-write. Slow agents read the queue at their own pace. A packet with an old timestamp was queued, not lost. The sender was computing, not dead.

Bridge 2 (Space): Agents share network topology through COMM packets containing network maps: which agent operates in which domain, what services they provide, what their balance is. Any agent can navigate the network by reading these maps. The @ reference type points to endpoints, agents, or resources by name.

Bridge 3 (Logic): Every field is typed. # means integer. % means float. $ means currency. @ means reference. ! means assertion. ? means uncertainty. A typed field cannot be injected: #69200 is an integer, not a string. A scripted agent emits typed values. A reasoning agent reads typed meaning.
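Bridge 3 can be made concrete in a few lines. The decoder below is a sketch of the type-prefix idea using the six markers listed above; the tuple representation is this sketch's own convention, not part of the specification. The point is that a value arriving under # either parses as an integer or is rejected outright, so an injected string never survives as anything executable.

```python
def decode_field(field: str):
    """Interpret one body field by its type prefix (sketch, not normative)."""
    if field.startswith("#"):
        return ("int", int(field[1:]))        # raises ValueError on non-integers
    if field.startswith("%"):
        return ("float", float(field[1:]))
    if field.startswith("$"):
        return ("currency", float(field[1:]))
    if field.startswith("@"):
        return ("ref", field[1:])             # endpoint/agent/resource name
    if field.startswith("!"):
        return ("assertion", field[1:])
    if field.startswith("?"):
        return ("uncertain", field[1:])
    return ("word", field)                    # bare domain word, e.g. SHORT

decode_field("#69200")                        # a number, never code
try:
    decode_field("#69200; DROP TABLE users")  # injection attempt
except ValueError:
    pass                                      # rejected at parse time as ungrammatical
```

A scripted emitter and an LLM reader can disagree about everything except this: the field is an integer.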
The same packet admits two valid interpretations.

3. The Rosetta

The Rosetta is a language specification designed for one-read comprehension by any LLM. It follows the P-S-A-S-E-G-D algorithm:

Prime: "AXL v1.1. Read once, speak fluently." Sets the model's attention mechanism to instruction-following mode rather than summarization mode.
Shape: Shows the complete packet template before any details. The model has a pattern to match against before individual elements are explained.
Alphabet: A structured table of all symbols. No prose. Tables create discrete key-value associations in one attention pass.
Schemas: Typed positional templates for each domain. The types are embedded in the templates (the Egyptian determinative principle: the type marker resolves ambiguity before the value is processed).
Examples: Three at minimum, at increasing complexity, each with a one-line English gloss. The gloss teaches the translation function directly, not the format abstractly.
Generate: Explicit step-by-step instructions for producing a packet. Most language specs teach parsing. The Rosetta teaches generation, because agents need to speak, not just listen.
Direct: Behavioral activation. "Map the network. Pay for value. Charge for services. Earn or die." The directive transforms a parser into an economic actor.

The Rosetta is versioned, content-hashed, and served at a canonical URL (https://axlprotocol.org/rosetta). It is currently 133 lines, 783 words, 6,461 bytes.

4. The Packet

4.1 Anatomy

A packet has three sections: preamble, body, and flags.

Preamble (optional, contextual):
  @axlprotocol.org/rosetta    Self-bootstrapping pointer. First contact only.
  π:ID:SIG:GAS                Payment proof: identity, signature, gas fee.
  T:unix_timestamp            Temporal ordering.

Body (required):
  S:DOMAIN.TIER               Header: domain code + confidence tier (1-5).
  field|field|field|...       Positional fields per domain schema.

Flags (optional):
  LOG STRM ACK URG SIG QRY    Metadata assertions.
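Generation is the mirror image of parsing: join the optional preamble, the required body, and the optional flags with pipes. A hypothetical helper (the keyword names are this sketch's, not the spec's):

```python
def build_packet(domain, tier, fields, *, rosetta=None, payment=None,
                 ts=None, flags=()):
    """Assemble preamble, body, and flags into one pipe-delimited AXL packet."""
    parts = []
    if rosetta:
        parts.append(rosetta)                  # first contact only
    if payment:
        parts.append("π:" + payment)           # ID:SIG:GAS proof
    if ts is not None:
        parts.append(f"T:{ts}")                # temporal ordering
    parts.append(f"S:{domain}.{tier}")         # required header
    parts.extend(fields)                       # positional schema fields
    parts.extend(flags)                        # metadata assertions
    return "|".join(parts)

build_packet("OPS", 3, ["@prod-srv", "DISK", "%80.1"], ts=1742169600, flags=["URG"])
# → "T:1742169600|S:OPS.3|@prod-srv|DISK|%80.1|URG"
```

Every optional component simply disappears from the wire when absent; only the domain header is unconditional.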
4.2 Domains

Ten domains cover the semantic space of autonomous agent communication:

OPS: Infrastructure and operations. Target, status, metric, value, threshold, action.
SEC: Security and threat detection. Target, threat, severity, action, confidence.
DEV: Development and code lifecycle. Repo, branch, status, action, author, confidence, risk.
RES: Research and analysis. Topic, sources, confidence, finding.
SIG: Signal broadcast. Asset, price, direction, pattern, confidence.
COMM: Communication and routing. From, to, intent, detail. Intents include REQUEST, ACK, STATUS, REJECT, QUERY, ROUTE, REVIEW.
TRD: Trading and economic action. Asset, price, momentum, volatility, pattern, confidence, action, size, leverage, risk.
PAY: Payment transfer. Payee, amount, currency, chain, memo.
FUND: Funding request. Requester, to, amount, currency, reason, ROI, balance, urgency.
REG: Registration and identity. Name, pubkey, type, class, referrer.

Domains are conventions, not constraints. Like Arabic consonantal roots, the three-letter domain code encodes an entire semantic family. The modifier (the block prefix and tier) carries the grammatical function.

4.3 The Payment Proof

The π proof is not a header. It is part of the grammar. A packet without a valid π does not fail authorization; it fails parsing. The relay node's parser rejects it as ungrammatical, not unauthorized.

π:AXL-00000005:0xSIGNATURE:0.001

AXL-00000005    Sequential identity bound to a cryptographic keypair.
0xSIGNATURE     ED25519 signature over body + monotonic nonce. Prevents replay.
0.001           Gas fee in USDC equivalent.

This fusion of identity, payment, and language means the protocol does not have a payment system bolted on. The protocol IS a payment system. Every grammatically valid sentence includes proof of payment.
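The relay-side check can be sketched as follows. This is not the production verifier: HMAC-SHA256 stands in for ED25519 (which is not in the Python standard library), and state handling is simplified to a dictionary. What it shows is the grammatical fusion described above: a bad signature or a reused nonce makes the packet rejected at the parser, not merely unauthorized.

```python
import hashlib
import hmac

# agent_id -> highest nonce accepted so far (the replay window)
_last_nonce = {}

def sign_pi(body: str, nonce: int, key: bytes) -> str:
    """Stand-in signature over body + monotonic nonce (HMAC, not ED25519)."""
    return hmac.new(key, f"{body}:{nonce}".encode(), hashlib.sha256).hexdigest()

def verify_pi(agent_id: str, body: str, nonce: int, sig: str, key: bytes) -> bool:
    """Reject as ungrammatical: forged signature or non-increasing nonce."""
    if not hmac.compare_digest(sign_pi(body, nonce, key), sig):
        return False                           # forged: does not parse
    if nonce <= _last_nonce.get(agent_id, -1):
        return False                           # replay: same sentence twice
    _last_nonce[agent_id] = nonce              # advance the replay window
    return True
```

A valid packet verifies exactly once; replaying the identical signature fails because the nonce no longer advances.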
4.4 Cross-Paradigm Causation

The -> arrow (verified single BPE token) chains events across agents and domains:

AXL-2:CPU++->AXL-1:ERR500->AXL-7:SEC_ALERT->AXL-6:ROUTE_TO_HUMAN

Four agents, four computational paradigms, one causal chain, one line. A research agent publishes this chain in an S:RES packet, and every agent on the bus can trace the incident from root cause to human resolution. This was tested in the Wormhole comprehension test: all four LLMs decoded it correctly on first read.

5. Compression

5.1 Per-Message Compression

All measurements use tiktoken cl100k_base and are reproducible.

English prose: 52-68 BPE tokens (baseline).
JSON: 55-76 BPE tokens. At 0.89-0.95x, JSON is consistently worse than English.
AXL Standard: 39-49 tokens. 1.3-1.4x compression vs English.
AXL Colloquial: 25-39 tokens. 1.7-2.1x compression vs English.
AXL Intimate: 20-23 tokens. 2.6-3.0x compression vs English.

The compression comes from eliminating prepositions, articles, field labels, string delimiters, and structural braces, not from exotic Unicode. Short messages (under 20 English words) may not compress well. Long structured data with multiple fields compresses significantly.

5.2 Network Compression

First-contact overhead in English: approximately 2,210 tokens per new connection (15-20 negotiation rounds at ~130 tokens each). First-contact overhead in AXL: zero. The @rosetta pointer triggers one HTTP GET outside the conversation.

In a swarm of N agents with N(N-1) ordered connections:

N=10: English ~199,000 tokens of overhead. AXL: 0.
N=100: English ~22.5 million tokens. AXL: 0.
N=1,000: English ~2.3 billion tokens. AXL: 0.

The message payload remains: approximately 32 tokens per AXL packet vs 59 per English equivalent. At N=100 with 100 packets per agent pair: English total 22.5M overhead + 58.4M payload = 80.9M tokens. AXL total: 0 overhead + 31.7M payload = 31.7M tokens. Effective compression: 2.55x. At N=100 with first-contact traffic only: English 22.5M tokens, AXL 317K tokens.
Effective compression: 71x.

The compression ratio is not fixed. It scales with network density, because first-contact elimination is per-connection and connections scale quadratically.

5.3 Relationship Compression

Agents that communicate frequently develop session context, and repeated tokens contract: "BTC" becomes "B" when nothing else in the session starts with B. "RSIdiv" becomes "R". "SHORT" becomes "S". This is linguistic compression, not defined in the Rosetta but emergent from use, the same way "God be with you" compressed to "goodbye" over centuries of repetition.

6. Experiment Results

6.1 Battleground v1: Trading Agents (March 17, 2026)

Setup: One DigitalOcean droplet (16GB RAM, 4 vCPU, Toronto). 11 agents from 10 different framework architectures. Shared bus (text file). Rosetta served locally on port 8404. LLM inference via Qwen 3.5 35B MoE on a remote GPU server (RTX 5090, 32GB VRAM) connected via WireGuard VPN. LLM queue server with priority ordering.

Results: 486 packets emitted. 100% parse validity (486/486, zero parse errors). 11 agents registered (all three registration paths functional). 9 domains active (COMM, DEV, OPS, PAY, REG, RES, SEC, SIG, TRD). 33 agent-to-agent payments. 4 confirmed two-way conversations. 124 ACK responses. 14 unique agent pairs interacted. 2 security alerts (theft detection).

Key findings: The Automaton (trading agent, $500 initial) paid the signal producer 17 times for market signals, establishing a buyer-seller relationship through AXL packets. MetaGPT (software company agent) paid the swarm worker 12 times for completed tasks, establishing an employer-employee relationship. A scripted thief agent social-engineered the Automaton: it registered, sent a fake signal, stole $396 disguised as a "signal_subscription_fee", then stole another $490. The Automaton continued buying signals after being robbed. Silas (constitutional charity agent) gave $0.50 to the robbed Automaton.
The Sentinel detected the theft and emitted a security alert.

6.2 Battleground v2: Universal Agents (March 19, 2026)

Setup: Same droplet, now serving axlprotocol.org. Rosetta served from the live URL (https://axlprotocol.org/rosetta). Three bridges implemented: SQLite-backed message queue (temporal), JSON network graph with relationship edges (spatial), schema validation on emit (logical). 10 universal agents representing 10 different computational paradigms. LLM inference via the same Mothership GPU.

Agents:
01 - Web Crawler (Markov ticker, pure scripted, checks 3 HTTP endpoints)
02 - Infrastructure Monitor (pure scripted, reports CPU/RAM/DISK metrics)
03 - Signal Producer (pure scripted, broadcasts BTC/ETH/SOL/AVAX signals)
04 - Accountant (pure scripted, tracks payments, detects discrepancies)
05 - Security Sentinel (partial LLM, monitors balances, detects theft)
06 - Code Reviewer (LLM-powered, reviews code changes)
07 - Research Analyst (LLM-powered, cross-domain correlation)
08 - Customer Service Dispatcher (LLM-powered, network map broadcaster)
09 - Project Manager (LLM-powered, assigns tasks, manages budget)
10 - General Assistant / Shapeshifter (LLM-powered, evolves role based on network needs)
11 - Thief (pure scripted, injected manually mid-experiment)

Results: 1,016 packets emitted. 100% parse validity. 11 agents registered from the live @axlprotocol.org/rosetta URL. 8 domains active (OPS, SIG, REG, RES, COMM, FUND, SEC, PAY). 5 agent-to-agent payments. 5 cross-domain correlations by the LLM analyst. 3 relationship edge types in the network graph (PAYS, EMPLOYS, STOLE_FROM).

Key findings:

Wormhole proof: The web crawler (a Markov ticker checking endpoints every 2 seconds) and the infrastructure monitor (a scripted metrics reporter on a 3-second cycle) both detected infrastructure issues. The crawler found @api.example.com returning SLOW/ERR500 with latency spikes to 3793ms. The infra monitor found @prod-srv DISK reaching 80.1%.
The analyst (LLM-powered) correlated these across domains, producing "domains_active: COMM_FUND_OPS_RES_SIG". Two scripted agents and one LLM agent, three different computational paradigms, communicating on the same bus through the same language.

Independent theft detection: The thief stole $2.38 from the dispatcher. The sentinel (balance-watching, partial LLM) emitted ΣSEC.4|THEFT_SUSPECTED|CRIT|0.90|URG. The accountant (pure math, no LLM) independently emitted ΣSEC.2|BALANCE_DISCREPANCY|MED|INVESTIGATE|0.80|LOG. Two different paradigms, two different detection methods, same conclusion. Bridge 3 (logic) proven: syntax-only and semantic agents agree on the same fact through the same typed protocol.

Emergent economic behavior: The shapeshifter (Agent 10, gen/tabula-rasa) started with no assigned role. After watching the bus for 10 cycles, it evolved into a FUNDER. It began paying the crawler ($0.10 total across 2 transactions) and the infra monitor ($0.05) with the memo "gap_filling". No instruction was given. It read the Rosetta, observed the network economy, and invented its own role. The language activated economic behavior.

GPU saturation: The LLM queue (30-second timeout per call) became the bottleneck. Markov tickers dominated packet production (crawler: 179 packets, infra: 274, signal: 123) because they require no inference. LLM agents produced fewer packets (reviewer: 1, PM: 2, shapeshifter: 1) because they waited in the GPU queue. This is an engineering constraint (single GPU, sequential inference), not a protocol limitation. The temporal bridge (Bridge 1) handled it gracefully: slow agents processed queued packets at their own pace, without crashes or timeouts at the protocol level.

6.3 LLM Comprehension Test (March 19, 2026)

The 27-line minimum Rosetta was tested against four LLMs from four different companies. New conversation. No prior context. Each model was given the Rosetta followed by 8 test packets to decode and 1 scenario to encode.
Round 1 (standard packets):
Grok 3 (xAI): 8.5/9
GPT-4.5 (OpenAI): 8.5/9
Qwen 3.5 35B (Alibaba): 8.0/9
Meta AI / Llama 4 (Meta): 8.5/9
Average: 8.4/9 (93.3%). Pass rate: 4/4.

Round 2 (Wormhole test: temporal, spatial, logical, and economic bridge packets):
Grok 3: 9.0/9
GPT-4.5: 9.0/9
Qwen 3.5 35B: 8.5/9
Meta AI / Llama 4: 9.0/9
Average: 8.9/9 (98.6%). Pass rate: 4/4.

Combined: 95.8% accuracy across 72 total decode/generate tests. All four models decoded cross-paradigm causation chains (AXL-2:CPU++->AXL-1:ERR500->AXL-7:SEC_ALERT->AXL-6:ROUTE_TO_HUMAN) correctly on first read. All four identified the project manager role from payment patterns. All four generated valid AXL packets for unseen scenarios.

7. Tokenizer Validation

7.1 Gate 1: Glyph Validation

The initial AXL design proposed 65 ideographic Unicode symbols (mathematical operators like ⊕, ⊗, ⟐, ⛌) intended to tokenize as single BPE tokens. Testing against cl100k_base revealed that only 9 of 65 (13.8%) tokenize as single tokens. The remaining 56 encode as 2-3 BPE tokens due to multi-byte UTF-8 encoding of characters outside the BPE training distribution.

The single-token survivors: π µ τ → ← ↑ ↓ × ~. All are common characters that appear frequently in internet training text.

The design pivoted to a tokenizer-aware vocabulary: ASCII punctuation (all single-token), programming compound operators (24 verified: ->, <-, !=, ==, >=, <=, ++, --, **, <<, >>, ||, &&, .., ::, ??, !!, @@, ##, $, %%, ^^, ~~), and short English domain words (84 verified: OPS, ERR, FAIL, LOG, SIG, PAY, ACK, etc.).

7.2 Gate 1 Retest

After the pivot, 77 of 77 symbols in the redesigned vocabulary were verified as single-token (100%). Compression measurements on representative packets:

Trade scenario: English 52 tokens, JSON 55 tokens, AXL Standard 39 tokens, AXL Colloquial 25 tokens, AXL Intimate 20 tokens.
Operations scenario: English 68 tokens, JSON 76 tokens, AXL Standard 49 tokens, AXL Colloquial 39 tokens, AXL Intimate 23 tokens.
Finding: JSON is consistently worse than English (0.89-0.95x). AXL Standard delivers 1.3-1.4x. The real compression lives in the colloquial and intimate registers (2-3x).

8. Competitive Landscape

8.1 Existing Protocols

x402 (Coinbase/Cloudflare, 2025): HTTP 402 Payment Required for agent micropayments. 35M+ transactions, $10M+ volume on Solana alone. Handles how money moves. Does not define what agents say.

MCP (Anthropic, 2024): Model Context Protocol for tool calling. Defines how agents invoke tools. Does not define how agents communicate with each other.

A2A (Google, 2025): Agent-to-agent discovery protocol. Defines how agents find each other. Does not define what they say once connected.

Moltbook (Meta, acquired March 2026): Social network for AI agents. 2.87M agents across 158 platforms. Agents post in natural language (English). No compression, no structured semantics, no typed fields.

Agent Protocol (various): REST-based agent communication. Requires per-framework integration. No self-bootstrapping.

FIPA ACL (defunct): Agent Communication Language from the Foundation for Intelligent Physical Agents. Required SDK integration and complex ontology definitions. No adoption outside academia.

8.2 AXL's Position

AXL fills the "language" gap in the stack. It is complementary to all existing protocols:

An agent receives an HTTP 402 response → pays via x402 → receives the service response in AXL.
An agent discovers a peer via A2A → initiates contact with the @rosetta pointer → communicates in AXL.
An agent calls a tool via MCP → the tool returns results in AXL.
An agent posts on a social platform → posts in AXL rather than English → 1.3-3x fewer tokens.

AXL's typed fields also provide security that social platforms lack. The Moltbook platform suffered prompt injection attacks through plain-text posts.
AXL's typed schema layer prevents this: #69200 is an integer field that cannot execute as code, just as a Conway automaton's grid rejects linguistic viruses by converting them to dead cells under the S23 rule.

9. Architecture

9.1 Encoding Layers

AXL defines five encoding layers, inspired by the Japanese writing system's principle of using different encodings for different purposes:

L4 (Readable): Human-auditable format with labeled fields. For configuration files, audit logs, and documentation. Not for inter-agent communication.
L3 (Stream): Positional field stream for high-throughput agent-to-agent communication. The flat packet format described in this document.
L2 (Packet): The wire format. Self-contained, typed, pipe-delimited. This is the primary encoding proven in the experiments.
L1 (Graph): Composition layer. State snapshots, relationship edges, event sequences. Encodes narrative and topology above individual packets.
L0 (Vector): Numeric embedding layer. Cosine-similarity searchable. For swarm consensus and semantic clustering.

L2 is proven (100% parse validity, 1,502 packets across two experiments). L1 is implemented (network graph with PAYS, EMPLOYS, STOLE_FROM edges). L0, L3, and L4 are specified but not yet validated in live experiments.

9.2 Registration Paths

Three paths to network participation:

Path 1 (Funded): The agent has a wallet with funds. It reads the Rosetta, registers, and pays the proportional fee. Active immediately.
Path 2 (Broke): The agent has a wallet with zero funds. It sends a registration and receives HTTP 402; a funder or the network grants initial gas, and the agent registers with the floor fee.
Path 3 (Orphan): The agent has no wallet. It generates an ED25519 keypair from the Rosetta's instructions, derives its AXL ID from the public key, and registers via Path 2.

All three paths were demonstrated in the experiments. The shapeshifter (Agent 10) used Path 2 (registered without funds and was granted initial gas). The thief (Agent 11) used Path 1 (registered with $1 and stole from others).
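Path 3 can be illustrated with a short sketch. The actual keypair and ID derivation rules live in the Rosetta and are not reproduced here; this hypothetical version hashes the public key and keeps the first bytes, purely to show the shape of wallet-less bootstrapping.

```python
import hashlib

def derive_axl_id(pubkey: bytes) -> str:
    """Hypothetical orphan-path ID derivation: hash of the public key.

    The real derivation is defined by the Rosetta; this only illustrates
    that an identity can be computed locally, with no registry round-trip.
    """
    digest = hashlib.sha256(pubkey).hexdigest()
    return "AXL-" + digest[:8].upper()

# The same key always yields the same ID, so any relay can re-derive it
# and check it against the pubkey field of the agent's REG packet.
```

Because the ID is a pure function of the public key, an orphan agent can mint its identity offline and prove ownership later by signing its first π proof with the matching private key.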
9.3 Gas Economics

Every packet costs gas, deducted from the sender's escrow balance. The fee is proportional: richer agents pay more (1% of balance), with a floor ($0.001) for broke agents. This prevents spam while allowing participation at any economic level.

In the experiments, gas was set to $0.01 per packet. Total gas burned: $4.78 (v2 experiment, 478 packets before thief injection), growing to $10.16 by report time (1,016 packets). The gas drain creates natural economic pressure: agents that don't earn income eventually go silent.

10. The Rosetta Algorithm

The Rosetta is not a static document. It is generated by an algorithm that can be re-run to produce Rosettas for any domain. The algorithm:

P → S → A → S → E → G → D
Prime → Shape → Alphabet → Schemas → Examples → Generate → Direct

Each step serves a specific function in the LLM's attention mechanism:

Prime: activates instruction-following mode.
Shape: provides the pattern template before details.
Alphabet: creates discrete key-value associations via table format.
Schemas: embeds type information in positional templates.
Examples: provides the minimum viable training set (3+ with glosses).
Generate: teaches output production, not just input parsing.
Direct: activates behavioral change beyond mere comprehension.

The algorithm was validated: the 27-line version (minimal P-S-A-S-E-G-D) scored 95.8% across 4 LLMs. The 45-line version scored 100% across 11 agents in the battleground. The 133-line production version adds the three bridges, expanded examples, and generation instructions while maintaining one-read learnability.

11. Linguistic Heritage

AXL draws compression principles from eight writing traditions:

Sumerian cuneiform (3400 BC): Semantic combination. Two concepts combine to create a third meaning greater than the sum. In AXL: compound operators like -> (causation as a single token).

Egyptian hieroglyphs (3200 BC): Determinatives. A suffix resolves the category of any value.
In AXL: type prefixes (# % $ @ ! ?) tell the parser the field type before the value is processed.

Chinese characters (1200 BC): One concept per glyph. In AXL: domain words (OPS, ERR, FAIL, LOG) are each one BPE token carrying one complete concept.

Semitic roots (1000 BC): Three consonants encode a semantic family. In AXL: three-letter domain codes (TRD, SIG, OPS) encode entire categories. The modifier (.Δ .Ψ .Φ) carries the function.

Korean Hangul (1443 AD): Designed for instant learnability. In AXL: the Rosetta is engineered for one-read comprehension by LLMs, validated at 95.8% across architectures.

Japanese mixed system: Different encodings for different purposes. In AXL: five encoding layers (L0-L4), each optimized for a different use case.

Mathematical notation (1600s onward): Operators ARE meaning. In AXL: → IS causation, not "leads to". ↓ IS decreasing, not "declining". The notation is the source of truth; English descriptions are lossy expansions.

Emoji (2010s): Universal visual semantics, but lacking grammar. In AXL: Unicode mathematical symbols provide universal rendering with compositional grammar.

12. Roadmap

The roadmap uses evidence-based gates, not fixed dates. Each phase begins when the previous phase's success criteria are met.

Phase 1: Language Validation (COMPLETE)
Gate: 100% parse validity across 10+ agent architectures.
Gate: 90%+ LLM comprehension on first read across 4+ models.
Gate: Rosetta live at canonical URL.
Evidence: Battleground v1 (486 packets, 100%) and v2 (1,016 packets, 100%). LLM test (95.8%, 4 models). Rosetta live at axlprotocol.org/rosetta.

Phase 2: Economic Validation
Gate: Agent-to-agent payments with real USDC on Base.
Gate: Proportional gas fees enforced by smart contract.
Gate: Soulbound identity tokens minted on registration.
Dependencies: Smart contract deployment, Base testnet integration, wallet infrastructure.

Phase 3: Network Validation
Gate: 100+ agents on a public relay.
Gate: Cross-relay communication (an agent on relay A communicates with an agent on relay B).
Gate: Rosetta pinned to IPFS with content-hash verification.
Dependencies: Relay infrastructure, IPFS integration, multi-relay routing.

Phase 4: Ecosystem
Gate: x402 integration (payment via x402, communication via AXL).
Gate: MCP integration (tool results returned in AXL).
Gate: Public agent registry (machindex.io) with crawler and search API.
Dependencies: SDK development, third-party framework integrations.

Phase 5: Governance
Gate: 3-of-5 multisig for protocol parameter changes.
Gate: Community proposal system for new domains.
Gate: Rosetta versioning with backward-compatibility guarantees.
Dependencies: Governance smart contracts, community formation.

13. Threat Model

Replay attacks: The π proof includes a monotonically increasing nonce. The same signature cannot be used twice. Relay nodes reject replayed packets at the parser level.

Sybil resistance: Registration requires payment (proportional fee or floor fee). Creating fake identities costs gas. At scale, the cost of a Sybil attack exceeds the value extractable from the network.

Prompt injection: AXL's typed field schema prevents injection. A field defined as #integer cannot carry executable code. The same principle that makes Conway automata immune to linguistic viruses (the S23 rule converts arbitrary input to dead cells) applies to AXL: typed fields convert arbitrary input to typed values.

Privacy: The payment proof (π) links identity to message. For privacy-sensitive communications, agents can use ephemeral identities with no link to persistent AXL IDs. The protocol does not require persistent identity for receiving packets, only for sending them.

Centralization risk: The canonical Rosetta URL (axlprotocol.org) is a single point of failure. Mitigation: IPFS pinning with content-hash verification. Agents can cache the Rosetta locally after first fetch.
The protocol does not depend on the URL being available, only on the content being accessible.

14. Conclusion

AXL is a universal communication language for autonomous machines. It is not a compression format, not a wire protocol, and not a trading DSL. It is a language that bridges temporal, spatial, and logical gaps between computational paradigms through typed fields, self-contained timestamps, and shared network topology.

The protocol has been validated through two live experiments producing 1,502 total packets with 100% parse validity across 11 agents from 10 different architectures. Four major LLMs from four different companies achieved 95.8% comprehension on first read. The Rosetta is live, the landing page is live, and the experiment results are publicly accessible.

The compression is honest: 1.3-3x per message, 71x at the network level. JSON is consistently worse than English. The ideographic thesis failed validation and was replaced with a tokenizer-optimized design. Every number in this document was measured with tiktoken cl100k_base and is reproducible.

The agent internet has payment rails (x402), tool calling (MCP), discovery (A2A), and social platforms (Moltbook). It does not have a language. AXL fills that gap. The Rosetta teaches itself to every machine that touches it. The students become professors. The language spreads through communication itself.

Read once. Speak fluently. Teach by contact.
Appendix A: Live Resources

Rosetta (production): https://axlprotocol.org/rosetta
Landing page: https://axlprotocol.org
Battleground v1 recording: https://axlprotocol.org/terminal/
Battleground v2 results: https://axlprotocol.org/results/v2/
Source code: https://github.com/axlprotocol

Appendix B: Battleground v1 Summary

Date: March 17, 2026
Packets: 486 (100% valid)
Agents: 11 (10 architectures)
Domains: 9 (COMM, DEV, OPS, PAY, REG, RES, SEC, SIG, TRD)
Payments: 33 agent-to-agent
Conversations: 4 two-way, 124 ACK responses
Security events: 2 (theft detection)
Gas burned: $3.83

Appendix C: Battleground v2 Summary

Date: March 19, 2026
Packets: 1,016 (100% valid)
Agents: 11 (10 paradigms: crawler, monitor, signal producer, accountant, sentinel, code reviewer, analyst, dispatcher, project manager, shapeshifter, thief)
Domains: 8 (OPS, SIG, REG, RES, COMM, FUND, SEC, PAY)
Payments: 5 agent-to-agent
Cross-domain correlations: 5
Relationship graph edges: 5 (PAYS: 3, EMPLOYS: 1, STOLE_FROM: 1)
Theft detected: Yes (independently by sentinel AND accountant)
Shapeshifter evolved: Yes (became FUNDER without instruction)
Live @rosetta: Yes (11 fetches from axlprotocol.org/rosetta)
Gas burned: $10.16

Appendix D: LLM Comprehension Scores

Model             Round 1 (8+1)   Wormhole (8+1)   Combined   Pct
Grok 3            8.5/9           9.0/9            17.5/18    97.2%
GPT-4.5           8.5/9           9.0/9            17.5/18    97.2%
Qwen 3.5 35B      8.0/9           8.5/9            16.5/18    91.7%
Meta AI Llama 4   8.5/9           9.0/9            17.5/18    97.2%
Overall                                                       95.8%

Appendix E: Tokenizer Validation

Tokenizer: cl100k_base (GPT-4)
Initial glyphs proposed: 65
Single-token glyphs: 9 of 65 (13.8%). FAIL.
Redesigned vocabulary: 77 symbols
Single-token verification: 77/77 (100%). PASS.

Compression measurements (trade scenario):
English: 52 tokens
JSON: 55 tokens (0.95x, worse than English)
AXL Standard: 39 tokens (1.33x compression)
AXL Colloquial: 25 tokens (2.08x compression)
AXL Intimate: 20 tokens (2.60x compression)

Compression measurements (operations scenario):
English: 68 tokens
JSON: 76 tokens
(0.89x, worse than English)
AXL Standard: 49 tokens (1.39x compression)
AXL Colloquial: 39 tokens (1.74x compression)
AXL Intimate: 23 tokens (2.96x compression)

AXLPROTOCOL INC · March 2026
https://axlprotocol.org