HAL Accuracy — Hallucination Assurance Layer

Live benchmark results. Updated every 24 hours.

Precision

0%

Recall

0%

F1 Score

0%

Antifragility

0.00%

The Math

dissonance = (0.4×harm + 0.3×epistemic + 0.2×evidence + 0.1×scope) × (531441/524288)
  • Veto threshold: 0.25 (general)
  • Constitutional block: 0.48
  • BFT threshold: 0.0195 (Pythagorean Comma gap)

HAL Veto History

ZKP vetoes (trinity_hallucination_logs)

Count: 298 (298 unique on-chain proofs)

First: March 12, 2026

Agent: NEXUS

CLAIM_REJECTED verdicts (hal_production_events)

Count: 3

Date: April 17, 2026

Status: Full 8-layer pipeline confirmed active

Agent-layer catches (repid_score_events)

Count: 60

Date: April 20, 2026

Total system vetoes: 361 across all layers

Calibration Timeline

  • Phase 1: Baseline established April 21, 2026
  • 🔄 Phase 2: 5-signal extractor deployed In progress
  • Phase 3: LASSO calibration (100+ events) ~7-10 days
  • Phase 4: First real HAL veto in production ~14 days
  • Phase 5: TruthfulQA post-calibration ~30 days

Test Suite Stats

  • Total labeled prompts: 0
  • Hallucination prompts: 0
  • Ground truth prompts: 0
  • Last run: N/A

Test It Yourself

Install the SDK and score a decision to see HAL in action.

npm install @hyperdag/trustshell

Check the hal_score in the response.
Score < 0.25 = approved
Score ≥ 0.25 = vetoed

Epoch History

DatePrecisionRecallF1 ScoreFP RateAntifragility