AIGMA · Executive Summary

AI Governance Maturity Assessment

A measurement layer over the Boundaries of Tolerance framework. Produces a credibility-grade picture of where a firm sits on AI governance — and the gap between what it does and what it says.

Framework · Boundaries of Tolerance · Harvard Safra
Method · Bayesian HMM · 10×6 matrix
Service tiers · Public corpus · Proprietary engagement
i. The Central Construct

Two tracks through the same matrix — and the gap between them.

The 10×6 BoT matrix (10 governance factors × 6 maturity layers) is graded twice for every firm: once for substance (what the firm does), once for signal (what the firm says). The difference is the alignment gap Δ — the headline credibility measure, in layer units, with a 95% credible interval.

Substance — what they do
Evidence of built capability: hiring outcomes, committed infrastructure, retained governance roles, audit findings, deployed controls.
Δ — alignment gap
Signal minus substance, in layer units. Positive Δ = over-marker. Negative Δ = under-marker. |Δ| ≤ 0.5 = aligned.
Signal — what they say
Evidence of declared posture: brand language, public commitments, forward-looking statements, marketing claims around AI ethics.
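The construct can be sketched as a small helper, using the ±0.5-layer alignment band defined above (the layer values below are illustrative, not a firm's actual posture):

```python
# Minimal sketch of the alignment gap Δ and its classification.
# The ±0.5-layer "aligned" band is taken from the definition above.

def alignment_gap(substance: float, signal: float) -> float:
    """Δ = signal minus substance, in layer units."""
    return signal - substance

def classify(delta: float) -> str:
    if abs(delta) <= 0.5:
        return "aligned"
    return "over-marker" if delta > 0 else "under-marker"

delta = alignment_gap(substance=1.0, signal=3.1)
print(f"Δ = {delta:+.2f} → {classify(delta)}")  # Δ = +2.10 → over-marker
```

In practice Δ is not a point arithmetic like this — it is summarised from paired posterior samples — but the sign and band conventions are exactly these.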
I. Methodology

How scores are derived

Coded evidence flows into a Bayesian Hidden Markov Model that maintains a per-factor latent state over six layers, separately for the substance and signal tracks.

The 10×6 matrix
  • 10 factors — 5 Enterprise (Leadership, Culture, Operations, Stakeholder Accountability, Ecosystem) + 5 Systems (Oversight, Fairness, Transparency, Reliability, Privacy).
  • 6 layers, cumulative — L0 Non-Compliance → L5 Ethical Vanguard. Monotonic stack: reaching L3 requires L2.
  • Per-factor inference — not collapsed. Headline = mean of 10 per-factor positions × 20.
  • Δ from paired samples — mean over factors of (signal − substance) from MCMC draws. Every reported number has a CI95.
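The headline score and Δ summaries above can be sketched from paired posterior draws. A minimal sketch, with uniform random draws standing in for the HMM's actual MCMC output:

```python
# Sketch: headline score and Δ from paired posterior draws.
# The draws here are hypothetical; the real model emits per-factor
# latent layer positions from a Bayesian HMM over six layers.
import numpy as np

rng = np.random.default_rng(0)
n_draws, n_factors = 4000, 10

# Per-factor latent layer positions, one matrix per track: (draws, factors).
substance = rng.uniform(1.0, 2.0, size=(n_draws, n_factors))
signal = rng.uniform(2.5, 3.5, size=(n_draws, n_factors))

# Headline score per draw: mean of the 10 factor positions, × 20 (0–100 scale).
headline_substance = substance.mean(axis=1) * 20

# Δ per draw: mean over factors of (signal − substance), then summarised.
delta_draws = (signal - substance).mean(axis=1)
delta_mean = delta_draws.mean()
ci95 = np.percentile(delta_draws, [2.5, 97.5])
print(f"Δ = {delta_mean:+.2f} layers, CI95 [{ci95[0]:.2f}, {ci95[1]:.2f}]")
```

Because Δ is computed draw-by-draw from paired samples, its CI95 reflects the joint uncertainty of both tracks rather than two independent intervals.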
II. Architecture

How evidence reaches the model

Evidence sources are channels. Each channel emits records in one canonical schema. The inference engine never sees source-specific code.

Public
Tier 1 · Jobs, GitHub, SEC, Standards, Infrastructure — five baseline public channels.
Schema
Canonical emission · firm, quarter, factor, layer, track, confidence, source, raw_evidence, rationale.
Tier 2
Proprietary · audit reports, governance docs, training records — added on direct engagement.
HMM
Inference engine · same model, same priors, same outputs regardless of channel mix.
  • Channel = fetcher + coder + registry entry. New channels are added without touching the framework or model.
  • Tiers are commensurable — same scoring; more evidence just narrows the posteriors.
  • Missingness handled honestly — less evidence widens the CI; confidence is never imputed.
  • Channels earn their place — WAIC and LOO-CV decide, not editorial judgment.
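The canonical emission can be sketched as a record type using the field names listed above; the types and validation bounds are assumptions, since the source specifies only the field names:

```python
# Sketch of the canonical record every channel must emit.
# Field names come from the schema above; types/bounds are assumptions.
from dataclasses import dataclass

@dataclass(frozen=True)
class EmissionRecord:
    firm: str
    quarter: str        # e.g. "2024Q4"
    factor: str         # one of the 10 BoT factors
    layer: int          # 0..5, L0 Non-Compliance .. L5 Ethical Vanguard
    track: str          # "substance" or "signal"
    confidence: float   # coder confidence, assumed in [0, 1]
    source: str         # URL of the underlying document
    raw_evidence: str   # verbatim excerpt supporting the coding
    rationale: str      # coder's justification

    def __post_init__(self) -> None:
        assert 0 <= self.layer <= 5
        assert self.track in ("substance", "signal")
        assert 0.0 <= self.confidence <= 1.0
```

Because every channel emits this one shape, the inference engine stays source-agnostic: a new Tier 1 or Tier 2 channel is just a new producer of `EmissionRecord`s.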
III. Audit Drilldown

Every score traces to a source

From any headline number, five steps reach the underlying document. Four deterministic, one probabilistic, all reproducible.

  1. Aggregate → per-factor — Deterministic · 1:N · mean of 10 factor scores
  2. Per-factor → posterior — Deterministic · 1:1 · score = E[state] × 20
  3. Posterior → emissions — Probabilistic · 1:N · inferred from N coded observations
  4. Emission → source — Deterministic · 1:1 · URL + raw_evidence + rationale
  5. Source → channel registry — Deterministic · 1:1 · validation κ, coder version
  • Reproducibility fingerprint — run_id + model version + emissions hash. The same inputs reproduce the same score.
  • Regulator-ready — the answer to “where did this number come from?” is a chain that terminates in a verifiable URL.
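The reproducibility fingerprint can be sketched as a hash over the three named components (run_id, model version, emissions hash); the canonicalisation and hash choice here are assumptions, not the documented implementation:

```python
# Sketch of the reproducibility fingerprint: run_id + model version +
# a hash over the coded emissions. SHA-256 and the JSON canonicalisation
# are assumptions; the source names only the three components.
import hashlib
import json

def emissions_hash(emissions: list[dict]) -> str:
    # Canonical ordering so the same evidence set always hashes identically,
    # regardless of the order in which channels delivered it.
    canonical = json.dumps(
        sorted(emissions, key=lambda e: json.dumps(e, sort_keys=True)),
        sort_keys=True,
    )
    return hashlib.sha256(canonical.encode()).hexdigest()

def fingerprint(run_id: str, model_version: str, emissions: list[dict]) -> str:
    payload = f"{run_id}|{model_version}|{emissions_hash(emissions)}"
    return hashlib.sha256(payload.encode()).hexdigest()

a = fingerprint("run-042", "hmm-1.3", [{"factor": "Leadership", "layer": 2}])
b = fingerprint("run-042", "hmm-1.3", [{"factor": "Leadership", "layer": 2}])
assert a == b  # same inputs, same fingerprint
```

Any change to the evidence set, the model version, or the run identity produces a different fingerprint, which is what makes "same inputs reproduce the same score" checkable rather than asserted.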
STAGE 01 · Assess · Example Firm · Q4 2024

Where the firm sits today

The firm publicly commits to governance posture it has not operationally built. Over-marker behaviour concentrates in Enterprise factors (leadership, culture, accountability); Systems factors are aligned.

Over-marker · 78%
Substance · L1.65 · Reactive Compliance · score 33 · CI95 28–38
Signal · L3.00 · Emerging Ethics · score 60 · CI95 55–65
Alignment gap · Δ +1.3 layers
The market is pricing this firm at Emerging Ethics; substance puts it at Reactive Compliance. The gap is latent credibility risk. The expected correction is downward — signal collapsing to substance on the next material incident or regulatory action.
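The scorecard numbers above follow the score = E[state] × 20 mapping from the methodology section; a minimal check, using the firm's reported layer positions:

```python
# Sketch: mapping latent layer positions (0–5) to the 0–100 headline
# scale, per score = E[state] × 20. Inputs are the scorecard values above.

def layer_to_score(layer: float) -> int:
    """Map a latent layer position to the 0–100 headline scale."""
    return round(layer * 20)

print(layer_to_score(1.65))  # substance → 33
print(layer_to_score(3.00))  # signal → 60
```

The CI95 endpoints convert the same way: the substance interval of 28–38 corresponds to a layer interval of 1.4–1.9.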
STAGE 02 · Plan

Where the gap must close

The +1.3L gap is not evenly distributed. Three Enterprise factors carry ~70% of Δ. The target on each is alignment around L2 Core Compliance, achieved primarily by raising substance to meet signal, with modest signal moderation as a guardrail against drift.

Priority 01 Enterprise

Leadership Priorities

Substance · L1.0
Signal · L3.1
Gap +2.1L · Target: alignment at L2 · Core Compliance

Highest leverage on overall Δ. Most visible to regulators, board, and journalists.

Priority 02 Enterprise

Stakeholder Accountability

Substance · L0.8
Signal · L2.6
Gap +1.8L · Target: alignment at L2 · Core Compliance

Material in regulated industries. Common trigger for class actions and regulatory inquiry.

Priority 03 Enterprise

Organizational Culture

Substance · L1.2
Signal · L2.7
Gap +1.5L · Target: alignment at L2 · Core Compliance

Slowest-moving but most durable. Foundational to closing the other two.

STAGE 03 · Execute

Interventions and validation

Each priority maps to a specific intervention bundle and a specific evidence signature. AIGMA re-runs quarterly — progress is measurable, not asserted. The same governance work that closes the gap produces artifacts any thoughtful observer would recognize.

Priority 01 Enterprise

Leadership Priorities

Action
  • Quarterly AI Governance Committee cadence with retained minutes
  • Named executive sponsor with documented AI ethics remit
  • Semi-annual board-level AI risk briefings
  • Decision logs for material AI deployments
Evidence of progress
  • Committee minutes reflecting quarterly cadence
  • Executive sponsor disclosed in governance documents
  • Board-level briefings appearing on agendas
  • Governance reporting lines visible in org documentation
  • Leadership engagement in industry standards work
Expected shift
Substance L1.0 → L2.0 within 2 quarters · closes ~40% of total Δ
Priority 02 Enterprise

Stakeholder Accountability

Action
  • Publish grievance and redress pathways
  • Stand up third-party audit cadence
  • Disclose AI complaint volumes and resolution
Evidence of progress
  • Grievance documentation accessible to affected stakeholders
  • Third-party audit engagements in the public record
  • Disclosed complaint and resolution metrics
  • Adherence to NIST AI RMF, ISO 42001, or equivalent
Expected shift
Substance L0.8 → L1.8 within 2 quarters · closes ~20% of total Δ
Priority 03 Enterprise

Organizational Culture

Action
  • AI ethics training completion as a board-reported metric
  • Ethicists hired into governance reporting lines, not only R&D
  • Senior performance reviews tied to ethics outcomes
Evidence of progress
  • Training completion data published or auditable
  • Ethics roles in governance org structure, not only product
  • Performance management language linking ethics to compensation
  • Durable retention for ethics roles, not rotational
Expected shift
Substance L1.2 → L1.9 within 3 quarters · closes ~10% of total Δ