A tool for analyzing Lean 4 projects to classify theorems and track verification coverage using the Four-Category Ontology.
This repository contains two independent Lake packages:
```
doc-verification-bridge/
├── REFACTORING.md                # Detailed refactoring plan
├── README.md                     # This file
├── DocVerificationBridge/        # Core analysis library (version-dependent)
│   ├── lakefile.toml
│   ├── DocVerificationBridge.lean
│   ├── Main.lean                 # unified-doc CLI
│   └── DocVerificationBridge/    # Core modules
│       ├── Types.lean
│       ├── Inference.lean
│       ├── Classify.lean
│       ├── Report.lean
│       ├── StaticHtml.lean
│       ├── TableData.lean
│       ├── Unified.lean
│       ├── Cache.lean
│       ├── Attributes.lean
│       ├── Compatibility.lean
│       ├── SourceLinkerCompat.lean
│       ├── VerificationDecorator.lean
│       └── ... (other modules)
└── Experiments/                  # Multi-project orchestration (version-agnostic)
    ├── lakefile.toml
    ├── Main.lean                 # experiments CLI
    ├── Experiments.lean          # Library root module
    └── Experiments/
        ├── Compatibility.lean    # Local copy of compat helpers
        └── ExperimentsCore.lean  # Pipeline orchestration + git clone + subprocess
```
DocVerificationBridge depends on Lean and doc-gen4 APIs that have breaking changes between versions. A single codebase cannot support all versions simultaneously.
| Constant | Value | Location |
|---|---|---|
| Min supported | 4.24.0 | Experiments/Experiments/ExperimentsCore.lean → minSupportedVersion |
| Max supported | 4.29.0 | Experiments/Experiments/ExperimentsCore.lean → maxSupportedVersion |
| Current toolchain | v4.29.0 | DocVerificationBridge/lakefile.toml → leanVersion |
To add a new Lean version: update `maxSupportedVersion` in `ExperimentsCore.lean`, then follow the steps in Adding Support for New Lean Versions.
Solution: Version-specific git branches + process isolation:
- DocVerificationBridge has version-specific branches (v4.27.0, v4.28.0, v4.29.0, main)
- Experiments orchestrates by cloning the appropriate DocVerificationBridge version per project
- Communication happens via CLI/subprocess (not Lean imports)
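Since the orchestrator talks to each cloned bridge over the CLI rather than via Lean imports, the invocation pattern can be sketched with Lean's standard `IO.Process` API. The helper name `runUnifiedDoc` and its argument handling are illustrative, not the actual `ExperimentsCore` code; the CLI flags mirror the `unified-doc` usage documented below.

```lean
-- Sketch: shell out to `unified-doc` inside a version-specific clone.
-- `IO.Process.output` captures stdout/stderr and the exit code.
def runUnifiedDoc (cloneDir repo module : String) : IO UInt32 := do
  let out ← IO.Process.output {
    cmd  := "lake"
    args := #["exe", "unified-doc", "unified", "--repo", repo, "--auto", module]
    cwd  := some ⟨cloneDir⟩  -- run inside the checked-out DocVerificationBridge
  }
  unless out.exitCode == 0 do
    IO.eprintln out.stderr
  return out.exitCode
```

Keeping the boundary at the process level means the orchestrator never links against version-specific doc-gen4 APIs.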
Purpose: Core analysis library that classifies theorems and generates verification reports.
Dependencies: Lean 4, doc-gen4 (version-specific)
Executables:
- `unified-doc`: Combined doc-gen4 + verification pipeline

```shell
cd DocVerificationBridge
lake build unified-doc

# Generate documentation with auto-classification
lake exe unified-doc unified \
  --repo https://github.com/owner/repo \
  --project "MyProject" \
  --auto \
  MyModule

# Generate with explicit annotations only
lake exe unified-doc unified \
  --repo https://github.com/owner/repo \
  --annotated \
  MyModule
```

| Branch | Lean Versions | Notes |
|---|---|---|
| v4.27.0 | 4.24.0 – 4.27.0 | Legacy |
| v4.28.0 | 4.28.0 | |
| v4.29.0 / main | 4.29.0-rc1+ | Current development |
See the Supported Version Range table above for the authoritative min/max.
Purpose: Orchestration module for running verification analysis across multiple projects.
Dependencies: Lean 4 (minimal, no doc-gen4)
Executables:
- `experiments`: Multi-project pipeline runner

```shell
cd Experiments
lake build experiments

# Run all experiments from config
lake exe experiments --config config.toml

# Resume incomplete experiments
lake exe experiments --config config.toml --resume

# Re-run analysis only (skip build)
lake exe experiments --config config.toml --reanalyze
```

Create a `config.toml` in the `Experiments/` directory:
```toml
# Experiments configuration
repos_dir = "experiments/repos"
sites_dir = "experiments/sites"
max_parallel_jobs = 4

[[project]]
name = "my-project"
repo = "https://github.com/owner/my-project"
modules = ["MyProject"]
description = "Project description"
docvb_version = "v4.29.0"  # Which DocVerificationBridge branch to use (see Supported Version Range)
```

The `docvb_version` field specifies which DocVerificationBridge branch/tag to use for that project. This allows mixing projects with different Lean versions in the same experiment run.
DocVerificationBridge classifies declarations into four categories based on E.J. Lowe's ontology:
| | Mathematical (Universal) | Computational (Particular) |
|---|---|---|
| Substantial | Mathematical Abstraction | Computational Datatype |
| Non-substantial | Mathematical Definition | Computational Operation |
Theorem Kinds:
- Computational Property: Proves computational definitions satisfy laws
- Mathematical Property: Proves abstract mathematical properties
- Bridging Property: Connects Prop specifications with Bool computations
- Soundness Property: Proves user types correctly embed into external specs
- Completeness Property: Proves external specs can be represented by user types
Uses heuristic-based type analysis to automatically infer categories:
- Types: Prop-based → Mathematical Abstraction, Data-carrying → Computational Datatype
- Definitions: Returns Prop → Mathematical Definition, Returns non-Prop → Computational Operation
- Theorems: Automatically infers `assumes`, `proves`, and `validates` from type structure
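To illustrate, here is how the heuristics above would classify a small module. All names (`Queue` and its members) are invented for the example; the comments state the expected category under each rule.

```lean
-- Data-carrying structure → Computational Datatype
structure Queue (α : Type) where
  items : List α

-- Returns non-Prop (`Bool`) → Computational Operation
def Queue.isEmpty (q : Queue α) : Bool :=
  q.items.isEmpty

-- Returns `Prop` → Mathematical Definition
def Queue.AllPositive (q : Queue Nat) : Prop :=
  ∀ x ∈ q.items, 0 < x

-- Hypothesis `h` would be traced into `assumes`;
-- the conclusion's head constants into `proves`
theorem isEmpty_of_no_items (q : Queue α) (h : q.items = []) :
    q.isEmpty = true := by
  simp [Queue.isEmpty, h]
```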
```shell
lake exe unified-doc unified --auto MyModule
```

Only classifies declarations with explicit `@[api_*]` annotations:
```lean
@[api_type .mathematicalAbstraction]
class MapLike (α : Type) where ...

@[api_def .computationalOperation]
def lookup (m : Map α β) (key : α) : Option β := ...

@[api_theorem .bridgingProperty
  (assumes := #[`wellFormed])
  (proves := #[`isCorrect])
  (validates := #[`checkInvariant])]
theorem lookup_correct : wellFormed m → ... := ...
```

```shell
lake exe unified-doc unified --annotated MyModule
```

```shell
# Test DocVerificationBridge
cd DocVerificationBridge
lake build
lake exe unified-doc unified --repo https://github.com/test/project TestModule

# Test Experiments
cd Experiments
lake build
lake exe experiments --config test-config.toml
```

- Create a new git branch from main: `git checkout -b v4.XX.0`
- Update `DocVerificationBridge/lakefile.toml` with new Lean/doc-gen4 versions
- Fix any API breakages in the core modules
- Test with projects using that Lean version
- Tag the branch: `git tag v4.XX.0`
- Update experiment configs to use `docvb_version = "v4.XX.0"` for appropriate projects
Compiling untrusted Lean 4 code executes arbitrary code at build time (tactics, macros, elaborators, initialize blocks). The project uses a two-phase sandboxed build via Bubblewrap to mitigate this:
- Dependency fetch phase: Network allowed, compilation blocked
- Compilation phase: Network completely disabled (blocks exfiltration), filesystem restricted
See SECURITY.md for the full threat model, attack vector analysis, sandboxing strategy evaluation (Docker, Firejail, Bubblewrap, VMs, Landlock), and implementation details in scripts/sandbox-lake.sh.
The three-phase refactoring is complete (see REFACTORING.md, PHASE1-COMPLETE.md, PHASE2-COMPLETE.md, PHASE3-PROGRESS.md):
- Phase 1: Two-package structure, lakefiles, file moves (2026-03-03)
- Phase 2: Per-branch cleanup — internal + v4.28.0 branches cleaned (2026-03-03)
- Phase 3: Experiments decoupling — subprocess invocation, git clone/checkout, `docvb_version` config (2026-03-03)
- Implementation complete; end-to-end testing pending
Current work focuses on improving classification accuracy for verification coverage tracking.
Problem: When a theorem concludes with `∃ vy : ValidYaml, vy.input = ... ∧ ...`, the `collectHeadConstants` function in Inference.lean strips the existential binder and recurses into the body only. The field projections (`.input`, `.value`) are traced into `proves`, but the binder type (`ValidYaml`) is discarded. This means the parent structure never appears in any theorem's `proves` list, so its `verifiedBy` stays empty.
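The shape in question, sketched with hypothetical names (`parse` is invented; `stripAnnotations` and the `ValidYaml` projections come from the example above; proof elided):

```lean
-- The binder type `ValidYaml` is dropped by the current traversal, while
-- `ValidYaml.input` and `ValidYaml.value` are collected into `proves`.
theorem parse_produces_valid_yaml (s : String) :
    ∃ vy : ValidYaml, vy.input = s ∧ vy.value = parse (stripAnnotations s) := by
  sorry
```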
Fix — extract the binder type in the `Exists` branch of `collectHeadConstants`:

```lean
| .const ``Exists _, #[binderType, body] =>
  let binderNames ← collectHeadConstantsFromTerm binderType sourceDesc
  let bodyNames ← lambdaTelescope body fun _ innerBody =>
    collectHeadConstants innerBody sourceDesc internalPrefixes
  return binderNames ++ bodyNames
```

This uses `collectHeadConstantsFromTerm` (not `collectHeadConstants`) because the binder type is a Type-level expression, not a Prop. Existing filtering (`shouldFilter`, `isInternalName`) prevents stdlib types like `Nat` from leaking in.
Impact — for `parse_produces_valid_yaml`:
- Before: `proves = [ValidYaml.input, stripAnnotations, ValidYaml.value, ...]`
- After: `proves = [ValidYaml, ValidYaml.input, stripAnnotations, ValidYaml.value, ...]`
- `computeVerifiedByMap` then automatically populates `ValidYaml.verifiedBy`
Problem: Typeclass instances (e.g., `instance : Decidable (isFoldAppendChar c)`, `instance : BEq CollectionStyle`) are classified as plain definitions (`computationalOperation` or `mathematicalDefinition`). The bridge treats them identically to regular functions, missing important verification semantics:
- Self-certifying instances like `Decidable` are proofs by construction — their type encodes correctness
- Requires-lawful instances like `BEq` need companion proofs (`LawfulBEq`) for correctness
- Infrastructure instances like `ToString` and `Repr` are not verification-relevant
Solution — extensible typeclass rule system:

- `InstanceDisposition` enum classifies how instances should be handled:

  ```lean
  inductive InstanceDisposition where
    | selfCertifying                       -- Instance IS the proof (Decidable, LawfulBEq)
    | requiresLawful (lawfulClass : Name)  -- Needs companion proof (BEq → LawfulBEq)
    | infrastructure                       -- Not verification-relevant (ToString, Repr)
  ```

- `TypeclassRule` pairs a class name with its disposition:

  ```lean
  structure TypeclassRule where
    className   : Name
    disposition : InstanceDisposition
    description : String := ""
  ```

- `defaultTypeclassRules` provides rules for common typeclasses (extensible by adding entries):

  | Typeclass | Disposition | Rationale |
  |---|---|---|
  | `Decidable`, `DecidableEq`, `DecidablePred`, `DecidableRel` | `selfCertifying` | Decision procedure with proof |
  | `LawfulBEq`, `LawfulFunctor`, `LawfulMonad` | `selfCertifying` | Law-satisfaction proofs |
  | `BEq` | `requiresLawful LawfulBEq` | Needs correctness proof |
  | `Functor` | `requiresLawful LawfulFunctor` | Needs functor law proof |
  | `Monad` | `requiresLawful LawfulMonad` | Needs monad law proof |
  | `ToString`, `Repr`, `Inhabited`, `Nonempty`, `Hashable`, `Ord` | `infrastructure` | Not verification-relevant |

- `extractInstanceInfo` (Classify.lean) detects instances via `Meta.isInstance`, extracts the class and argument types, and looks up the matching rule.
- `computeVerifiedByMap` (TableData.lean) includes self-certifying instances in the reverse index — a `Decidable (isFoldAppendChar c)` instance now appears in `isFoldAppendChar.verifiedBy`.
- JSON output adds `instanceOf`, `instanceDisposition`, `instanceLawfulClass`, and `instanceArgTypes` fields to definition entries.
Impact:
- `instDecidableIsFoldAppendChar` → `instanceOf: Decidable`, `disposition: selfCertifying`, `argTypes: [isFoldAppendChar]` → fills `isFoldAppendChar.verifiedBy`
- `instBEqCollectionStyle` → `instanceOf: BEq`, `disposition: requiresLawful`, `lawfulClass: LawfulBEq`, `argTypes: [CollectionStyle]` → obligation tracked
- `instLawfulBEqCollectionStyle` → `instanceOf: LawfulBEq`, `disposition: selfCertifying`, `argTypes: [CollectionStyle]` → fills `CollectionStyle.verifiedBy`
| File | Change |
|---|---|
| Types.lean | Add InstanceDisposition, InstanceInfo, TypeclassRule, defaultTypeclassRules; add instanceInfo to DefData |
| Classify.lean | Add extractInstanceInfo, collectInstanceArgTypes, findTypeclassRule; call from classifyConstantLight/classifyConstant |
| TableData.lean | Self-certifying instances in computeVerifiedByMap; add instance fields to DefinitionTableEntry |
| Report.lean | Update DefData pattern matches (4 fields) |
| Unified.lean | Update DefData pattern matches (4 fields) |
| UnifiedBasic.lean | Update DefData pattern matches (4 fields) |
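The rule lookup itself can be small. A plausible sketch of `findTypeclassRule` (the real implementation lives in Classify.lean; the signature is assumed, not quoted):

```lean
-- Linear lookup of the instance's head class in the rule list.
-- `TypeclassRule` is the structure described above.
def findTypeclassRule (rules : Array TypeclassRule) (className : Name) :
    Option TypeclassRule :=
  rules.find? fun rule => rule.className == className
```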
Problem: A `def` returning a specification structure (e.g., `def scan_produces_valid_tokens : ... → ValidTokenStream`) is classified as `computationalOperation` because its return type is `Type`, not `Prop`. The bridge never populates `proves` for definitions — only theorems get inference. These "def-as-witness" patterns are invisible to verification coverage tracking.
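For concreteness, a hypothetical def-as-witness mirroring the `scan_produces_valid_tokens` shape (the structure fields and the def body are invented for illustration):

```lean
-- A specification structure; internally classified as computationalDatatype
structure ValidTokenStream where
  tokens   : List String
  nonEmpty : tokens ≠ []

-- The return type head is `ValidTokenStream`, so under the proposed heuristic
-- this def would get `witnessOf = [ValidTokenStream]`
def scan_produces_valid_tokens (input : String) : ValidTokenStream :=
  { tokens := [input], nonEmpty := by simp }
```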
Heuristic — "Structure-instance witness": A def is a witness if its return type (after stripping ∀-args) is an application of an internal structure/inductive classified as computationalDatatype.
Solution — add a `witnessOf` field to `DefData`:

```lean
structure DefData where
  category  : DefCategory
  hasSorry  : Bool
  witnessOf : Array Name := #[]  -- structures this def constructs instances of
```

A new helper `extractWitnessTargets` checks whether the return type head is an internal inductive:
```lean
def extractWitnessTargets (type : Expr) (internalPrefixes : Array String) : MetaM (Array Name) := do
  let env ← getEnv
  forallTelescope type fun _params body => do
    let body ← whnf body
    match body.getAppFn with
    | .const name _ =>
      if isInternalName env internalPrefixes name then
        match env.find? name with
        | some (.inductInfo _) => return #[name]
        | _ => return #[]
      else return #[]
    | _ => return #[]
```

Then `computeVerifiedByMap` includes `witnessOf` in the reverse index alongside theorem `proves`/`validates`.
Impact:
- `scan_produces_valid_tokens` → `witnessOf = [ValidTokenStream]` → fills `ValidTokenStream.verifiedBy`
- `toYamlValue_nodeToValue` → `witnessOf = [NodeToValue]` → adds to `NodeToValue.verifiedBy`
| File | Change |
|---|---|
| Types.lean | Add witnessOf : Array Name := #[] to DefData |
| Classify.lean | Add extractWitnessTargets, use in classifyConstantLight/classifyConstant (~20 lines) |
| TableData.lean | Include witnessOf in computeVerifiedByMap reverse index (~8 lines) |
| TableData.lean | Add witnessOf to DefinitionTableEntry JSON serialization (~3 lines) |
- Bubblewrap sandboxing (SECURITY.md, `scripts/sandbox-lake.sh`)
  - Two-phase build isolation: network allowed during fetch, disabled during compilation
  - Filesystem restrictions deny access to sensitive credentials
  - Unprivileged, minimal attack surface (~2000 lines of C)
- Three-phase refactoring (REFACTORING.md)
  - Phase 1: Two-package structure (DocVerificationBridge + Experiments)
  - Phase 2: Per-branch cleanup of version-specific file variants
  - Phase 3: Experiments decoupled — git clone, subprocess invocation, `docvb_version` config
- Fixed "Grammable" inference bug (Inference.lean:389-535)
  - Theorems can now correctly appear in both `assumes` and `proves`
  - Uses separate deduplication for hypotheses vs. conclusion
  - Critical for soundness/completeness proof chains
See REFACTORING.md for the refactoring plan and contribution guidelines.
Apache 2.0 (see LICENSE file)
- Nicolas Rouquette (California Institute of Technology)
- Contributors
This tool builds on: