Your AI Can't Read Documents. This Fixes That.

The AI memory system that unlocks infinite context. One command. 154 tools. Every PDF, contract, and report -- searchable, comparable, and verifiable. Running 100% on your hardware. Ships with 1,150 Hormozi documents (1,147 video transcripts plus 3 books) ready to search.

$ npx -y ocr-provenance-mcp install

See it in action

terminal

$ npx -y ocr-provenance-mcp install
✓ Docker detected
✓ Pulling AI models...
✓ Starting container...
✓ License provisioned
✓ Registered with Claude Code
Ready. 154 tools available.

No API keys. No cloud. Your data never leaves your machine.

Works with Claude Code, Claude Desktop, Cursor, and Windsurf -- automatically.

GPU-accelerated on NVIDIA hardware. CPU mode handles .md and .txt files -- free processing, no GPU needed.
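Want to check your machine before you install? Here is a minimal sketch, not part of the product. It assumes the installer needs `docker` and `npx` on your PATH (the install output reports "Docker detected", and the command itself runs through `npx`) and that `nvidia-smi` being present signals a usable NVIDIA GPU. The required/optional split and the function name `check_prerequisites` are assumptions made for illustration.

```python
import shutil

# Assumption: these are the commands the installer relies on.
REQUIRED = ("docker", "npx")
# Assumption: nvidia-smi on PATH indicates an NVIDIA GPU is available.
OPTIONAL = ("nvidia-smi",)


def check_prerequisites(which=shutil.which):
    """Return (missing_required, missing_optional) command names."""
    missing_required = [cmd for cmd in REQUIRED if which(cmd) is None]
    missing_optional = [cmd for cmd in OPTIONAL if which(cmd) is None]
    return missing_required, missing_optional


if __name__ == "__main__":
    required, optional = check_prerequisites()
    if required:
        print("Missing required tools:", ", ".join(required))
    elif optional:
        print("No NVIDIA tooling found: expect CPU mode (.md and .txt files only).")
    else:
        print("Ready to install.")
```

If the required list comes back empty, you should be clear to run the install command.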

154 MCP Tools

5 AI Models

100% Local

GPU Accelerated

3,700+ Tests Passing

You Have Hundreds of Documents. Your AI Can't Touch Any of Them.

You copy-paste text from PDFs into chat windows
You manually search through folders to find what you need
You can't prove where extracted data came from
You have no way to search semantically across your document corpus
You lose hours doing work that should take seconds

"Every hour you spend doing this manually is an hour you're not spending on work that actually grows your business."

One Install. Your AI Gets 154 Tools.

Before

Copy-paste text from PDFs

Manually compare document versions

Can't prove data origin

No semantic search

Hours of manual work

Send documents to cloud APIs

After

"Search my 200 contracts for non-compete clauses" -- done in under 200ms

Structured diff with every change flagged by significance

SHA-256 cryptographic provenance chain, W3C PROV export

Hybrid search: BM25 + 768-dim vectors + cross-encoder reranking

Full pipeline: 1,150 docs processed in ~3 minutes

100% local. Your hardware. Your data.
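Curious what a "SHA-256 cryptographic provenance chain" looks like in practice? The sketch below shows the general technique, not the product's actual implementation: each processing stage stores a hash of its own content plus the hash of the record it came from, so tampering with any stage breaks every later link. The stage names, the record fields, and the functions `make_record`, `build_chain`, and `verify_chain` are all hypothetical. W3C PROV export is not covered here.

```python
import hashlib
import json
from typing import List, Optional, Tuple


def sha256_hex(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def make_record(stage: str, content: bytes, parent_hash: Optional[str]) -> dict:
    """Create one provenance record linked to its parent by hash."""
    record = {
        "stage": stage,                       # hypothetical stage label
        "content_hash": sha256_hex(content),  # fingerprint of this stage's output
        "parent_hash": parent_hash,           # None for the original document
    }
    # Hash a canonical JSON encoding so the record hash is reproducible.
    canonical = json.dumps(record, sort_keys=True).encode("utf-8")
    record["record_hash"] = sha256_hex(canonical)
    return record


def build_chain(steps: List[Tuple[str, bytes]]) -> List[dict]:
    """Link each stage's record to the record before it."""
    records, parent = [], None
    for stage, content in steps:
        record = make_record(stage, content, parent)
        records.append(record)
        parent = record["record_hash"]
    return records


def verify_chain(records: List[dict], contents: List[bytes]) -> bool:
    """Recompute every record from the content and compare with what was stored."""
    if len(records) != len(contents):
        return False
    previous_hash = None
    for record, content in zip(records, contents):
        if make_record(record["stage"], content, previous_hash) != record:
            return False
        previous_hash = record["record_hash"]
    return True
```

Build a chain from a document, its OCR text, and one chunk, then call `verify_chain` with the same bytes: it returns True. Change a single byte at any stage and it returns False.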

$ npx -y ocr-provenance-mcp install

See Exactly What It Does

Figure: OCR Provenance MCP system architecture. Documents flow through a local GPU processing engine (OCR, VLM, and Embedding workers) into 154 tool branches, with a SHA-256 provenance tracking chain.

Included: Alex Hormozi Business Strategy Super Skill -- Every install ships with a pre-processed, fully searchable database of 1,147 Alex Hormozi YouTube video transcripts (the last year of content) plus all 3 of his books ($100M Offers, $100M Leads, Gym Launch Secrets). That's 2.6+ million tokens of business strategy context -- more than fits in any context window. This is what we call a Super Skill: a skill that requires more context than a single context window can hold. Point your AI at this database and ask "What would Hormozi do?" about pricing, offers, lead gen, copywriting, or anything else. Ready to search the moment you install. No credits needed.

OCR Speed: ~2-5 sec/page

Embedding: ~12ms/chunk

Semantic Search: <100ms

Hybrid Search: <200ms

Full Pipeline (1,150 docs): ~3 min
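How does hybrid search combine BM25 with vectors? Here is a small, dependency-free sketch of the general idea, not the product's code. It scores documents with BM25 and with cosine similarity, then merges the two rankings. The merge uses reciprocal rank fusion, which is an assumption: this page does not say how the scores are combined. The toy vectors are 3-dimensional rather than 768-dimensional, and the optional `reranker` callable is a hypothetical stand-in for a cross-encoder model.

```python
import math
from collections import Counter


def bm25_scores(query_tokens, docs_tokens, k1=1.5, b=0.75):
    """Score every document against the query with the BM25 formula."""
    n = len(docs_tokens)
    avg_len = sum(len(doc) for doc in docs_tokens) / n
    doc_freq = Counter(term for doc in docs_tokens for term in set(doc))
    scores = []
    for doc in docs_tokens:
        tf = Counter(doc)
        score = 0.0
        for term in query_tokens:
            if tf[term] == 0:
                continue
            idf = math.log(1 + (n - doc_freq[term] + 0.5) / (doc_freq[term] + 0.5))
            norm = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def to_ranks(scores):
    """Map document index -> 1-based rank, best score first."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return {doc_id: rank for rank, doc_id in enumerate(order, start=1)}


def hybrid_search(query_tokens, query_vec, docs_tokens, doc_vecs,
                  top_k=3, rrf_k=60, reranker=None):
    lexical = to_ranks(bm25_scores(query_tokens, docs_tokens))
    semantic = to_ranks([cosine(query_vec, vec) for vec in doc_vecs])
    # Reciprocal rank fusion (assumed): reward documents ranked high in either list.
    fused = {i: 1 / (rrf_k + lexical[i]) + 1 / (rrf_k + semantic[i])
             for i in range(len(docs_tokens))}
    candidates = sorted(fused, key=fused.get, reverse=True)[:top_k]
    if reranker is not None:
        # Hypothetical cross-encoder stand-in: maps a document index to a relevance score.
        candidates = sorted(candidates, key=reranker, reverse=True)
    return candidates
```

The pattern is cheap retrieval first, expensive reranking last: BM25 and vector search narrow the corpus to a handful of candidates, and only those candidates reach the reranker.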

$ npx -y ocr-provenance-mcp install