MCP OCR Server

MCP server for OCR using native Tesseract (C++), built with Node.js, delivering high-performance OCR and integrable with ChatGPT Desktop.

🚧 Work in Progress 🚧

✨ Features (planned)

High-performance OCR via native Tesseract (C++)
Node.js MCP server wrapper for easy integration
Compatible with ChatGPT Desktop and other MCP clients
Benchmark vs tesseract.js

📌 Roadmap

Step 1: C++ OCR CLI tool
Step 2: Node.js MCP server wrapper
Step 3: ChatGPT Desktop configuration guide
Step 4: Benchmark results
Step 5: Demo video

🔧 Tech Stack

C++ (Tesseract OCR)
Node.js + TypeScript (@modelcontextprotocol/sdk)
JSON-RPC 2.0 (MCP standard)

🛠 Installation

1. Install Tesseract OCR

macOS

brew install tesseract
# Optional: install additional languages
brew install tesseract-lang

Linux (Ubuntu/Debian)

sudo apt update
sudo apt install tesseract-ocr libtesseract-dev libleptonica-dev
# Optional: install Vietnamese language
sudo apt install tesseract-ocr-vie

Windows

Download installer from Tesseract OCR GitHub
Or using Chocolatey:

choco install tesseract

Add the installation path to your PATH environment variable.

2. Clone the repository

git clone https://github.com/dangvinh/mcp-ocr-server.git
cd mcp-ocr-server/cpp

3. Build the project with CMake

You can build the C++ OCR engine using the provided npm script. Run:

npm run build-core

This command will create the cpp/build-core directory, configure the project with CMake, and build the static library and CLI tool.

What it builds:

libmcp_ocr.a static library
ocr_cli executable in cpp/build-core/bin (or equivalent)

Running tests

# From the build directory
ctest --verbose

This will run all GoogleTest-based tests.
Ensure test images or resources exist in cpp/tests or examples/.
The setup works cross-platform (macOS, Linux, Windows).

4. Build the Node.js addon

The Node.js addon can be built using the provided npm script. Run:

npm run build-addon

This command runs node-gyp inside the cpp/ directory and produces the compiled addon (ocr_addon.node) inside cpp/build/Release/. This addon is required for Node.js integration with the C++ core.

🗂 Setup tessdata

The OCR engine requires trained data files to work. Please follow these steps:

Create a tessdata folder in the project root:

mkdir tessdata

Download the English trained data:

wget https://github.com/tesseract-ocr/tessdata/raw/main/eng.traineddata -P tessdata/

For other languages, download the corresponding .traineddata files into tessdata/.
Ensure your .env or .env.example has:

TESSDATA_PREFIX=./tessdata
OCR_LANG=eng

4. Run OCR CLI

./ocr_cli path/to/image.png

Ensure the tessdata folder is accessible for language files. The project supports macOS, Linux, and Windows (cross-platform).

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.github		.github
.husky		.husky
cpp		cpp
examples		examples
src		src
tests		tests
.editorconfig		.editorconfig
.env.example		.env.example
.gitignore		.gitignore
.prettierrc.js		.prettierrc.js
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
commitlint.config.cjs		commitlint.config.cjs
eslint.config.js		eslint.config.js
jest.config.cjs		jest.config.cjs
package-lock.json		package-lock.json
package.json		package.json
tsconfig.build.json		tsconfig.build.json
tsconfig.eslint.json		tsconfig.eslint.json
tsconfig.json		tsconfig.json
tsconfig.test.json		tsconfig.test.json
tsup.config.ts		tsup.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

MCP OCR Server

✨ Features (planned)

📌 Roadmap

🔧 Tech Stack

🛠 Installation

1. Install Tesseract OCR

macOS

Linux (Ubuntu/Debian)

Windows

2. Clone the repository

3. Build the project with CMake

What it builds:

Running tests

4. Build the Node.js addon

🗂 Setup tessdata

4. Run OCR CLI

About

Uh oh!

Releases

Packages

Languages

License

dangvinh/mcp-ocr-server

Folders and files

Latest commit

History

Repository files navigation

MCP OCR Server

✨ Features (planned)

📌 Roadmap

🔧 Tech Stack

🛠 Installation

1. Install Tesseract OCR

macOS

Linux (Ubuntu/Debian)

Windows

2. Clone the repository

3. Build the project with CMake

What it builds:

Running tests

4. Build the Node.js addon

🗂 Setup tessdata

4. Run OCR CLI

About

Topics

Resources

License

Code of conduct

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages