ScriptComputer Visionv1.3

OCR Pipeline Script

by Community · free · Last verified 2026-03-17

This script provides a sophisticated OCR pipeline that intelligently routes documents to the most suitable engine—Tesseract, PaddleOCR, or a cloud API—based on image quality analysis. It processes various document types and outputs structured JSON containing text sorted by reading order, complete with bounding box coordinates and confidence scores for each word or line.

https://github.com/PaddlePaddle/PaddleOCR ↗

C—Below Average

Adoption: AQuality: B+Freshness: ACitations: FEngagement: F

Specifications

License: MIT
Pricing: free
Capabilities: Dynamic OCR engine routing based on image heuristics, Structured JSON output with detailed metadata, Reading order sorting for logical text flow, Confidence scoring at word and block level, Integration with multiple open-source (Tesseract, PaddleOCR) and cloud OCR APIs, Image pre-processing for quality enhancement, Bounding box coordinates for text localization, Batch processing of document folders, Configurable routing logic
Integrations: [object Object], [object Object], [object Object], [object Object], [object Object], [object Object]
Use Cases: [object Object], [object Object], [object Object], [object Object], [object Object]
API Available: No
Language: python
Dependencies: paddlepaddle, paddleocr, pytesseract, opencv-python, pillow
Environment: Python 3.9+
Est. Runtime: 1-10 minutes
Tags: ocr, text-extraction, document-ai, tesseract, paddleocr, computer-vision, python-script, json-output, document-processing, intelligent-document-processing, data-extraction
Added: 2026-03-17
Completeness: 80%

Index Score

Adoption

Quality

Freshness

Citations

Engagement

Need this tool deployed for your team?

Get a Custom Setup

Explore the full AI ecosystem on Agents as a Service