All projects
Python
OCR
LLM
FastAPI
Receipt OCR Engine
Overview
Receipt OCR Engine is a comprehensive solution for extracting structured data from receipt images. It ships two modules:
receipt_ocr— LLM-powered extraction pipeline with a CLI, programmatic API, and a production FastAPI servicetesseract_ocr— Raw text extraction via Tesseract OCR with Docker support
Quick Start
pip install receipt-ocr
export OPENAI_API_KEY="your_key"
receipt-ocr images/receipt.jpg
Features
- Structured JSON output (merchant, items, totals, tax)
- CLI and programmatic API
- Docker-ready FastAPI service
- Full test coverage via pytest + codecov
- MIT licensed