Python
OCR
LLM
FastAPI

Receipt OCR Engine

Overview

Receipt OCR Engine extracts structured data from receipt images. It ships two modules:

  • receipt_ocr — LLM-powered extraction pipeline with a CLI, programmatic API, and a production FastAPI service
  • tesseract_ocr — Raw text extraction via Tesseract OCR with Docker support

Quick Start

pip install receipt-ocr
export OPENAI_API_KEY="your_key"
receipt-ocr images/receipt.jpg

Features

  • Structured JSON output (merchant, items, totals, tax)
  • CLI and programmatic API
  • Docker-ready FastAPI service
  • Full test coverage with pytest, reported via Codecov
  • MIT licensed
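
To make the structured JSON output feature concrete, here is a minimal sketch of how a consumer might load and sanity-check one result. The field names (merchant, items, tax, total, and the per-item name, quantity, price) and the sample payload are assumptions based on the feature list above, not the package's documented schema. The classes Receipt and LineItem and the function parse_receipt are hypothetical helpers written for this illustration and are not part of receipt_ocr.

```python
import json
from dataclasses import dataclass, field
from decimal import Decimal
from typing import List


@dataclass
class LineItem:
    # Hypothetical item shape; the real output may use different keys.
    name: str
    quantity: int
    price: Decimal


@dataclass
class Receipt:
    # Hypothetical top-level shape mirroring "merchant, items, totals, tax".
    merchant: str
    items: List[LineItem] = field(default_factory=list)
    tax: Decimal = Decimal("0")
    total: Decimal = Decimal("0")

    def subtotal(self) -> Decimal:
        # Sum of price * quantity over all line items.
        return sum((i.price * i.quantity for i in self.items), Decimal("0"))

    def is_consistent(self) -> bool:
        # True when the line items plus tax add up to the reported total.
        return self.subtotal() + self.tax == self.total


def parse_receipt(raw: str) -> Receipt:
    # parse_float=Decimal keeps monetary values exact instead of binary floats.
    data = json.loads(raw, parse_float=Decimal)
    items = [
        LineItem(
            name=i["name"],
            quantity=int(i["quantity"]),
            price=Decimal(str(i["price"])),
        )
        for i in data.get("items", [])
    ]
    return Receipt(
        merchant=data["merchant"],
        items=items,
        tax=Decimal(str(data.get("tax", "0"))),
        total=Decimal(str(data.get("total", "0"))),
    )


# Made-up sample payload, for illustration only.
SAMPLE = """
{
  "merchant": "Corner Cafe",
  "items": [
    {"name": "Latte", "quantity": 2, "price": 3.50},
    {"name": "Bagel", "quantity": 1, "price": 2.25}
  ],
  "tax": 0.74,
  "total": 9.99
}
"""

receipt = parse_receipt(SAMPLE)
print(receipt.merchant, receipt.subtotal(), receipt.is_consistent())
```

Parsing amounts as Decimal rather than float avoids rounding drift when checking that items plus tax match the total, which is a cheap way to flag extraction errors from an LLM-based pipeline.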