Image to Text
Extract readable text from JPG, PNG, and BMP files.
UnlimitedOCR brings an OCR.ac-style upload workflow into a ShipAny application, with Baidu OCR wired as the first provider.
Supports JPG, PNG, BMP, PDF, and OFD. PDF files are processed by page through the provider.
OCR tools
Extract readable text from JPG, PNG, and BMP files.
Send PDF pages to Baidu OCR and collect clean text.
Turn product screenshots and UI captures into copyable text.
Recognize invoice, receipt, and form content for workflows.
Workflow
API
The app exposes a local Next.js route that keeps Baidu credentials server-side and returns normalized OCR text, lines, provider, and raw response data.
POST /api/ocr
Content-Type: application/json
{
"provider": "baidu_general_basic",
"imageBase64": "...",
"url": "https://example.com/document.png"
}Yes. The project keeps the ShipAny Next.js structure, theme system, Tailwind setup, layouts, and shared UI components.
It works after BAIDU_OCR_API_KEY and BAIDU_OCR_SECRET_KEY are configured in the runtime environment.
Yes. The /api/ocr route is shaped around provider selection so Google, Azure, DeepSeek self-hosted, or Unlimited-OCR can be added next.