Superpowers · Vision

Read any document.
Get back clean data.

Vision turns a photo, a scan or a PDF into the exact data you need — text, tables, fields, answers, a document type. It reads the messy real world, checks its own work, and hands the result straight to your workflows. One capability instead of a stack of OCR tools.

ReadExtractAskClassifyHand off
tiny command · Vision · invoice → fields
Extracted fields
Invoice #INV-204299%
VendorAcme Distributors98%
Date01 May 202697%
Total$1,250.0099%
Built into every form and workflow. No screenshot — that's the real shape of the output.

One drop-in. A whole document stack.

OCR, PDF reading, tables, fields, document Q&A, classification — and more — in a single capability, instead of stitching together a handful of separate tools.

Read text (OCR)

Pull every word out of a photo, screenshot or scan — handwriting included.

Read PDFs

Digital or scanned, page by page, with the title and author too.

Pull tables

Lift a table out of a document with the rows and columns intact.

Pull fields

Grab the exact values you name — invoice number, total, vendor, date.

Ask a document

Ask a plain-English question and get the answer, as text or clean JSON.

Classify

Auto-detect what a document is and route it the right way.

and alsoHandwritingBarcodes & QRCheckboxesSignaturesMake PDFs searchableConvert to Markdown

Ask your documents a question.

You don't always want the whole page — you want one answer. Point Vision at a contract, an invoice or a report and ask in plain English. Get it back as text, or as clean JSON ready for the next step.

  • “What's the total due, and by when?”
  • “Find the termination clause.”
  • “List every line item with its price.”
What's the total due and the due date?
Answer · JSON
{ total: "$1,250.00", due_date: "2026-05-30" }

It picks the fastest, cheapest way to read each file.

Most tools send every page to one expensive model. Vision doesn't. It tries the quick, free path first, and only escalates to heavy AI when a document is genuinely hard — so it stays fast and affordable at any volume.

01Instant & free

A PDF that already has text? Vision reads it directly — no AI, no wait, no cost.

02Fast read

Scans, photos and handwriting get cleaned up and read quickly, right on our servers.

03Deep AI, only when needed

A badly degraded scan or odd layout escalates to a powerful AI model that looks at the page itself.

Built-in cloud backup and a self-healing safety net mean the answer keeps coming even if one path hiccups. Every result tells you which method read it.

It even checks its own math.

The real danger with reading invoices isn't a missed word — it's a confident, wrong number. So Vision reconciles what it reads: every line has to satisfy amount = rate × qty. Rows that don't add up are flagged, not trusted — and a human only looks at those.

ProductQtyRateAmountCheck
Amoxicillin 500mg10₹42.00₹420.00✓ matches
Paracetamol 650mg24₹18.50₹444.00✓ matches
Cough Syrup 100ml6₹95.00₹620.00⚠ review
Vitamin D3 60k12₹31.00₹372.00✓ matches

That one flagged row? 6 × ₹95.00 = ₹570, not ₹620. Vision caught it mechanically and sent the document to review — instead of letting a transposed price reach your books.

Built for the documents you actually get.

Not clean PDFs in a lab — the crumpled, photographed, handwritten reality.

Bad scans, fixed

Low light, skew and blur are auto-corrected before Vision reads a thing.

Photos & screenshots

A phone photo of a receipt or a screenshot is fair game, not just clean files.

Handwriting

A dedicated pass reads handwriting and cleans up the shaky bits on its own.

Any language

Multilingual out of the box — mixed-language documents included.

Whole batches

Send one file or hundreds in a single call; results come back as a list.

Upload, link or data

Hand it a file, a URL, or raw data — whatever your workflow already has.

The data goes straight to work.

Vision isn't a dead end. Drop it into a form or a workflow and route what it reads — classify the document, pull the fields, and send them onward without anyone retyping a thing.

Classify the document
Route by type
Pull the fields
Save to Tables
Email the team

All on one canvas — no glue code, no exports, no second tool.

Six ways to drop it in.

Vision lives in the builder as six nodes — add them to a Tiny Form or a Workflow, wire a file in, and route the result onward. Metered in the same credits as everything else.

Extract Text5 credits

Read text from any image, with OCR.

Read PDF5 credits

Extract text from any PDF, page by page.

Extract Table8 credits

Turn document tables into structured rows.

Extract Fields8 credits

Pull the key-value pairs you ask for.

Ask Document10 credits

Ask questions about any document.

Classify3 credits

Auto-detect the document type.

How it compares.

The cloud document-AI services are powerful and accurate. The difference is that Vision is built into the place you already work, and it does the parts they leave to you.

 Tiny Command VisionCloud document-AIOCR librariesDoc-parsing SaaS
OCR, tables, fields, Q&A and classify in onemostNomost
Ask a document a plain-English questionadd-onNolimited
Checks the math and flags bad rowsNoNoNo
Built into your forms & workflows, no codeAPI onlyAPI onlysome
One bill with the rest of your stackNoNoNo

If you need a raw OCR API to wire up yourself, the big clouds do that well. If you want documents read, reconciled and acted on inside the tools you already use, that's what we built.

Good to know.

What can it read? +

PDFs (digital or scanned), photos, screenshots and images, plus Office documents and spreadsheets — up to 20 MB a file. Invoices, receipts, contracts, IDs, forms, bank statements, handwritten notes and more, in many languages.

How accurate is it on bad scans? +

Vision auto-cleans low-light, skewed and blurry images before reading, and escalates the genuinely hard pages to a powerful AI model. Every field comes back with a confidence score, so you review the few that need it instead of re-checking everything.

Do I need to write any code? +

No. Vision is six drag-in nodes inside Tiny Forms and Workflows — add one, point it at a file, and route the result to a table, an email or the next step. No keys, no servers, nothing to host.

Where does the extracted data go? +

Wherever you send it. The output is clean, structured data, so it drops straight into your own Tiny Tables, kicks off a workflow, or fills an email — no exporting and re-importing between tools.

What does it cost? +

It's metered in the same credits as the rest of Tiny Command — from 3 credits to classify a document up to 10 to ask one a question — on every plan, including Free. No separate vendor bill.

Can it handle a whole batch? +

Yes. Send one file or hundreds in a single call and the results come back as a list, each with its own fields and confidence scores.

Stop retyping what's already on the page.

Drop Vision into a form or a workflow and let it read, check and hand off your documents. Free to start.