Question 1

Is this an OCR benchmark?

Accepted Answer

It is an invoice extraction benchmark kit for field and line-item extraction, not a generic OCR benchmark.

Question 2

Is this real invoice data?

Accepted Answer

No. The invoices are synthetic.

Question 3

Can I run it locally?

Accepted Answer

Yes. The evaluator and supporting files ship inside the pack.

Question 4

What metrics are included?

Accepted Answer

Per-field accuracy (normalized exact match), line-item accuracy (positional), invoice-level success rate, coverage, and slice summaries by noise, layout, and difficulty.

Question 5

What is the difference between Starter and Full?

Accepted Answer

Starter is the main paid benchmark. Full keeps the same workflow and expands coverage.

Question 6

Why is Starter priced at this level?

Accepted Answer

Because it saves benchmark setup time and gives you a reusable local evaluation workflow instead of a raw file dump.

Question 7

Is this suitable for production validation?

Accepted Answer

It is useful for structured evaluation, comparison, and regression checks. It is not a substitute for validation on your own private documents.

Question 8

Can I compare two models with it?

Accepted Answer

Yes. That is one of the main use cases.

Question 9

What format does the evaluator expect?

Accepted Answer

The evaluator accepts a JSON file with one entry per document containing document_id, header_fields, and line_items. A predictions template ships inside the pack.

Question 10

What do I receive after purchase?

Accepted Answer

A downloadable benchmark kit with PDFs, labels, evaluator, reports, and supporting docs.

Question 11

Is there a short walkthrough before I request the sample?

Accepted Answer

Yes. There is a compact public walkthrough with one representative invoice, one sample report view, and one benchmark card view.

Question 12

How do you handle checkout?

Accepted Answer

The Free Sample is a direct download. Starter and Full are purchased by email directly from the seller. No checkout platform or account is required.

Question 13

Who sells this?

Accepted Answer

Filip Barcík, Cheb, Czech Republic (IČO 24517763). Full details on the imprint page.

Question 14

Do you offer custom benchmarks or datasets?

Accepted Answer

Yes. We build custom benchmarks and datasets for any document type, language, or domain. Contact info@ocurka.com with your requirements.

Direct answers for technical buyers.

Start with the free sample.