naver-clova-ix/donut-base-finetuned-docvqa

Donut model fine-tuned on DocVQA. It was introduced in the paper OCR-free Document Understanding Transformer by Geewook Kim et al. and first released in this repository.

Overview

Architecture: Encoder-Decoder
Parameters: 110M
Tasks: Extract
Outputs: text_regions
License: MIT

Benchmarks

DocVQA

general kie en

Visual question answering on document images

Corpus: 5,188 Queries: 5,188
Quality: ANLS 0.6350
Performance L4-SPOT b1 c4
Performance L4 b1 c16
Reference →
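
ANLS stands for Average Normalized Levenshtein Similarity, the metric commonly reported for DocVQA. The sketch below shows how such a score is typically computed. It assumes the usual 0.5 threshold and case-insensitive, whitespace-trimmed comparison. The function names are illustrative, and this is not necessarily the exact evaluation script behind the figure reported here.

```python
from typing import List, Sequence


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (two-row dynamic programming)."""
    if len(a) < len(b):
        a, b = b, a  # keep the inner row as short as possible
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution or match
            ))
        previous = current
    return previous[-1]


def anls(predictions: Sequence[str],
         references: Sequence[List[str]],
         threshold: float = 0.5) -> float:
    """Mean over questions of the best per-answer similarity.

    Assumption: similarity is 1 - NL when the normalized distance NL is
    below `threshold`, and 0 otherwise.
    """
    scores = []
    for pred, answers in zip(predictions, references):
        pred_norm = pred.strip().lower()
        best = 0.0
        for answer in answers:
            ans_norm = answer.strip().lower()
            longest = max(len(pred_norm), len(ans_norm))
            nl = levenshtein(pred_norm, ans_norm) / longest if longest else 0.0
            similarity = 1.0 - nl if nl < threshold else 0.0
            best = max(best, similarity)
        scores.append(best)
    return sum(scores) / len(scores) if scores else 0.0
```

Each question may have several accepted answers, so `references` is a list of lists and only the closest match counts. The threshold zeroes out predictions that are mostly wrong while still giving partial credit for small recognition errors.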
