Why did we open-source our inference engine? Read the post

mynkchaudhry/Florence-2-FT-DocVQA

This model card provides details about the Florence-2-FT-DocVQA model, which is fine-tuned for Document Visual Question Answering (VQA) tasks.

Overview

Architecture
Florence-2
Parameters
271M
Tasks
Extract
Outputs
text_regions
License
apache-2.0
Languages
en

Benchmarks

DocVQA

general kie en

Visual question answering on document images

Corpus: 5,188 Queries: 5,188
Quality
anls 0.3521
exact match 0.2600
Performance L4 b1 c16
Reference →

Open source inference for agents

Open-source inference for the models behind your agents. Run it yourself, or let us run it for you.

Github 2.1K

Contact us

Tell us about your use case and we'll get back to you shortly.

Apply for an inference grant

Free capacity on our hosted cluster for selected projects. Tell us what you run and we reply by email.