Why did we open-source our inference engine? Read the post

← Catalog

fastino/gliner2-large-v1

Open comparison →

Primitive: /extract · Extract · extractor

> Extract entities, classify text, parse structured data, and extract relations—all in one efficient model.

MultilingualEntities

Overview

Hardware: — drives latency, throughput & cost

Size486M params
Tasks /extract
Licenseapache-2.0
Languagesen, fr, es
Latency
Throughput
Cost /1M tok

Cost is approximate — computed from list GPU prices; your actual price depends on the provider you deploy SIE with.

Extraction

Output kindsEntities
Inputstext
Max sequence length512

Benchmarks

CoNLL-2003

news ner en

Named entity recognition on Reuters newswire text

Corpus: 3,453 Queries: 3,453
Quality
f1 0.5366
precision 0.4531
recall 0.6578
Reference →

Open source inference for agents

Open-source inference for the models behind your agents. Run it yourself, or let us run it for you.

Github 2.1K

Contact us

Tell us about your use case and we'll get back to you shortly.

Apply for an inference grant

Free capacity on our hosted cluster for selected projects. Tell us what you run and we reply by email.