Why did we open-source our inference engine? Read the post

← Catalog

Marqo/marqo-fashionSigLIP

Open comparison →

Primitive: /encode · Encode · SigLIP

Marqo Fashion Siglip 2 is available. marqo-fashion-SigLip-2 has shown a further 78% improvement in MMR and recall vs marqo-fashion-SigLip. Contact Marqo to learn more: https://www.marqo.ai/book-demo

MultimodalDense

Overview

Hardware: — drives latency, throughput & cost

Size203M params
Tasks /encode
Licenseapache-2.0
Languagesen
Latency
Throughput
Cost /1M tok

Cost is approximate — computed from list GPU prices; your actual price depends on the provider you deploy SIE with.

Embedding

Output typesDense
Dimensionsdense: 768
Max sequence length64
Inputstext · image

Benchmarks

Flickr30kI2TRetrieval

general retrieval en

Image-to-text retrieval: retrieve captions from images

Corpus: 31,783 Queries: 1,000
Quality
ndcg at 10 0.8383
map at 10 0.7556
mrr at 10 0.9376
Reference →

Open source inference for agents

Open-source inference for the models behind your agents. Run it yourself, or let us run it for you.

Github 2.1K

Contact us

Tell us about your use case and we'll get back to you shortly.

Apply for an inference grant

Free capacity on our hosted cluster for selected projects. Tell us what you run and we reply by email.