ibm-granite/granite-guardian-3.0-2b

Open comparison →

Primitive: /generate · Generate · Granite

Long contextStreamingGuard

View on Hugging Face →

Overview

Hardware: — drives latency, throughput & cost

Size	2.5B params
Tasks	/generate
License	apache-2.0
Latency	—
Throughput	—
Cost	— /1M tok

Cost is approximate — computed from list GPU prices; your actual price depends on the provider you deploy SIE with.

Generation

Capabilities	Streaming · Guard
Context length	8,192
Max output tokens	512

Benchmarks

ToxicChat

safety generation en

Quality

guard F1 0.2780

Open source inference for agents

Open-source inference for the models behind your agents. Run it yourself, or let us run it for you.