Doubleword
    Pricing

    Same intelligence. Up to 99% cheaper.

    One API. Open-weight models. Pick your delivery window — Async or Overnight — and pay only for what you use.

    Per-token pricing

    Same Intelligence. Fraction of the price.

    Cost to process 1 billion tokens in + 1 billion tokens out at comparable intelligence.

    Model
    Anthropic
    $18K
    OpenAI
    $15.8K
    Industry Average
    $5.0K
    Doubleword (Async)
    $3.7K
    $0$4.5K$9K$13.5K$18K

    Intelligence via Artificial Analysis Index v4.0 · Hover any bar for full pricing details · Want access to a model you don't see here — just ask us!

    No credit card required · No minimum spend · Pay only for tokens used

    Start building

    Three speeds. One API.

    Pick the delivery window that fits your workflow. All tiers use the same OpenAI-compatible API.

    Dev Mode (Real-time)

    Iterate on prompts with real-time responses. Full price, zero wait.

    Async Inference

    up to 50% off RT

    Background agents that need results fast. Async delivery with SLA guarantee.

    Overnight (~24 hours)

    up to 75% off RT

    Big batch jobs where cost matters most. Deepest discounts.

    Full pricing

    Model-by-model breakdown

    ModelSLAInput $/MTokOutput $/MTokCost / 1B in+outvs Big Token
    Kimi-K2.6NewAsync$0.70$3.00$3.7K79% cheaperTry Model API
    Overnight (24H)$0.45$2.00$2.5K86% cheaper
    GLM-5.1-FP8NewAsync$1.05$3.30$4.3K76% cheaperTry Model API
    Overnight (24H)$0.70$2.20$2.9K84% cheaper
    Qwen3.5-397B-A17BNewAsync$0.30$1.80$2.1K93% cheaperTry Model API
    Overnight (24H)$0.15$1.20$1.3K96% cheaper
    Qwen3.6-35B-A3B-FP8NewAsync$0.07$0.30$37098% cheaperTry Model API
    Overnight (24H)$0.05$0.20$25099% cheaper
    Qwen3.5-35B-A3B-FP8NewAsync$0.07$0.30$37094% cheaperTry Model API
    Overnight (24H)$0.05$0.20$25096% cheaper
    Qwen3.5-4BNewAsync$0.05$0.08$13099% cheaperTry Model API
    Overnight (24H)$0.04$0.06$10099% cheaper
    Qwen3.5-9BNewAsync$0.04$0.35$39094% cheaperTry Model API
    Overnight (24H)$0.03$0.29$32095% cheaper
    Gemma-4-31BNewAsync$0.11$0.30$41093% cheaperTry Model API
    Overnight (24H)$0.07$0.20$27096% cheaper
    Nemotron-3-Super-120B-A12BNewAsync$0.23$0.56$79087% cheaperTry Model API
    Overnight (24H)$0.15$0.38$53091% cheaper
    GPT-OSS-20BAsync$0.03$0.20$23098% cheaperTry Model API
    Overnight (24H)$0.02$0.15$17098% cheaper
    Qwen3-VL-235B-A22BAsync$0.15$0.55$70069% cheaperTry Model API
    Overnight (24H)$0.10$0.40$50078% cheaper
    Qwen3-VL-30B-A3BAsync$0.07$0.30$37075% cheaperTry Model API
    Overnight (24H)$0.05$0.20$25083% cheaper
    Qwen3-14B-FP8Async$0.03$0.30$33078% cheaperTry Model API
    Overnight (24H)$0.02$0.20$22085% cheaper
    DeepSeek-OCR-2OCRAsync$0.08$0.08$160Try Model API
    Overnight (24H)$0.05$0.05$100
    olmOCR-2-7BOCRAsync$0.15$0.15$300Try Model API
    Overnight (24H)$0.10$0.10$200
    LightOnOCR-2-1BOCRAsync$0.08$0.08$160Try Model API
    Overnight (24H)$0.05$0.05$100
    Qwen3-Embedding-8BAsync$0.03$30Try Model API
    Overnight (24H)$0.02$20

    No surprises. No lock-in.

    No credit card required
    No minimum spend
    Pay only for tokens used
    Results stream as they're ready
    OpenAI-compatible API
    Cancel or retry any batch, any time

    Stop overpaying for inference.

    Run your background agents and workloads at a fraction of the price and double the scale.

    If you can wait an hour, you can save a lot.