Question 1

Which models are available?

Accepted Answer

20+ models across OpenAI, Anthropic, Google, Groq, Mistral, Cohere, Together, Perplexity, HuggingFace, OpenRouter, FAL, Stability, Runway, Luma, ElevenLabs and Firecrawl. New providers are added continuously.

Question 2

How does billing work?

Accepted Answer

Usage-based: per-token for chat, per-call for media generation. Top up via Stripe, PayPal, Razorpay or Paddle. Low-credit alerts via Slack and Discord.
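As a rough illustration of the usage-based model above, the sketch below estimates a bill from token and call counts. Everything in it is an assumption made for illustration: the `Rates` class, the split between input and output tokens, and the numeric rates are placeholders, not published prices or a documented API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Hypothetical price sheet; the numbers used below are placeholders."""
    input_per_million: float   # credits per 1,000,000 input tokens
    output_per_million: float  # credits per 1,000,000 output tokens
    per_media_call: float      # credits per image/video/audio generation call


def chat_cost(rates: Rates, input_tokens: int, output_tokens: int) -> float:
    """Per-token billing: cost scales linearly with the tokens consumed."""
    total = (input_tokens * rates.input_per_million
             + output_tokens * rates.output_per_million)
    return total / 1_000_000


def media_cost(rates: Rates, calls: int) -> float:
    """Per-call billing: a flat charge for every generation request."""
    return calls * rates.per_media_call


if __name__ == "__main__":
    # Placeholder rates, chosen only to make the arithmetic easy to follow.
    rates = Rates(input_per_million=2.0, output_per_million=8.0, per_media_call=0.05)
    print(f"chat:  {chat_cost(rates, 1000, 500):.4f} credits")
    print(f"media: {media_cost(rates, 3):.4f} credits")
```

Substitute the real rates for the models you use; the structure of the calculation stays the same.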

Question 3

Is streaming supported?

Accepted Answer

Yes. SSE streaming is the default for chat. Image and video generation run as async jobs that you can poll or subscribe to via webhooks.
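To make the two delivery modes concrete, here is a minimal sketch of what a client might do on each path. `iter_sse_data` follows the standard server-sent events framing (`data:` lines, blank line ends an event). `poll_job` is a generic polling loop. The function names, the `fetch_status` callback, and the `"succeeded"`/`"failed"` status values are assumptions for illustration, not the service's documented API.

```python
import time
from typing import Callable, Dict, Iterable, Iterator


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield the data payload of each server-sent event found in `lines`."""
    buffer = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line terminates the current event.
            if buffer:
                yield "\n".join(buffer)
                buffer = []
            continue
        if line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        field, _, value = line.partition(":")
        if field == "data":
            # The SSE format drops one leading space after the colon.
            buffer.append(value[1:] if value.startswith(" ") else value)
    # An event with no terminating blank line is discarded, as the SSE spec requires.


def poll_job(fetch_status: Callable[[], Dict],
             interval: float = 1.0,
             max_attempts: int = 30,
             sleep: Callable[[float], None] = time.sleep) -> Dict:
    """Call `fetch_status` until the job reaches a terminal state.

    `fetch_status` is a caller-supplied function returning a dict with a
    "status" key; the terminal status names here are hypothetical.
    """
    for _ in range(max_attempts):
        job = fetch_status()
        if job.get("status") in ("succeeded", "failed"):
            return job
        sleep(interval)
    raise TimeoutError("job did not finish within the polling budget")


if __name__ == "__main__":
    stream = ["data: Hel\n", "\n", ": keep-alive\n", "data: lo\n", "\n"]
    print(list(iter_sse_data(stream)))

    states = iter([{"status": "pending"}, {"status": "succeeded"}])
    print(poll_job(lambda: next(states), interval=0))
```

Polling is the simpler option when you cannot expose a public endpoint. Webhooks avoid the repeated requests at the cost of running a receiver.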

Question 4

Can we self-host or get a private deployment?

Accepted Answer

Yes — dedicated and on-prem deployments are available for enterprise customers. Talk to us about your requirements.

The whole stack behind one key.

Ten capabilities. One API.

Streaming chat

Image generation

Video generation

Text-to-speech

Speech-to-text

Real-time voice agents

Web search

Data extraction

Code execution

Translation

Use the SDK you already have.

Common questions

Ready to build with every model?