AI21 Labs · AsyncAPI Specification
AI21 Studio Streaming API
Version 1.0.0
AsyncAPI description of AI21 Labs' documented streaming surface. The AI21 Studio Jamba chat completions endpoint streams partial responses over HTTP using Server-Sent Events (SSE) when the `stream` request parameter is `true`. Each event delivers an incremental `delta` for the assistant message; the stream terminates with the literal `[DONE]` event. Only event flows that are explicitly described in the AI21 documentation (https://docs.ai21.com) are modeled here. The Maestro and Conversational RAG endpoints do not currently document an SSE streaming mode and are therefore intentionally omitted.
View Spec
View on GitHub
AIFoundation ModelsLLMJambaMambaRAGAgentsMaestroInferenceEnterprise AIFine-TuningAsyncAPIWebhooksEvents
Channels
chat/completions
Open a streamed Jamba chat completion
Jamba chat completions stream. Clients POST a chat completion request with `stream: true` to `https://api.ai21.com/studio/v1/chat/completions` and the server responds with a `text/event-stream` body. Each `data:` line carries one JSON chunk; the final event is the literal string `[DONE]`.
Messages
ChatCompletionStreamRequest
Chat Completion Stream Request
Open the SSE stream for a Jamba chat completion.
ChatCompletionChunk
Chat Completion Chunk
One Server-Sent Event carrying a partial chat completion.
ChatCompletionDone
Chat Completion Stream Terminator
Stream terminator event.
Servers
https
studio
https://api.ai21.com/studio/v1
AI21 Studio production base URL. SSE responses are delivered over the same HTTPS endpoint as the standard REST API when `stream=true`.