LLM Chat Streaming
Streams text chunks via plain HTTP (Transfer-Encoding: chunked).
Body
Project ID
LLM provider
openai, anthropic, bedrock, groq, gemini, xai, custom
Model name
"gpt-4"
Messages
Provider name (configured in LLM Providers)
"OpenAI Production"
Use legacy Chat Completions API when true
true
System instructions
Temperature
0 <= x <= 2
0.7
Max tokens
1000
Top P (nucleus sampling)
0 <= x <= 1
0.9
Frequency penalty
-2 <= x <= 2
0.3
Presence penalty
-2 <= x <= 2
0.2
Stop sequences
["END", "STOP"]
Seed for reproducible outputs
12345
Tools available for the model to call
Response format for structured output
{ "type": "json_object" }
Enable extended thinking/reasoning for supported models
{ "type": "enabled", "budgetTokens": 5000 }
Server-side tools (provider-specific). Anthropic: web_search, code_execution
[{ "type": "web_search_20250305" }]
Response
A text/plain stream of chunks. The stream ends when the LLM finishes generating.
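Because the response is a chunked text/plain stream rather than a single JSON document, a client should read it incrementally. The following is a minimal sketch using only the Python standard library, not an official client. The function names `decode_text_chunks` and `stream_chat` are hypothetical. The sketch assumes the endpoint accepts a POST with a JSON body and returns UTF-8 text; the URL and any authentication headers are not given in this section, so the caller supplies them.

```python
import codecs
import json
import urllib.request
from typing import Dict, Iterable, Iterator, Optional


def decode_text_chunks(chunks: Iterable[bytes], encoding: str = "utf-8") -> Iterator[str]:
    """Yield text as it arrives, even if a multibyte character is split across chunks."""
    decoder = codecs.getincrementaldecoder(encoding)(errors="replace")
    for chunk in chunks:
        text = decoder.decode(chunk)
        if text:
            yield text
    # Flush any bytes still buffered when the stream ends.
    tail = decoder.decode(b"", final=True)
    if tail:
        yield tail


def stream_chat(
    url: str,
    body: dict,
    headers: Optional[Dict[str, str]] = None,
    chunk_size: int = 1024,
) -> Iterator[str]:
    """POST the request body and yield decoded text until the server ends the stream.

    Assumptions: POST method, JSON request body, UTF-8 response text.
    `url` and `headers` (for example, authentication) come from the caller.
    """
    request = urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json", **(headers or {})},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        # http.client removes the chunked framing; read1() returns whatever
        # bytes are available instead of waiting to fill the whole buffer.
        raw = iter(lambda: response.read1(chunk_size), b"")
        yield from decode_text_chunks(raw)


if __name__ == "__main__":
    # Offline demonstration: the two bytes of "é" arrive in separate chunks.
    for piece in decode_text_chunks([b"h\xc3", b"\xa9llo", b" END"]):
        print(piece, end="", flush=True)
    print()
```

In real use, pass the JSON object described under Body as `body` and print or forward each yielded piece as it arrives. The incremental decoder matters because HTTP chunk boundaries fall at arbitrary byte offsets, so a naive `chunk.decode()` can fail on non-ASCII output.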