Which API capabilities are supported?
Last updated: June 30, 2025
All models:
Streaming
Streaming with structured outputs
Structured outputs
Tool calling
Multi-turn tool calling
Temperature, top P, logit probabilities
Some models:
Parallel tool calling
Multi turn tool calling
Tool Calling w/ Structured Outputs
Streaming w/ Structured Outputs
Streaming w/ Tool Calling
Here is a breakdown of the limitations for each model:
llama3.1-8bParallel Tool Calling
llama-3.3-70bTool Calling w/ Structured Outputs
Multi-turn tool calling
llama-4-scout-17b-16e-instructParallel Tool Calling
qwen-3-32bStreaming w/ Structured Outputs
Parallel Tool Calling
Streaming w/ Tool Calling
deepseek-r1-distill-llama-70bStreaming w/ Structured Outputs
Parallel Tool Calling