modelparams.dev

Every parameter, for every model.

An open, community-maintained catalog of LLM model parameters. Search, filter, and link straight to the knobs you can turn. API-key and subscription variants of the same model are listed separately, because they behave differently.

61 of 61 models

Anthropic Claude 3.5 Haiku 20241022 API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
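The Type column combines a range with an optional step, for example `number (0…1 step 0.1)` or `integer (1…+∞)`. A minimal sketch of checking a value against such a row follows. The helper name `check_number` is hypothetical, and treating the step as a hard constraint is an assumption: the catalog does not say whether providers reject off-step values or whether the step only describes UI granularity.

```python
from decimal import Decimal


def check_number(value, lo, hi=None, step=None):
    """Return True if value is within [lo, hi] and, if step is given, on the step grid.

    hi=None stands for the catalog's "+∞" upper bound.
    Decimal is used so that 0.7 counts as a multiple of 0.1 despite float rounding.
    """
    v = Decimal(str(value))
    if v < Decimal(str(lo)):
        return False
    if hi is not None and v > Decimal(str(hi)):
        return False
    if step is not None:
        return (v - Decimal(str(lo))) % Decimal(str(step)) == 0
    return True


# temperature: number (0…1 step 0.1)
print(check_number(0.7, 0, 1, 0.1))
# max_tokens: integer (1…+∞)
print(check_number(4096, 1))
```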
Anthropic Claude 3.5 Haiku Latest API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Anthropic Claude 3.5 Sonnet 20241022 API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Anthropic Claude 3.5 Sonnet Latest API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Anthropic Claude 3.7 Sonnet 20250219 API key 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
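The Condition column for this entry can be read as a small set of rules. The sketch below applies them to a flat dictionary keyed by the parameter names shown above. The function name `applicable_params` is hypothetical, and silently dropping a parameter (rather than raising an error) is a design choice made for this illustration, not something the catalog prescribes.

```python
def applicable_params(params):
    """Return a copy of params without entries whose listed condition is not met."""
    thinking_type = params.get("thinking.type", "disabled")
    out = dict(params)

    if thinking_type in {"adaptive", "enabled"}:
        # Sampling rows: 'Not when thinking.type ∈ {"adaptive", "enabled"}'
        for name in ("temperature", "top_p", "top_k"):
            out.pop(name, None)
    elif out.get("temperature", 1) != 1:
        # top_p row: '... or temperature ≠ 1' (1 is the listed default)
        out.pop("top_p", None)

    if thinking_type != "enabled":
        # budget_tokens row: 'Only when thinking.type = "enabled"'
        out.pop("thinking.budget_tokens", None)
    return out


print(applicable_params({
    "thinking.type": "enabled",
    "thinking.budget_tokens": 2048,
    "temperature": 0.5,
}))
```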
Anthropic Claude 3.7 Sonnet Latest API key 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Anthropic Claude 3 Opus 20240229 API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Anthropic Claude 3 Opus Latest API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Anthropic Claude Haiku 4 API key 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Anthropic Claude Haiku 4.5 API key 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Anthropic Claude Haiku 4.5 20251001 API key 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type = "enabled" or top_p ≠ null
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type = "enabled" or temperature ≠ null
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type = "enabled"
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
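In this entry the temperature and top_p conditions refer to each other (`top_p ≠ null`, `temperature ≠ null`), which reads as "set one or the other, not both". A minimal sketch of that check follows. `check_sampling` is a hypothetical helper, and `None` is assumed to correspond to the catalog's `null` (parameter not sent).

```python
def check_sampling(temperature=None, top_p=None, thinking_type="disabled"):
    """Return a list of condition violations for the rows above; empty means none."""
    problems = []
    if temperature is not None and top_p is not None:
        # temperature: 'Not when ... top_p ≠ null'; top_p: 'Not when ... temperature ≠ null'
        problems.append("set temperature or top_p, not both")
    if thinking_type == "enabled":
        # Both rows also say: 'Not when thinking.type = "enabled"'
        for name, value in (("temperature", temperature), ("top_p", top_p)):
            if value is not None:
                problems.append(f"{name} is not available when thinking.type is enabled")
    return problems


print(check_sampling(temperature=0.3))
print(check_sampling(temperature=0.3, top_p=0.9))
```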
Anthropic Claude Haiku 4 Subscription 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Anthropic Claude Opus 4.1 20250805 API key 7 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type = "enabled" or top_p ≠ null
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type = "enabled" or temperature ≠ null
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type = "enabled"
Reasoning · 3 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Thinking display
thinking.display
enum (summarized | omitted) "summarized" Controls whether Anthropic returns summarized or omitted thinking content.
Only when thinking.type = "enabled"
Anthropic Claude Opus 4 20250514 API key 7 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type = "enabled"
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type = "enabled"
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type = "enabled"
Reasoning · 3 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Thinking display
thinking.display
enum (summarized | omitted) "summarized" Controls whether Anthropic returns summarized or omitted thinking content.
Only when thinking.type = "enabled"
Anthropic Claude Opus 4.5 20251101 API key 8 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type = "enabled" or top_p ≠ null
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type = "enabled" or temperature ≠ null
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type = "enabled"
Reasoning · 4 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Thinking display
thinking.display
enum (summarized | omitted) "summarized" Controls whether Anthropic returns summarized or omitted thinking content.
Only when thinking.type = "enabled"
Effort
output_config.effort
enum (low | medium | high) "high" Controls Anthropic response thoroughness and token spend.
Anthropic Claude Opus 4.6 API key 8 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"enabled", "adaptive"} or top_p ≠ null
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"enabled", "adaptive"} or temperature ≠ null
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"enabled", "adaptive"}
Reasoning · 4 params
Thinking mode
thinking.type
enum (disabled | adaptive | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Thinking display
thinking.display
enum (summarized | omitted) "summarized" Controls whether Anthropic returns summarized or omitted thinking content.
Only when thinking.type ∈ {"adaptive", "enabled"}
Effort
output_config.effort
enum (low | medium | high | max) "high" Controls Anthropic response thoroughness and token spend.
Anthropic Claude Opus 4.7 API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 3 params
Thinking mode
thinking.type
enum (disabled | adaptive) "disabled" Controls the Anthropic thinking mode values supported by this model.
Thinking display
thinking.display
enum (summarized | omitted) "omitted" Controls whether Anthropic returns summarized or omitted thinking content.
Only when thinking.type = "adaptive"
Effort
output_config.effort
enum (low | medium | high | xhigh | max) "high" Controls Anthropic response thoroughness and token spend.
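Parameter names such as `thinking.type` and `output_config.effort` are written with dots. Assuming the dot denotes nesting in the JSON request body (an interpretation, not something the catalog states), a flat set of values can be expanded as sketched below. The helper name `nest` is hypothetical.

```python
def nest(flat):
    """Expand dotted keys such as "thinking.type" into nested dictionaries."""
    body = {}
    for dotted, value in flat.items():
        *parents, leaf = dotted.split(".")
        node = body
        for key in parents:
            # Create the intermediate object on first use, reuse it afterwards.
            node = node.setdefault(key, {})
        node[leaf] = value
    return body


print(nest({
    "max_tokens": 4096,
    "thinking.type": "adaptive",
    "thinking.display": "omitted",
    "output_config.effort": "high",
}))
```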
Anthropic Claude Opus 4 Subscription 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | adaptive | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Anthropic Claude Sonnet 4 20250514 API key 7 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type = "enabled"
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type = "enabled"
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type = "enabled"
Reasoning · 3 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Thinking display
thinking.display
enum (summarized | omitted) "summarized" Controls whether Anthropic returns summarized or omitted thinking content.
Only when thinking.type = "enabled"
Anthropic Claude Sonnet 4.5 API key 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | adaptive | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Anthropic Claude Sonnet 4.5 20250929 API key 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type = "enabled" or top_p ≠ null
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type = "enabled" or temperature ≠ null
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type = "enabled"
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Anthropic Claude Sonnet 4.6 API key 8 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"enabled", "adaptive"} or top_p ≠ null
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"enabled", "adaptive"} or temperature ≠ null
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"enabled", "adaptive"}
Reasoning · 4 params
Thinking mode
thinking.type
enum (disabled | adaptive | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
Thinking display
thinking.display
enum (summarized | omitted) "summarized" Controls whether Anthropic returns summarized or omitted thinking content.
Only when thinking.type ∈ {"adaptive", "enabled"}
Effort
output_config.effort
enum (low | medium | high | max) "high" Controls Anthropic response thoroughness and token spend.
Anthropic Claude Sonnet 4 Subscription 6 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 3 params
Temperature
temperature
number (0…1 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Not when thinking.type ∈ {"adaptive", "enabled"}
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
Not when thinking.type ∈ {"adaptive", "enabled"} or temperature ≠ 1
Top K
top_k
integer (0…+∞) 0 Limits token sampling to the top K most likely next tokens.
Not when thinking.type ∈ {"adaptive", "enabled"}
Reasoning · 2 params
Thinking mode
thinking.type
enum (disabled | adaptive | enabled) "disabled" Controls the Anthropic thinking mode values supported by this model.
Budget tokens
thinking.budget_tokens
integer (1024…+∞) 4096 Maximum token budget Anthropic may use for extended thinking before producing the final answer.
Only when thinking.type = "enabled"
DeepSeek DeepSeek Chat API key 4 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Reasoning · 1 param
Thinking mode
thinking.type
enum (disabled | enabled) "disabled" Controls whether DeepSeek uses thinking mode before producing the final answer.
DeepSeek DeepSeek Reasoner API key 5 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Reasoning · 2 params
Thinking mode
thinking.type
enum (enabled | disabled) "enabled" Controls whether DeepSeek uses thinking mode before producing the final answer.
Reasoning effort
reasoning_effort
enum (high | max) "high" Controls DeepSeek thinking effort when thinking mode is enabled.
Only when thinking.type = "enabled"
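The Default column, the enum types, and the conditions for this entry can be combined into a single resolution step. The sketch below copies the defaults and allowed values from the rows above. The names `DEFAULTS`, `ENUMS`, and `resolve` are hypothetical, and dropping parameters whose condition is unmet is an illustrative choice.

```python
# Defaults and allowed enum values as listed for DeepSeek Reasoner above.
DEFAULTS = {
    "max_tokens": 4096,
    "temperature": 1,
    "top_p": 1,
    "thinking.type": "enabled",
    "reasoning_effort": "high",
}
ENUMS = {
    "thinking.type": {"enabled", "disabled"},
    "reasoning_effort": {"high", "max"},
}


def resolve(overrides):
    """Merge overrides onto the defaults, validate enums, drop inapplicable parameters."""
    params = {**DEFAULTS, **overrides}
    for name, allowed in ENUMS.items():
        if params[name] not in allowed:
            raise ValueError(f"{name} must be one of {sorted(allowed)}")
    if params["thinking.type"] == "enabled":
        # temperature / top_p: 'Not when thinking.type = "enabled"'
        params.pop("temperature", None)
        params.pop("top_p", None)
    else:
        # reasoning_effort: 'Only when thinking.type = "enabled"'
        params.pop("reasoning_effort", None)
    return params


print(resolve({}))
print(resolve({"thinking.type": "disabled", "temperature": 0.2}))
```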
DeepSeek DeepSeek V4 Flash API key 5 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Reasoning · 2 params
Thinking mode
thinking.type
enum (enabled | disabled) "enabled" Controls whether DeepSeek uses thinking mode before producing the final answer.
Reasoning effort
reasoning_effort
enum (high | max) "high" Controls DeepSeek thinking effort when thinking mode is enabled.
Only when thinking.type = "enabled"
DeepSeek DeepSeek V4 Pro API key 5 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling. In DeepSeek thinking mode this parameter is accepted for compatibility but has no effect.
Not when thinking.type = "enabled"
Reasoning · 2 params
Thinking mode
thinking.type
enum (enabled | disabled) "enabled" Controls whether DeepSeek uses thinking mode before producing the final answer.
Reasoning effort
reasoning_effort
enum (high | max) "high" Controls DeepSeek thinking effort when thinking mode is enabled.
Only when thinking.type = "enabled"
OpenAI ChatGPT-4o Latest API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-3.5 Turbo API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4 Turbo API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4 Turbo 2024-04-09 API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4.1 API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4.1 mini API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4.1 nano API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4o API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4o 2024-11-20 API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-4o mini API key 3 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Sampling · 2 params
Temperature
temperature
number (0…2 step 0.1) 1 Controls randomness. Lower values make outputs more focused; higher values make them more varied.
Top P
top_p
number (0…1 step 0.01) 1 Controls nucleus sampling by limiting generation to tokens whose cumulative probability reaches this value.
OpenAI GPT-5 API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (minimal | low | medium | high) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5 Chat Latest API key 1 param
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
OpenAI GPT-5 mini API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (minimal | low | medium | high) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5 nano API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (minimal | low | medium | high) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5.1 API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (none | low | medium | high) "none" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5.1 Codex Max Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.1 Codex Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.2 API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (none | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5.2 Codex Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.2 Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.3 Codex Spark Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.3 Codex Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.4 API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (none | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5.4 mini API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (none | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5.4 mini Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.4 Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI GPT-5.5 API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (none | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI GPT-5.5 Subscription 3 params
Parameter Type Default Description Condition
Reasoning · 2 params
Reasoning effort
reasoning.effort
enum (minimal | low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
Reasoning summary
reasoning.summary
enum (auto | concise | detailed | none) "auto" Controls the level of reasoning summary returned with the response.
Output · 1 param
Verbosity
text.verbosity
enum (low | medium | high) "medium" Controls how concise or detailed the model's final text response should be.
OpenAI o1 API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI o1-mini API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (minimal | low | medium | high) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI o1-preview API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_tokens
integer (1…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (minimal | low | medium | high) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI o3 API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI o3-mini API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.
OpenAI o4-mini API key 2 params
Parameter Type Default Description Condition
Length · 1 param
Max tokens
max_completion_tokens
integer (16…+∞) 4096 Maximum number of output tokens the model may generate.
Reasoning · 1 param
Reasoning effort
reasoning_effort
enum (low | medium | high | xhigh) "medium" Controls how much reasoning the model should perform before producing an answer.

How to use

Building with an AI agent? Point it straight at /llms.txt, or copy this whole guide as Markdown and paste it in.

modelparams.dev is an open, community-maintained catalog of LLM model parameters. Each entry lists the knobs you can turn, with each parameter's type, default, range, and the conditions that gate it.
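
As a rough illustration of how a client might use the listed type, range, and default, the sketch below checks a value against a hand-written spec. The `ParamSpec` class and its field names are hypothetical and do not reflect the catalog's actual JSON layout; the values mirror the `temperature`, `reasoning_effort`, and `max_completion_tokens` rows in the OpenAI entries above. The listed step size is not checked.

```python
from dataclasses import dataclass
from typing import Optional, Sequence, Union

Value = Union[int, float, str]


@dataclass(frozen=True)
class ParamSpec:
    """Hypothetical in-memory shape for one catalog row (not the real schema)."""
    name: str
    kind: str                         # "integer", "number", or "enum"
    default: Value
    minimum: Optional[float] = None
    maximum: Optional[float] = None   # None stands for an unbounded (+∞) range
    choices: Sequence[str] = ()


def validate(spec: ParamSpec, value: Value) -> bool:
    """Return True if value fits the spec's type and range (step is ignored)."""
    if spec.kind == "enum":
        return value in spec.choices
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        return False
    if spec.kind == "integer" and not isinstance(value, int):
        return False
    if spec.minimum is not None and value < spec.minimum:
        return False
    if spec.maximum is not None and value > spec.maximum:
        return False
    return True


def resolve(spec: ParamSpec, value: Optional[Value] = None) -> Value:
    """Fall back to the listed default when no value is supplied."""
    if value is None:
        return spec.default
    if not validate(spec, value):
        raise ValueError(f"{value!r} is outside the listed range for {spec.name}")
    return value


# Values copied from the catalog rows above.
TEMPERATURE = ParamSpec("temperature", "number", default=1, minimum=0, maximum=2)
REASONING_EFFORT = ParamSpec(
    "reasoning_effort", "enum", default="medium",
    choices=("minimal", "low", "medium", "high"),
)
MAX_COMPLETION_TOKENS = ParamSpec(
    "max_completion_tokens", "integer", default=4096, minimum=16,
)
```

Calling `resolve(TEMPERATURE)` with no value returns the listed default, and `resolve(TEMPERATURE, 3)` raises `ValueError` because 3 falls outside 0…2.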

The same model accessed via an API key and via a subscription usually exposes a different set of parameters. We list both as separate entries so the data stays honest.
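
The two variants also name their parameters differently in the entries above: API-key entries use flat names such as `reasoning_effort`, while subscription entries use dotted names such as `reasoning.effort` and `text.verbosity`. The sketch below assumes, without confirmation from the catalog, that a dotted name is a path into a nested JSON request body. `build_payload` is a hypothetical helper, not part of any SDK.

```python
from typing import Any, Dict, Mapping


def build_payload(params: Mapping[str, Any]) -> Dict[str, Any]:
    """Expand dotted parameter names into nested dictionaries.

    Assumption: "reasoning.effort" means {"reasoning": {"effort": ...}}.
    Flat names such as "reasoning_effort" are copied through unchanged.
    """
    payload: Dict[str, Any] = {}
    for dotted_name, value in params.items():
        *parents, leaf = dotted_name.split(".")
        node = payload
        for key in parents:
            # Create the intermediate object on first use, then descend into it.
            node = node.setdefault(key, {})
        node[leaf] = value
    return payload


# Defaults taken from the API-key and subscription rows listed above.
api_key_body = build_payload({
    "max_completion_tokens": 4096,
    "reasoning_effort": "medium",
})
subscription_body = build_payload({
    "reasoning.effort": "medium",
    "reasoning.summary": "auto",
    "text.verbosity": "medium",
})
```

Under that assumption the subscription parameters collapse into two nested objects, `reasoning` and `text`, while the API-key parameters stay at the top level.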

Catalog API

The full catalog is static JSON, CORS-enabled, served from the edge.

curl https://modelparams.dev/api/v1/models.json

Each entry is keyed by provider/model for API-key variants; subscription variants append -subscription.

Single model

curl https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7.json
curl https://modelparams.dev/api/v1/models/anthropic/claude-opus-4-7-subscription.json
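
The key and URL pattern can be captured in a small helper. This is a minimal sketch that follows the two curl examples above; `entry_key` and `model_url` are hypothetical names, and the code only builds strings without making any request.

```python
from urllib.parse import quote

BASE_URL = "https://modelparams.dev/api/v1"


def entry_key(provider: str, model: str, subscription: bool = False) -> str:
    """Build the catalog key: provider/model, plus -subscription if needed."""
    suffix = "-subscription" if subscription else ""
    return f"{provider}/{model}{suffix}"


def model_url(provider: str, model: str, subscription: bool = False) -> str:
    """Build the single-model JSON URL for one entry."""
    key = entry_key(provider, model, subscription)
    # quote() leaves "/" intact by default, so the provider/model path survives.
    return f"{BASE_URL}/models/{quote(key)}.json"
```

For example, `model_url("anthropic", "claude-opus-4-7", subscription=True)` yields the second URL shown above.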

JSON Schema

Every entry validates against a JSON Schema you can use in your editor or pipeline.

curl https://modelparams.dev/api/v1/schema.json

Add this comment to the top of any YAML you author to get autocomplete in VS Code (this relies on a YAML language server extension being installed):

# yaml-language-server: $schema=https://modelparams.dev/api/v1/schema.json
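
If you author many files, a tiny script can add the comment for you. The helper below is a sketch, not part of the project's tooling; `with_schema_header` is a hypothetical name, and it works on text only so you can decide how to read and write files.

```python
SCHEMA_HEADER = (
    "# yaml-language-server: "
    "$schema=https://modelparams.dev/api/v1/schema.json"
)


def with_schema_header(yaml_text: str) -> str:
    """Prepend the schema comment unless the text already carries one."""
    for line in yaml_text.splitlines():
        if line.strip().startswith("# yaml-language-server:"):
            return yaml_text  # Already annotated; leave the text untouched.
    return f"{SCHEMA_HEADER}\n{yaml_text}"
```

Running the function twice on the same text gives the same result as running it once, so it is safe to apply to files that may already be annotated.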

Logos

Provider logos are available at /assets/logos/{provider}.svg where {provider} is the provider slug. They use currentColor so they inherit your text color.

curl https://modelparams.dev/assets/logos/anthropic.svg

Logos are sourced from the models.dev repo (MIT) and used under nominative fair use.

Contribute

The data lives in YAML under models/{provider}/{model}-{auth}.yaml in the GitHub repo. Open a PR; CI validates against the schema and rebuilds.

MIT licensed