Skip to content
  • Auto
  • Light
  • Dark
DiscordForumGitHubSign up
Self-hosting
View as Markdown
Copy Markdown

Open in Claude
Open in ChatGPT

Supported Models

Letta routinely runs automated scans against available providers and models. These are the results of the latest scan.

Ran 2512 tests against 157 models across 7 providers on June 27th, 2025

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
claude-3-5-haiku-20241022200,0002025-06-27
claude-3-5-sonnet-20240620200,0002025-06-27
claude-3-5-sonnet-20241022200,0002025-06-27
claude-3-7-sonnet-20250219200,0002025-06-27
claude-opus-4-20250514200,0002025-06-27
claude-sonnet-4-20250514200,0002025-06-27
claude-3-opus-20240229200,0002025-06-27
claude-3-haiku-20240307200,0002025-06-27
claude-3-sonnet-20240229200,0002025-06-27

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
gpt-4-turbo128,0002025-06-27
gpt-4-turbo-2024-04-09128,0002025-06-27
gpt-4.11,047,5762025-06-27
gpt-4.1-2025-04-141,047,5762025-06-27
gpt-4.1-mini1,047,5762025-06-27
gpt-4.1-mini-2025-04-141,047,5762025-06-27
gpt-4.1-nano1,047,5762025-06-27
gpt-4.1-nano-2025-04-141,047,5762025-06-27
gpt-4o128,0002025-06-27
gpt-4o-2024-05-13128,0002025-06-27
gpt-4o-2024-08-06128,0002025-06-27
gpt-4o-2024-11-20128,0002025-06-27
gpt-4o-mini128,0002025-06-27
gpt-4o-mini-2024-07-18128,0002025-06-27
gpt-4-06138,1922025-06-27
gpt-4-1106-preview128,0002025-06-27
gpt-4-turbo-preview128,0002025-06-27
gpt-4-0125-preview128,0002025-06-27
o1200,0002025-06-27
o1-2024-12-17200,0002025-06-27
o3200,0002025-06-27
o3-2025-04-16200,0002025-06-27
o4-mini30,0002025-06-27
o4-mini-2025-04-1630,0002025-06-27
gpt-48,1922025-06-27
o3-mini200,0002025-06-27
o3-mini-2025-01-31200,0002025-06-27
o3-pro30,0002025-06-27
o3-pro-2025-06-1030,0002025-06-27

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
gemini-1.5-pro2,000,0002025-06-27
gemini-1.5-pro-0022,000,0002025-06-27
gemini-1.5-pro-latest2,000,0002025-06-27
gemini-2.0-flash-thinking-exp1,048,5762025-06-27
gemini-2.5-flash-preview-04-171,048,5762025-06-27
gemini-2.5-pro1,048,5762025-06-27
gemini-2.5-pro-preview-03-251,048,5762025-06-27
gemini-2.5-pro-preview-05-061,048,5762025-06-27
gemini-2.5-flash1,048,5762025-06-27
gemini-2.0-flash-thinking-exp-12191,048,5762025-06-27
gemini-2.5-flash-preview-04-17-thinking1,048,5762025-06-27
gemini-2.5-flash-preview-05-201,048,5762025-06-27
gemini-2.5-pro-preview-06-051,048,5762025-06-27
gemini-2.0-flash-thinking-exp-01-211,048,5762025-06-27
gemini-2.5-flash-lite-preview-06-171,048,5762025-06-27
gemini-1.0-pro-vision-latest12,2882025-06-27
gemini-1.5-flash1,000,0002025-06-27
gemini-1.5-flash-0021,000,0002025-06-27
gemini-1.5-flash-8b1,000,0002025-06-27
gemini-1.5-flash-8b-0011,000,0002025-06-27
gemini-1.5-flash-8b-latest1,000,0002025-06-27
gemini-1.5-flash-latest1,000,0002025-06-27
gemini-2.0-flash1,048,5762025-06-27
gemini-2.0-flash-0011,048,5762025-06-27
gemini-2.0-flash-exp1,048,5762025-06-27
gemini-2.0-flash-exp-image-generation1,048,5762025-06-27
gemini-2.0-flash-lite1,048,5762025-06-27
gemini-2.0-flash-lite-0011,048,5762025-06-27
gemini-2.0-flash-lite-preview1,048,5762025-06-27
gemini-2.0-flash-lite-preview-02-051,048,5762025-06-27
gemini-2.0-flash-preview-image-generation32,7682025-06-27
gemini-2.0-pro-exp1,048,5762025-06-27
gemini-2.0-pro-exp-02-051,048,5762025-06-27
gemini-2.5-flash-preview-tts32,7682025-06-27
gemini-2.5-pro-preview-tts65,5362025-06-27
gemini-exp-12061,048,5762025-06-27
gemini-pro-vision12,2882025-06-27

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
arcee-ai/coder-large32,7682025-06-27
meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP81,048,5762025-06-27
Qwen/Qwen2.5-Coder-32B-Instruct32,7682025-06-27
meta-llama/Llama-3.3-70B-Instruct-Turbo131,0722025-06-27
meta-llama/Llama-3.3-70B-Instruct-Turbo-Free131,0722025-06-27
meta-llama/Meta-Llama-3.1-405B-Instruct-Turbo130,8152025-06-27
meta-llama/Meta-Llama-3.1-8B-Instruct-Turbo131,0722025-06-27
deepseek-ai/DeepSeek-V3131,0722025-06-27
meta-llama/Meta-Llama-3.1-70B-Instruct-Turbo131,0722025-06-27
Qwen/Qwen2.5-72B-Instruct-Turbo131,0722025-06-27
arcee-ai/virtuoso-large131,0722025-06-27
arcee-ai/virtuoso-medium-v2131,0722025-06-27
meta-llama/Llama-4-Scout-17B-16E-Instruct1,048,5762025-06-27
Qwen/Qwen3-235B-A22B-fp8-tput40,9602025-06-27
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF32,7682025-06-27
scb10x/scb10x-llama3-1-typhoon2-70b-instruct8,1922025-06-27
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO32,7682025-06-27
Qwen/QwQ-32B131,0722025-06-27
google/gemma-3n-E4B-it32,7682025-06-27
mistralai/Mistral-7B-Instruct-v0.232,7682025-06-27
perplexity-ai/r1-1776163,8402025-06-27
Qwen/Qwen2-72B-Instruct32,7682025-06-27
Qwen/Qwen2-VL-72B-Instruct32,7682025-06-27
Qwen/Qwen2.5-7B-Instruct-Turbo32,7682025-06-27
Qwen/Qwen2.5-VL-72B-Instruct32,7682025-06-27
arcee-ai/AFM-4.5B-Preview65,5362025-06-27
arcee-ai/arcee-blitz32,7682025-06-27
arcee-ai/caller32,7682025-06-27
arcee-ai/maestro-reasoning131,0722025-06-27
arcee_ai/arcee-spotlight131,0722025-06-27
deepseek-ai/DeepSeek-R1163,8402025-06-27
deepseek-ai/DeepSeek-R1-0528-tput163,8402025-06-27
deepseek-ai/DeepSeek-R1-Distill-Llama-70B131,0722025-06-27
deepseek-ai/DeepSeek-R1-Distill-Llama-70B-free8,1922025-06-27
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B131,0722025-06-27
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B131,0722025-06-27
deepseek-ai/DeepSeek-V3-p-dp131,0722025-06-27
google/gemma-2-27b-it8,1922025-06-27
lgai/exaone-3-5-32b-instruct32,7682025-06-27
lgai/exaone-deep-32b32,7682025-06-27
meta-llama/Llama-3-70b-chat-hf8,1922025-06-27
meta-llama/Llama-3-8b-chat-hf8,1922025-06-27
meta-llama/Llama-3.2-11B-Vision-Instruct-Turbo131,0722025-06-27
meta-llama/Llama-3.2-3B-Instruct-Turbo131,0722025-06-27
meta-llama/Llama-3.2-90B-Vision-Instruct-Turbo131,0722025-06-27
meta-llama/Llama-Vision-Free131,0722025-06-27
meta-llama/Meta-Llama-3-70B-Instruct-Turbo8,1922025-06-27
meta-llama/Meta-Llama-3-8B-Instruct-Lite8,1922025-06-27
mistralai/Mistral-7B-Instruct-v0.132,7682025-06-27
mistralai/Mistral-7B-Instruct-v0.332,7682025-06-27
mistralai/Mistral-Small-24B-Instruct-250132,7682025-06-27
mistralai/Mixtral-8x7B-Instruct-v0.132,7682025-06-27
scb10x/scb10x-typhoon-2-1-gemma3-12b131,0722025-06-27
togethercomputer/MoA-132,7682025-06-27
togethercomputer/MoA-1-Turbo32,7682025-06-27
togethercomputer/Refuel-Llm-V216,3842025-06-27
togethercomputer/Refuel-Llm-V2-Small8,1922025-06-27

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
deepseek-chat64,0002025-06-27
deepseek-reasoner64,0002025-06-27

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
allam-2-7b30,0002025-06-27
compound-beta30,0002025-06-27
compound-beta-mini30,0002025-06-27
deepseek-r1-distill-llama-70b30,0002025-06-27
distil-whisper-large-v3-en30,0002025-06-27
gemma2-9b-it30,0002025-06-27
llama-3.1-8b-instant30,0002025-06-27
llama-3.3-70b-versatile30,0002025-06-27
llama3-70b-819230,0002025-06-27
llama3-8b-819230,0002025-06-27
meta-llama/llama-4-maverick-17b-128e-instruct30,0002025-06-27
meta-llama/llama-4-scout-17b-16e-instruct30,0002025-06-27
meta-llama/llama-guard-4-12b30,0002025-06-27
meta-llama/llama-prompt-guard-2-22m30,0002025-06-27
meta-llama/llama-prompt-guard-2-86m30,0002025-06-27
mistral-saba-24b30,0002025-06-27
playai-tts30,0002025-06-27
playai-tts-arabic30,0002025-06-27
qwen-qwq-32b30,0002025-06-27
qwen/qwen3-32b30,0002025-06-27
whisper-large-v330,0002025-06-27
whisper-large-v3-turbo30,0002025-06-27

ModelBasicToken StreamingMultimodalContext WindowLast Scanned
letta-free8,1922025-06-27