| Meta | | | | |
Meta-Llama-3.3-70B-Instruct | Text | View- 4K (1,2,4,8,16,32)
- 8K (1,2,4,8)
- 16K (1,2,4)
- 32K (1,2,4)
- 64K (1)
- 128K (1)
| View- Endpoint: Chat completions
- Capabilities: Function calling, JSON mode
- Import checkpoint: Yes
- Optimizations: Speculative decoding
| Model card |
Meta-Llama-3.1-8B-Instruct | Text | View- 4K (1,2,4,8)
- 8K (1,2,4,8)
- 16K (1,2,4)
| View- Endpoint: Chat completions
- Capabilities: Function calling, JSON mode
- Import checkpoint: Yes
- Optimizations: None
| Model card |
Llama-4-Maverick-17B-128E-Instruct | Image, Text | View | View- Endpoint: Chat completions
- Capabilities: Function calling, JSON mode
- Import checkpoint: No
- Optimizations: None
| Model card |
| DeepSeek | | | | |
DeepSeek-R1-0528 | Reasoning, Text | View- 4K (4)
- 8K (1)
- 16K (1)
- 32K (1)
| View- Endpoint: Chat completions
- Capabilities: Function calling, JSON mode
- Import checkpoint: No
- Optimizations: None
| Model card |
DeepSeek-R1-Distill-Llama-70B | Reasoning, Text | View- 4K (1,2,4,8,16,32)
- 8K (1,2,4,8)
- 16K (1,2,4)
- 32K (1,2,4)
- 64K (1)
- 128K (1)
| View- Endpoint: Chat completions
- Capabilities: None
- Import checkpoint: Yes
- Optimizations: Speculative decoding
| Model card |
DeepSeek-V3-0324 | Text | View- 4K (4)
- 8K (1)
- 16K (1)
- 32K (1)
| View- Endpoint: Chat completions
- Capabilities: Function calling, JSON mode
- Import checkpoint: No
- Optimizations: None
| Model card |
DeepSeek-V3.1 | Reasoning, Text | View- 4K (4)
- 8K (1)
- 16K (1)
- 32K (1)
| View- Endpoint: Chat completions
- Capabilities: Function calling, JSON mode
- Import checkpoint: No
- Optimizations: None
| Model card |
| OpenAI | | | | |
Whisper-Large-v3 | Audio | View | View- Endpoint: Translation, Transcription
- Capabilities: None
- Import checkpoint: No
- Optimizations: None
| Model card |
| Qwen | | | | |
Qwen3-32B | Reasoning, Text | View | View- Endpoint: Chat completions
- Capabilities: None
- Import checkpoint: No
- Optimizations: None
| Model card |
| Tokyotech-llm | | | | |
Llama-3.3-Swallow-70B-Instruct-v0.4 | Text | View- 4K (1,2,4,8,16)
- 8K (1,2,4,8,16)
- 16K (1,2,4)
- 32K (1,2,4)
- 64K (1)
- 128K (1)
| View- Endpoint: Chat completions
- Capabilities: None
- Import checkpoint: No
- Optimizations: Speculative decoding
| Model card |
| Other | | | | |
E5-Mistral-7B-Instruct | Embedding | View | View- Endpoint: Embeddings
- Capabilities: None
- Import checkpoint: No
- Optimizations: None
| Model card |