DeepSeek-R1

Rank: 10

The team introduces its first-generation reasoning models, DeepSeek-R1-Zero and DeepSeek-R1. DeepSeek-R1-Zero, trained via large-scale reinforcement learning (RL) without supervised fine-tuning (SFT), shows remarkable reasoning ability but also has problems such as endless repetition. DeepSeek-R1, which incorporates cold-start data before RL, addresses these issues and achieves performance on par with OpenAI-o1. The team has open-sourced both models along with six distilled ones, with DeepSeek-R1-Distill-Qwen-32B outperforming OpenAI-o1-mini in benchmarks.

open source

Related Sites

Discover more sites in the same category

Gemini Pro 1.5

The Gemini family of models are the most general and capable AI models we've ever built. They're built from the ground up for multimodality, reasoning seamlessly across text, code, images, audio...

Opus by Anthropic

Today, we're announcing the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in ascending order of capability: Claude 3 Haiku, Claude 3 Sonnet, and Claude 3 Opus.

Llama 3

The open source AI model you can fine-tune, distill and deploy anywhere. Our latest models are available in 8B, 70B, and 405B variants.

Llama 3.2

The open source AI model you can fine-tune, distill and deploy anywhere. Our latest models are available in 8B, 70B, and 405B variants.

VASA-1 by Microsoft

Claude 3.5 Sonnet

Talk with Claude, an AI assistant from Anthropic
