DeepSeek-R1

Rank: 10

The team introduces its first - generation reasoning models, DeepSeek - R1 - Zero and DeepSeek - R1. DeepSeek - R1 - Zero, trained via large - scale RL without SFT, shows remarkable reasoning ability but also has problems like endless repetition. DeepSeek - R1, which incorporates cold - start data before RL, solves these issues and achieves performance on par with OpenAI - o1. The team has open - sourced these models and six distilled ones, with DeepSeek - R1 - Distill - Qwen - 32B outperforming OpenAI - o1 - mini in benchmarks.

open source

Visit Website

相关站点

发现更多相同类别的站点

Gemini Pro 1.5

The Gemini family of models are the most general and capable AI models we've ever built. They鈥檙e built from the ground up for multimodality 鈥 reasoning seamlessly across text, code, images, audio...

Opus by Anthropic

Today, we're announcing the Claude 3 model family, which sets new industry benchmarks across a wide range of cognitive tasks. The family includes three state-of-the-art models in ascending order of capability: Claude 3 Haiku, Claude 3 Sonnet, and Claude 3 Opus.

Llama 3

The open source AI model you can fine-tune, distill and deploy anywhere. Our latest models are available in 8B, 70B, and 405B variants.

Llama 3.2

The open source AI model you can fine-tune, distill and deploy anywhere. Our latest models are available in 8B, 70B, and 405B variants.

VASA-1 by Microsoft

Opens in a new tab

Claude 3.5 Sonnet

Talk with Claude, an AI assistant from Anthropic

发表评论

分享你的想法。带 * 的字段为必填项。