The team introduces its first - generation reasoning models, DeepSeek - R1 - Zero and DeepSeek - R1. DeepSeek - R1 - Zero, trained via large - scale RL without SFT, shows remarkable reasoning ability but also has problems like endless repetition. DeepSeek - R1, which incorporates cold - start data before RL, solves these issues and achieves performance on par with OpenAI - o1. The team has open - sourced these models and six distilled ones, with DeepSeek - R1 - Distill - Qwen - 32B outperforming OpenAI - o1 - mini in benchmarks.
Descubre más sitios en la misma categoría
Talk with Claude, an AI assistant from Anthropic
Claude 3.7:Anthropic 的最新 AI 里程碑,混合推理能力引领未来
The open source AI model you can fine-tune, distill and deploy anywhere. Our latest models are available in 8B, 70B, and 405B variants.
OpenAI developed a new series of AI models designed to spend more time thinking before they respond. Here is the latest news on o1 research, product and other updates.
Comparte tus pensamientos. Los campos marcados con * son obligatorios.