|
|
|
<br>DeepSeek open-sourced DeepSeek-R1, [wiki.whenparked.com](https://wiki.whenparked.com/User:HoustonConway) an LLM fine-tuned with reinforcement knowing (RL) to enhance thinking capability. DeepSeek-R1 attains results on par with OpenAI's o1 design on several criteria, including MATH-500 and [SWE-bench](http://h2kelim.com).<br> |