Back to All Resources

DeepSeek-R1

Open-source language model achieving SOTA performance in sub-2B parameter category, supporting 17 languages and optimized for real-world applications through efficient architecture choices like GQA and RoPE.