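DeepSeek-R1 achieves state-of-the-art performance among models under 2B parameters, with enhanced reasoning capabilities and support for 17 languages. For efficient inference, it uses two architectural techniques: Grouped Query Attention (GQA), in which groups of query heads share a single key/value head so the key/value cache is smaller, and Rotary Positional Embeddings (RoPE), which encode token position by rotating query and key vectors.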
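To make the two techniques named above concrete, here is a minimal pure-Python sketch of causal grouped-query attention with rotary embeddings applied to queries and keys. It is an illustration only, not the model's actual implementation: the function names (`rope`, `gqa_attention`), the nested-list tensor layout, and the RoPE base of 10000 are all assumptions chosen for readability.

```python
import math

def rope(vec, pos, base=10000.0):
    """Rotate consecutive pairs of `vec` by angles that grow with `pos`.

    Assumes len(vec) is even. `base` is an illustrative default.
    """
    d = len(vec)
    out = []
    for i in range(0, d, 2):
        theta = pos * base ** (-i / d)   # lower frequency for later pairs
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out += [x * c - y * s, x * s + y * c]
    return out

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def softmax(scores):
    m = max(scores)                      # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def gqa_attention(q, k, v, n_kv_heads):
    """Causal grouped-query attention (hypothetical helper).

    q: [n_q_heads][seq][d]; k, v: [n_kv_heads][seq][d].
    Each group of n_q_heads // n_kv_heads query heads reads the same K/V head.
    """
    n_q_heads = len(q)
    assert n_q_heads % n_kv_heads == 0, "query heads must split evenly into groups"
    group = n_q_heads // n_kv_heads
    d = len(q[0][0])
    out = []
    for h in range(n_q_heads):
        kv = h // group                  # index of the shared K/V head
        qh = [rope(x, t) for t, x in enumerate(q[h])]
        kh = [rope(x, t) for t, x in enumerate(k[kv])]
        head_out = []
        for t, qt in enumerate(qh):
            # Causal mask: position t attends only to positions 0..t.
            scores = [dot(qt, kh[s]) / math.sqrt(d) for s in range(t + 1)]
            w = softmax(scores)
            head_out.append(
                [sum(w[s] * v[kv][s][j] for s in range(t + 1)) for j in range(d)]
            )
        out.append(head_out)
    return out
```

In this sketch the key/value tensors hold `n_kv_heads` heads rather than `n_q_heads`, so a cache of past keys and values would be smaller by a factor of `n_q_heads // n_kv_heads`. Because RoPE rotates queries and keys by position-dependent angles, the attention score between two tokens depends only on their relative offset.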