Back to All Resources

Qwen 2.5 72B Instruct

It features SwiGLU activation, attention QKV bias, and group query attention. It is pretrained on extensive data with supervised finetuning and direct preference optimization.