Lenovo's Qitian WA7785aG3 Server Achieves New High in DeepSeek 671B Large Model Performance: 6708 Tokens/s Throughput
On March 18th, Lenovo announced a significant breakthrough achieved by its first AMD AI large model training server, the Lenovo Qitian WA7785aG3. When deploying the full-scale DeepSeek 671B parameter large language model on a single machine, the server achieved a remarkable peak throughput of 6708 tokens/s...