Wafer.ai runs GLM5.2 on AMD MI355X at 2626 tok/s for less than half Blackwell's cost
AI infrastructure firm Wafer.ai has benchmarked the GLM5.2 language model on AMD's MI355X accelerator, reaching a throughput of 2626 tokens per second per node. The result positions AMD's hardware as a cost-competitive alternative to Nvidia's Blackwell GPUs, reportedly at less than half the cost. The findings were published on Wafer.ai's official blog. The benchmark highlights growing competition in the AI accelerator market, with AMD increasingly challenging Nvidia's dominance in inference workloads.
This is an AI-generated summary. ShortSingh links to the original source for the complete article.