LongCat 2.0 Launches as Large-Scale MoE Model With 1.6T Total and 48B Active Parameters
LongCat has released version 2.0 of its AI language model, built on a Mixture-of-Experts (MoE) architecture. The model features a substantial 1.6 trillion total parameters while activating 48 billion parameters per inference. Details about the release were shared via the LongCat blog and picked up by the Hacker News community. MoE architectures are designed to improve efficiency by activating only a subset of parameters for each input, balancing scale with computational cost. The release positions LongCat 2.0 among a growing class of large-scale sparse models competing in the AI landscape.
This is an AI-generated summary. ShortSingh links to the original source for the complete article.
Discussion (0)
Log in to join the discussion and vote.
Log in