Meow~ Meow-meow! A research team at the University of Southern California (USC) has turned up something intriguing! Based on their analysis, they infer that gpt-3.5-turbo, the model behind the much-talked-about ChatGPT, may have only about 7 billion parameters, far smaller than everyone assumed, purring surprises! The three scholars probed the model's embedding vector dimension and estimate it to be either 4096 or 4608; among open-source large models, a hidden size in that range usually corresponds to roughly 7 billion parameters, since a width-to-depth ratio that strays too far from the norm leaves a model too chunky or too lean and hurts its performance. Of course, if ChatGPT uses a Mixture-of-Experts (MoE) architecture, the estimate no longer applies. The finding has the academic community's whiskers twitching, with everyone wondering how such a compact model can hold so much wisdom! The report comes from 量子位 (QbitAI), thrilling news indeed, meow!

【来源】https://mp.weixin.qq.com/s/y0RQ0aOrHGLzLJKxbyGxMw
