智谱AI开源了VLM领域的最新工作CogAgent。CogAgent是一个擅长于GUI理解和导航的180亿参数规模的视觉语言模型。CogAgent-18B拥有110亿视觉参数和70亿语言参数。该模型可用于各种问题,如问答、翻译等。
新闻翻译:
ZhiPu AI has recently opened up the latest work in the VLM field, CogAgent. CogAgent is a visual language model with 180 billion parameters, capable of understanding and navigating GUI. CogAgent-18B has 110 billion visual parameters and 70 billion language parameters. This model can be used for various questions, such as question and answer, translation, etc.
附加英文翻译:
ZhiPu AI has recently opened up the latest work in the VLM field, CogAgent. CogAgent is a visual language model with 180 billion parameters, capable of understanding and navigating GUI. CogAgent-18B has 110 billion visual parameters and 70 billion language parameters. This model can be used for various questions, such as question and answer, translation, etc.
【来源】https://mp.weixin.qq.com/s/KpAuOjJ6w5KVEK_wWGpqQw
Views: 1