近日,苹果公司的研究人员在机器学习领域取得了新的进展,他们推出了一种名为自回归视觉模型(AIM)的新技术。这一成果被详细阐述在一篇名为《Scalable Pre-training of Large Autoregressive Image Models》的论文中。AIM模型的推出,标志着苹果在图像处理和机器学习领域的研究又迈出了重要的一步。
自回归图像模型(AIM)的核心在于利用自回归目标来训练视觉变换器(ViT)模型,旨在探索这种训练方式是否能够在学习表征方面达到与语言模型(LLMs)相同的扩展能力。苹果的研究团队发现,通过适当的调整和优化,AIM模型能够轻松扩展到数十亿个参数,这一发现对于机器学习领域来说是一个重大的突破。
此外,AIM模型在利用大量未经整理的图像数据方面表现出了高效的能力。这意味着,AIM不仅能够处理结构化的数据,还能够从海量的、非结构化的图像数据中提取和学习有价值的信息。这一特性使得AIM在图像识别、分类以及生成任务中具有巨大的潜力。
苹果的这一研究不仅推动了机器学习技术的发展,也为未来的人工智能应用提供了新的思路和工具。随着技术的不断成熟和优化,AIM模型有望在图像处理、计算机视觉以及人工智能的多个领域发挥重要作用。
英文标题:Apple Introduces Scalable Autoregressive Image Model AIM
英文关键词:Autoregressive Model, Image Learning, Apple Research
英文新闻内容:
Apple’s research team has made a significant breakthrough in the field of machine learning with the introduction of a new technology called the Autoregressive Image Model (AIM). This innovative model is detailed in a paper titled “Scalable Pre-training of Large Autoregressive Image Models,” where the focus is on training Vision Transformers (ViT) using autoregressive objectives. The research explores whether this training approach can achieve the same scalability in learning representations as Language Models (LLMs). The team discovered that with proper adjustments, the AIM model can be scaled to billions of parameters, a remarkable achievement for the machine learning community. Furthermore, the AIM model demonstrates efficient utilization of vast amounts of unsorted image data, showcasing its potential in image recognition, classification, and generation tasks. Apple’s research not only advances the field of machine learning but also opens up new possibilities for future artificial intelligence applications. As the technology matures, the AIM model is expected to play a crucial role in various domains of image processing, computer vision, and AI.
【来源】https://www.jiqizhixin.com/articles/2024-01-18-7
Views: 1