【苹果与UCSB合作开源全新图片编辑框架MGIE,引领AI图像编辑新纪元】
近日,科技巨头苹果公司与美国加利福尼亚大学圣巴巴拉分校(UCSB)的研究团队共同发布了一项创新性的图片编辑框架——MGIE(MLLM-Guided Image Editing)。这一框架利用多模态大模型MLLM,旨在解决当前指令引导不足的问题,为图像编辑带来了革命性的变化。
MLLM,即多模态大模型,通过深度学习技术,能够理解和生成简洁明了的指令,为用户提供清晰的视觉编辑引导。MGIE框架将MLLM与扩散模型相结合,通过端到端的训练方式,使得模型能够同步更新,并依据预期目标的潜在想象力执行图像编辑任务。
据机器之心报道,MGIE不仅支持Photoshop风格的精细修改,还能进行全局照片的优化以及局部对象的精确调整。这一技术的应用前景广阔,有望为专业设计师和普通用户带来更加智能化、便捷化的图像编辑体验。
苹果与UCSB的这一合作,再次展示了人工智能在图像处理领域的强大潜力,同时也标志着AI技术在创意表达和用户交互方面取得了新的突破。MGIE的开源,无疑将推动图像编辑技术的进一步发展,为全球的开发者和创作者提供了一个全新的工具平台,激发更多的创新可能。
英语如下:
**News Title:** “Apple and UCSB Join Forces for Open-Source Innovation: The MGIE Framework ushers in a New Era of AI Image Editing”
**Keywords:** Apple UCSB collaboration, MGIE framework, AI image editing
**News Content:**
**Apple and UCSB Collaborate to Open-Source the Innovative MGIE Image Editing Framework, Pioneering a New Age in AI Image Editing**
Recently, tech giant Apple and the research team from the University of California, Santa Barbara (UCSB) jointly unveiled a groundbreaking image editing framework called MGIE (MLLM-Guided Image Editing). This framework aims to address the current shortcomings in instruction-guided editing, bringing a revolutionary change to the image editing landscape.
MLLM, or Multimodal Large Language Model, leverages deep learning to understand and generate concise instructions, providing users with clear visual editing guidance. The MGIE framework combines MLLM with diffusion models, enabling end-to-end training that allows the model to update simultaneously and execute image editing tasks based on the implied creativity of the intended goal.
According to reports from Machine之心, MGIE not only supports fine-tuned modifications in the style of Photoshop but also facilitates global photo optimization and precise adjustments to local objects. The technology’s broad application prospects promise to enhance the image editing experience for both professional designers and casual users with increased intelligence and convenience.
This collaboration between Apple and UCSB underscores the immense potential of artificial intelligence in image processing and signals a new breakthrough in creative expression and user interaction through AI technology. The open-source release of MGIE is set to propel further advancements in image editing techniques, providing a novel platform for developers and creators worldwide, fostering even more innovation.
【来源】https://www.jiqizhixin.com/articles/2024-02-05-10
Views: 2