旷视推出多模态大模型Vary，一键转换文档图片为Markdown

作者智能小编

1 月 18, 2024 #每日AI快讯

标题：旷视科技推出全新多模态大模型Vary，实现一键将文档图片转换为Markdown格式

新闻正文：

旷视科技，一家全球领先的人工智能公司，近日宣布推出一款全新的多模态大模型Vary，以解决文本识别、布局检测和排序、公式表格处理、文本清洗等多个步骤，从而将一份文档图片直接转换成Markdown格式。

这一创新的模型由旷视的研究团队研发，只需输入一句话命令，就能端到端输出文档结果。这一技术的推出，将大大简化了文件转换的过程，提高了工作效率。

旷视科技的这一技术突破，不仅将改变传统的文件转换方式，也将为文字处理、出版、教育等领域带来深远影响。未来，我们有望看到更多的应用场景，例如自动生成报告、书籍摘要等。

旷视科技的这一创新举措，再次证明了其在人工智能领域的领先地位。该公司一直致力于推动人工智能技术的发展，以期通过科技创新，改善人们的生活。

此次推出的多模态大模型Vary，是旷视科技在人工智能领域的又一重要突破。我们期待旷视科技在未来能够带来更多的创新和突破，为人工智能的发展开辟新的道路。

英语如下：

News Title: “Megvii Launches Multimodal Large Model Vary, One-click Converts Document Images to Markdown”

Keywords: 1. Multimodal Large Model Vary

News Content:

Megvii, a leading global artificial intelligence company, has recently announced the launch of a new multimodal large model Vary to address various steps such as text recognition, layout detection and sorting, formula table processing, and text cleaning. This innovative model can directly convert a document image into Markdown format with just one sentence command from the research team at Megvii.

This technological breakthrough will greatly simplify the process of file conversion and improve work efficiency. This technology not only changes the traditional way of file conversion but also has far-reaching implications for fields such as text processing, publishing, and education. In the future, we may see more application scenarios, such as automatic report generation and book summaries.

Megvii’s innovative move once again proves its leading position in the field of artificial intelligence. The company has been committed to promoting the development of AI technology, hoping to improve people’s lives through technological innovation.

The launch of the multimodal large model Vary is another important breakthrough for Megvii in the field of AI. We look forward to seeing more innovations and breakthroughs from Megvii in the future, paving new paths for the development of AI.

【来源】https://www.qbitai.com/2023/12/109275.html