Customize Consent Preferences

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ... 

Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

No cookies to display.

Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Advertisement cookies are used to provide visitors with customized advertisements based on the pages you visited previously and to analyze the effectiveness of the ad campaigns.

No cookies to display.

上海枫泾古镇正门_20240824上海枫泾古镇正门_20240824
0

Okay, here’s a draft of a news article based on the provided information, aiming for the style and depth you’ve outlined:

Title: OpenAI’s Long-Awaited Operator Arrives, But Trails Chinese Rival in Race for Autonomous Agents

Introduction:

For nearly a decade, the promise of a truly autonomous AI agent capable of navigating the digital world has been a tantalizing prospect. OpenAI, a pioneer in the field, finally unveiled its contender this week: Operator, a web-navigating agent designed to handle a range of tasks from online shopping to booking reservations. However, this highly anticipated launch comes with a twist. Not only did it arrive after a similar offering from Chinese AI firm Zhipu, but early demonstrations also revealed a less-than-seamless performance, raising questions about its readiness for widespread adoption.

Body:

The Arrival of Operator: A Long Time Coming

OpenAI’s Operator, launched in the early hours of today, marks the company’s first foray into a truly human-like web browsing agent. Available initially to U.S. Pro subscribers, with plans to expand to Plus, Team, and Enterprise users and eventually integrate into ChatGPT, Operator promises to automate repetitive browser-based tasks. Users can instruct it to fill out forms, order groceries, or even create memes. This capability represents a significant step toward AI agents that can interact with the web as humans do, moving beyond simple chatbots and into the realm of task execution.

Zhipu’s Head Start: A Code-Minded Approach

While OpenAI’s Operator is making headlines, it’s important to note that Chinese AI firm Zhipu had already launched a similar product, GLM-PC, back in November. This week, Zhipu further upgraded its agent with a new Deep Thinking mode, emphasizing its code-minded approach with enhanced logic reasoning and code generation capabilities. Zhipu’s proactive approach puts it ahead of OpenAI in the race for practical, autonomous agents.

Operator’s Performance: Promise and Pitfalls

Operator’s capabilities are impressive on paper. It can automatically handle complex tasks like booking travel, making restaurant reservations, and shopping online. Users can select from various categories, add instructions, and even take over control mid-task. It can run multiple tasks concurrently, save frequently used prompts, and even browse the web if a specific site isn’t provided. Crucially, Operator interacts with the browser by seeing (through screenshots) and interacting (using mouse and keyboard emulation), eliminating the need for custom API integrations. It also boasts self-correction capabilities when encountering errors.

However, the official demonstration of Operator revealed some significant challenges. While the initial part of the demo went smoothly, the latter half was plagued with errors, including failures to load web pages. This flip-over (翻车, a Chinese term for a performance failure) highlights the fact that Operator is still in a research preview phase and is very much a work in progress. OpenAI itself acknowledges that while it can handle various tasks, it is still in a learning and development phase.

The Technical Underpinnings:

Operator’s ability to interact with the web without custom APIs is a crucial technical achievement. It simulates human interaction by using screen captures and emulating mouse and keyboard inputs. This approach allows it to work across a vast array of websites and applications, a significant advantage over AI agents that rely on pre-defined APIs. The self-correction feature, which leverages reasoning capabilities, is also a key differentiator, allowing the agent to learn and adapt to unexpected situations.

Conclusion:

OpenAI’s Operator represents a significant milestone in the development of autonomous AI agents. Its ability to navigate and interact with the web like a human opens up a wide range of possibilities for automation and task management. However, its delayed launch and the performance issues revealed in the demonstration highlight the challenges of creating robust and reliable autonomous agents. The fact that Zhipu has already launched and upgraded a similar product also underscores the competitive landscape in this rapidly evolving field. As both companies continue to develop their offerings, the race for the most capable and reliable autonomous agent is far from over. The future of how we interact with the digital world is being shaped by these advancements, and it will be fascinating to see how these technologies evolve and impact our daily lives.

References:

  • InfoQ. (2025, January 27). OpenAI 想了快 10 年的超级智能体终于来了!有惊喜,但比智谱晚还“翻车”系数高? [OpenAI’s Super Intelligent Agent, 10 Years in the Making, Finally Arrives! Surprises, but Lags Behind Zhipu and Has a High Flip-Over Coefficient?]. Retrieved from [Insert Original URL Here]

Note: I have used a mix of direct reporting and analysis, and have tried to capture the nuance and potential implications of the news. I have also included a reference to the source material. The tone is professional and objective, as befitting a senior news publication. I have also used the term flip-over (翻车) to reflect the original article’s use of the term, and to provide a bit of cultural context.


>>> Read more <<<

Views: 0

0

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注