Title: Nearly Half of Popular News Websites Block OpenAI’s Crawler, Posing Challenges for AI News Aggregation

Keywords: News websites, AI restrictions, OpenAI crawler

According to IT Home, a recent Reuters Institute study found that, as of the end of 2023, nearly half (48%) of popular news websites across 10 major countries blocked OpenAI’s web crawlers. The figure points to the news industry’s caution about AI companies collecting its content.

The study also found that 24% of the news websites block Google’s AI crawler. Obstacles to automated collection of news content are therefore not limited to newer AI companies; they extend to tech giants such as Google. The restrictions may reflect concerns over copyright protection, data security, and user privacy, as well as publishers’ wish to keep control over how their content is distributed.
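The article does not say how sites implement these blocks. A minimal sketch follows, assuming the block is expressed through robots.txt rules that name each crawler's user-agent token. `GPTBot` and `Google-Extended` are the tokens OpenAI and Google publish for this purpose; the sample robots.txt, the `blocked_agents` helper, and the `news.example.com` URL are hypothetical and not taken from the study.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an illustrative news site (not from the study).
SAMPLE_ROBOTS_TXT = """\
User-agent: GPTBot
Disallow: /

User-agent: Google-Extended
Disallow: /

User-agent: *
Allow: /
"""


def blocked_agents(robots_txt: str, agents: list[str], url: str) -> dict[str, bool]:
    """Map each user-agent token to True if robots.txt disallows fetching url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return {agent: not parser.can_fetch(agent, url) for agent in agents}


result = blocked_agents(
    SAMPLE_ROBOTS_TXT,
    ["GPTBot", "Google-Extended", "Googlebot"],
    "https://news.example.com/world/story.html",
)
print(result)
# {'GPTBot': True, 'Google-Extended': True, 'Googlebot': False}
```

In this sample the two AI-related tokens are disallowed while the ordinary search crawler stays allowed, which is one way a site could restrict AI collection without dropping out of search results. robots.txt is advisory, so this only shows what a site requests, not what a crawler actually does.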

This trend poses a challenge for AI-driven news aggregation and analysis services, which rely on crawlers to acquire and process information. As more news sites impose restrictions, these services may need to find alternative data sources or establish direct partnerships with news organizations to ensure lawful, timely access to content.

The Reuters Institute’s research is a reminder that applying AI to journalism is not straightforward: it has to fit better with traditional media business models and the regulatory environment. Going forward, balancing technological innovation against content creators’ rights is a problem the industry will have to address together.

Source: https://www.ithome.com/0/752/306.htm
