Customize Consent Preferences

We use cookies to help you navigate efficiently and perform certain functions. You will find detailed information about all cookies under each consent category below.

The cookies that are categorized as "Necessary" are stored on your browser as they are essential for enabling the basic functionalities of the site. ... 

Always Active

Necessary cookies are required to enable the basic features of this site, such as providing secure log-in or adjusting your consent preferences. These cookies do not store any personally identifiable data.

No cookies to display.

Functional cookies help perform certain functionalities like sharing the content of the website on social media platforms, collecting feedback, and other third-party features.

No cookies to display.

Analytical cookies are used to understand how visitors interact with the website. These cookies help provide information on metrics such as the number of visitors, bounce rate, traffic source, etc.

No cookies to display.

Performance cookies are used to understand and analyze the key performance indexes of the website which helps in delivering a better user experience for the visitors.

No cookies to display.

Advertisement cookies are used to provide visitors with customized advertisements based on the pages you visited previously and to analyze the effectiveness of the ad campaigns.

No cookies to display.

90年代的黄河路
0

Beijing – DeepSeek, a rising force in the global AI community, has been the subject of intense scrutiny following the unexpected fervor surrounding its DeepSeek-R1 model. However, high-quality information about the company remains scarce. To address this, Li Guangmi, founder and CEO of Shixiang, organized a closed-door discussion on January 26, 2025, bringing together dozens of leading AI researchers, investors, and industry practitioners to explore DeepSeek’s technical details, organizational culture, and its short, medium, and long-term impact.

This discussion sought to unveil a corner of the mysterious Eastern force amidst limited information. It is important to note that this discussion was a non-governmental technical exchange and does not represent the views of any specific individual or institution. As renowned Silicon Valley venture capitalist Marc Andreessen commented on DeepSeek-R1: As open source, a profound gift to the world. Therefore, the participants in this discussion also learned from DeepSeek, and in the spirit of open source, made public the collective thinking of the closed-door meeting. The following is a summary of the key points of the discussion. This summary was compiled by the Shixiang team, with minor edits by the author.

The Mystery of DeepSeek

The core takeaway from the meeting is that DeepSeek’s driving force lies in its unwavering commitment to pushing the boundaries of artificial intelligence.

  • Focus on Intelligence, Not Just Commercialization: Participants emphasized that founder and CEO Liang Wenfeng is the core of DeepSeek, a technically proficient leader driven by the pursuit of intelligence rather than immediate commercial gains. He is not in the same mold as figures like Sam Altman, known for their business acumen.

  • Efficient Resource Utilization: DeepSeek has earned a strong reputation for being the first to release reproductions of MoE and o1, capitalizing on early timing. However, the potential for further advancement remains significant. The challenge lies in maximizing limited resources, focusing them on the most promising areas. The team’s research capabilities and culture are commendable, and with access to more resources, such as 100,000 to 200,000 GPUs, they could achieve even greater breakthroughs.

  • Rapid Advancements in Long Context Capabilities: DeepSeek has demonstrated impressive progress in long context capabilities between the preview and official release of its models. They have achieved a 10K long context using conventional methods.

  • Resource Speculation: While Scale.ai’s CEO suggested DeepSeek possesses 50,000 GPUs, the actual number is likely lower. Public information indicates that DeepSeek has approximately 10,000 older A100 GPUs and potentially 3,000 H800 GPUs acquired before export restrictions.

Conclusion

The closed-door discussion on DeepSeek highlights the company’s unique position in the AI landscape. Its focus on fundamental intelligence, efficient resource utilization, and rapid technical advancements have propelled it to the forefront of the field. While challenges remain, particularly in securing sufficient resources, DeepSeek’s commitment to open-source principles and its technically driven leadership suggest a promising future for the company and the broader AI community. The insights from this meeting underscore the importance of vision and dedication in driving innovation, even in the face of limited resources.

References

  • Zhang, X. (2024, January 29). 一场关于DeepSeek的高质量闭门会:比技术更重要的是愿景 [A High-Quality Closed-Door Meeting on DeepSeek: Vision is More Important Than Technology]. Tencent Technology. [Original article link]


>>> Read more <<<

Views: 0

0

发表回复

您的邮箱地址不会被公开。 必填项已用 * 标注