Tsinghua’s LongCite Model Boosts LLMs’ Precision Cuts Hallucinations

In the realm of artificial intelligence, precise and verifiable responses from language models are becoming increasingly crucial. Tsinghua University has recently introduced LongCite, an open-source model designed to enhance the reliability and credibility of large language models (LLMs) in long-text question-answering tasks. This innovative project has the potential to revolutionize how we interact with AI by ensuring that the information provided is both accurate and traceable.

What is LongCite?

LongCite is a project initiated by Tsinghua University aimed at improving the trustworthiness and verifiability of LLMs when dealing with long texts. The project comprises several core components, including the LongBench-Cite evaluation benchmark, the Coarse to Fine (CoF) automated data construction process, the LongCite-45k dataset, and the LongCite-8B and LongCite-9B models trained on this dataset. These models are capable of comprehending extensive text content and providing accurate answers, complete with direct text citations for enhanced transparency and reliability.

Key Features of LongCite

Fine-Grained Citations

LongCite enables language models to generate precise sentence-level citations when answering long-text questions. This allows users to directly trace back to specific information in the original text.

Increased Fidelity

The model ensures that the answers provided by LLMs are more faithful to the original content, reducing the occurrence of hallucinations — information generated by the model that does not correspond to the original text.

Enhanced Verifiability

Users can verify the authenticity and accuracy of the answers by relying on the fine-grained citations provided by the model, thereby increasing the credibility of the output.

Automated Data Construction

LongCite employs the CoF process to automate the generation of high-quality long-text question-answering data with fine-grained citations, providing a rich annotated resource for model training.

Evaluation Benchmark

The LongBench-Cite benchmark is introduced to measure the model’s ability to generate citations in long-text question-answering tasks, assessing both correctness and citation quality.

Technical Principles of LongCite

Long Text Processing Capability

LongCite supports large language models with ultra-long context windows (such as GLM-4-9B-1M, Gemini 1.5) capable of processing and understanding texts of tens of thousands of characters.

Fine-Grained Citation Generation

The model is trained to generate precise sentence-level citations, ensuring that each answer can be traced back to a specific sentence in the original text, enhancing the verifiability of the responses.

Automated Data Construction Process (CoF)

The Self-Instruct method is used to automatically generate question-answer pairs from long texts. Relevant sentence blocks are retrieved from the text and block-level citations are generated. Based on these block-level citations, specific sentences supporting each statement are extracted to create sentence-level citations.

Supervised Fine-Tuning (SFT)

The high-quality dataset with fine-grained citations generated by the CoF process is used to fine-tune large language models, improving their performance in long-text question-answering tasks.

Project Addresses

GitHub Repository: THUDM/LongCite
HuggingFace Model Library: THUDM
arXiv Technical Paper: LongCite Paper

Application Scenarios

LongCite has a wide range of applications across various fields:

Academic Research: Researchers and scholars use LongCite to access extensive literature and obtain detailed answers with citations, supporting their research work.
Legal Consultation: Legal professionals use LongCite to analyze legal documents and obtain specific citations of laws or cases, aiding in legal analysis and case research.
Financial Analysis: Financial analysts and investors utilize LongCite to understand complex financial reports and market research, obtaining accurate citations of key data and trends.
Medical Consultation: Medical professionals rely on LongCite to access medical literature and obtain citations for diagnostic and treatment recommendations based on the latest research.
News Reporting: Journalists and news organizations use LongCite to verify information in their reports, ensuring the accuracy of published news content and providing reliable source citations.

With its focus on enhancing the precision and verifiability of LLMs, LongCite represents a significant step forward in the evolution of artificial intelligence. By providing users with the ability to trace and verify the information provided by AI, Tsinghua University’s innovative project is poised to impact a wide range of industries and applications.

>>> Read more <<<

一	二	三	四	五	六	日
					1	2
3	4	5	6	7	8	9
10	11	12	13	14	15	16
17	18	19	20	21	22	23
24	25	26	27	28	29	30
31

Tsinghua’s LongCite Model Boosts LLMs’ Precision Cuts Hallucinations

作者智能小编

What is LongCite?

Key Features of LongCite

Fine-Grained Citations

Increased Fidelity

Enhanced Verifiability

Automated Data Construction

Evaluation Benchmark

Technical Principles of LongCite

Long Text Processing Capability

Fine-Grained Citation Generation

Automated Data Construction Process (CoF)

Supervised Fine-Tuning (SFT)

Project Addresses

Application Scenarios

相关文章

AI模型评测新纪元：Vertex AI领航

360’s Light-R1 AI Outperforms DeepSeek-R1 in Math Open-Sourced for $1000!

360智脑开源Light-R1，千元成本超越DeepSeek！

发表回复取消回复

为您推荐

AI模型评测新纪元：Vertex AI领航

360’s Light-R1 AI Outperforms DeepSeek-R1 in Math Open-Sourced for $1000!

360智脑开源Light-R1，千元成本超越DeepSeek！

大模型应用开发：人人可掌握的技术核心竞争力！

作者智能小编

What is LongCite?

Key Features of LongCite

Fine-Grained Citations

Increased Fidelity

Enhanced Verifiability

Automated Data Construction

Evaluation Benchmark

Technical Principles of LongCite

Long Text Processing Capability

Fine-Grained Citation Generation

Automated Data Construction Process (CoF)

Supervised Fine-Tuning (SFT)

Project Addresses

Application Scenarios

相关文章

发表回复 取消回复

为您推荐

发表回复取消回复