To stop chatbots from "spreading rumors", Google DeepMind and Stanford University researchers launch an AI fact-checking tool

No matter how powerful today's AI chatbots are, they share one frequently criticized behavior: giving users answers that look convincing but are inconsistent with the facts. Put simply, AI sometimes "talks nonsense" or even "spreads rumors" in its answers.

Preventing large AI models from behaving this way is a difficult technical challenge. However, according to MarkTechPost, researchers at Google DeepMind and Stanford University appear to have found a workaround.

The researchers have introduced a tool built on a large language model: the Search-Augmented Factuality Evaluator (SAFE). The results of the study, along with the experimental code and dataset, have been published.

To verify accuracy and truthfulness, the system analyzes, processes, and evaluates each chatbot response in four steps: it splits the answer into individual facts, revises each fact so that it can be understood on its own, checks whether each fact is relevant to the original question, and compares the relevant facts with Google Search results.
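The four steps can be illustrated with a minimal Python sketch. This is not the researchers' published code: in SAFE each step is carried out by a large language model and a real search engine, whereas every function name here (`split_into_facts`, `revise_fact`, `is_relevant`, `rate_with_search`, `toy_search`) is a hypothetical stand-in that uses simple string rules and a two-sentence toy corpus, purely to show how the steps fit together.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class FactVerdict:
    fact: str
    label: str  # "supported", "not_supported", or "irrelevant"


def split_into_facts(response: str) -> List[str]:
    """Step 1 (stand-in): treat each sentence as one individual fact."""
    return [s.strip() for s in response.split(".") if s.strip()]


def revise_fact(fact: str, subject: str) -> str:
    """Step 2 (stand-in): make a fact self-contained by replacing a leading pronoun."""
    for pronoun in ("It ", "He ", "She ", "They "):
        if fact.startswith(pronoun):
            return subject + " " + fact[len(pronoun):]
    return fact


def is_relevant(fact: str, subject: str) -> bool:
    """Step 3 (stand-in): a fact counts as relevant if it mentions the subject."""
    return subject.lower() in fact.lower()


def rate_with_search(fact: str, search: Callable[[str], List[str]]) -> str:
    """Step 4 (stand-in): a fact is supported if some search snippet contains it."""
    snippets = search(fact)
    supported = any(fact.lower() in s.lower() for s in snippets)
    return "supported" if supported else "not_supported"


def evaluate_response(
    response: str, subject: str, search: Callable[[str], List[str]]
) -> List[FactVerdict]:
    """Run the four steps over one response and return a verdict per fact."""
    verdicts = []
    for raw in split_into_facts(response):
        fact = revise_fact(raw, subject)
        if not is_relevant(fact, subject):
            verdicts.append(FactVerdict(fact, "irrelevant"))
            continue
        verdicts.append(FactVerdict(fact, rate_with_search(fact, search)))
    return verdicts


def toy_search(query: str) -> List[str]:
    """Hypothetical search backend: returns a fixed toy corpus for any query."""
    return [
        "The Eiffel Tower is in Paris, France.",
        "The Eiffel Tower opened in 1889.",
    ]


verdicts = evaluate_response(
    "The Eiffel Tower is in Paris. It opened in 1920. Bananas are yellow",
    subject="The Eiffel Tower",
    search=toy_search,
)
for v in verdicts:
    print(f"{v.label:14} {v.fact}")
```

In this toy run the first fact is rated supported, the second (with the wrong year) is rated not supported, and the sentence about bananas is set aside as irrelevant to the question. Passing the search backend in as a function keeps the pipeline independent of any particular search service.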

To evaluate its performance, the researchers created a dataset called LongFact, checked SAFE against about 16,000 individual facts, and used it to benchmark 13 large language models from four families: Claude, Gemini, GPT, and PaLM-2. In a focused analysis of 100 facts on which SAFE and human annotators disagreed, further review found SAFE's judgment to be correct 76% of the time. The framework also has an economic advantage: it is more than 20 times cheaper than manual annotation.
