Single card day processing 200,000 pages of documents, DeepSeek-OCR open-source on-line

Single card day processing 200,000 pages of documents, DeepSeek-OCR open-source

On October 21st, according to Al Quilqi's reportDeepSeek The team has recently released a new studyOCR, proposes a "text-based optical compression" approach, which provides groundbreaking thinking for long text processing for large models。

Single card day processing 200,000 pages of documents, DeepSeek-OCR open-source

research shows that by rendering long text into images and then turning to visual token, it is possible to significantly reduce the calculation costs while maintaining high accuracy。

Experimental data show that the OCR decoded accuracy rate was as high as 971 TP3T at a rate of less than 10 times; even at a rate of 20 times higher, the accuracy rate remained at about 601 TP3T. On the authoritative document parsing baseline OmniDocBench, the model goes beyond several mainstream SOTA models with less visual token。

IN PRACTICAL APPLICATIONS, SINGLE-DESK A100-40G GPU CAN PROCESS OVER 200,000 PAGES OF DOCUMENTS PER DAY, PROVIDING BIG DATA SUPPORT FOR LARGE MODEL TRAINING。

Currently, the relevant code and model weights are on the GitHub and Hugging Face platformOpen Source.

💻 GitHub: https://github.com/deepseek-ai/DeepSeek-OCR

Hugging Face: https://huggingface.co/deepseek-ai/DeepSeek-OCR

statement:The content of the source of public various media platforms, if the inclusion of the content violates your rights and interests, please contact the mailbox, this site will be the first time to deal with.

{{userData.name}}Verify

Single card day processing 200,000 pages of documents, DeepSeek-OCR open-source

Touching near human levels, Sharpa Robotics launched a new biomimicator

OpenAI's Web Browser: ChatGPT Atlas is officially released to enable AI to "connect you online"

AI Weibo

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai tiktok

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

1ai WeChat

Five minutes a day

Become a master in one year

Scan the QR code to follow

{{userData.name}}Verify

Related content:

Touching near human levels, Sharpa Robotics launched a new biomimicator

OpenAI's Web Browser: ChatGPT Atlas is officially released to enable AI to "connect you online"

DeepSeek's Late-Night Amplification: 7B Parameters for Everyone's Visual Multimodal Model Janus-Pro-7B Open Source

Mistral embraces open source: teases new AI model that will outperform DeepSeek

If after 10 years, DeepSeek, how to keep China behind America, the answer is open source

The DeepSeek-V3.2-Exp model is officially published and is open, and API has significantly reduced prices

AI Applications

5000+ AI applications! Updated daily

1AICLUB

Highly recommended! Official brand Weibo

AI Tutorials

Tons of tutorials to read

AI Basic Training Camp

Zero-based entry, leading you to become an AI expert

1ai master

TikTok account: 1ai.net

1ai master

TikTok account: 1ai.net

Five minutes a day

Become a master in one year

Scan the QR code to follow