DeepSeek on Monday released a new multimodal artificial intelligence model that can handle large and complex documents with significantly fewer tokens – the smallest units of text that a model processes – by using visual perception as a compression medium for information.
The open-source DeepSeek-OCR (optical character recognition) model, available via the online developer platforms Hugging Face and GitHub, grew out of an “investigation into the role of vision encoders” in compressing text for large language models (LLMs), the Hangzhou-based AI start-up said in a blog post.
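For readers who want to experiment, the model can be pulled from Hugging Face using the library’s standard custom-code loading pattern. The sketch below assumes that pattern: the repository id is the one DeepSeek published, but the infer() call and prompt string are assumptions about the model’s custom API, so the model card should be treated as the authority.

```python
# Minimal loading sketch for DeepSeek-OCR via Hugging Face transformers.
# trust_remote_code=True is required because the model ships its own
# modelling code. The infer() call and prompt below are assumptions
# about the custom API - consult the model card for the real signature.
import torch
from transformers import AutoModel, AutoTokenizer

repo = "deepseek-ai/DeepSeek-OCR"
tokenizer = AutoTokenizer.from_pretrained(repo, trust_remote_code=True)
model = AutoModel.from_pretrained(repo, trust_remote_code=True)
model = model.eval().cuda().to(torch.bfloat16)

# Hypothetical usage: transcribe a single document page.
result = model.infer(
    tokenizer,
    prompt="<image>\nFree OCR.",  # prompt format assumed, not verified
    image_file="page.png",        # path to a rendered document page
)
print(result)
```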
That approach would let LLMs process massive amounts of text without incurring a proportional increase in computing cost.
“Through DeepSeek-OCR, we demonstrated that vision-text compression can achieve significant token reduction – seven to 20 times – for different historical context stages, offering a promising direction” to address long-context challenges in LLMs, the company said.
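As a back-of-the-envelope illustration of that range – only the seven-to-20-times ratio comes from DeepSeek, while the document length here is hypothetical – a document worth 10,000 text tokens would shrink to somewhere between 500 and roughly 1,400 vision tokens:

```python
# Rough arithmetic on DeepSeek's reported 7x-20x vision-text compression.
# The 10,000-token document is a hypothetical example.
text_tokens = 10_000

for ratio in (7, 20):
    vision_tokens = text_tokens / ratio
    print(f"{ratio}x compression: ~{vision_tokens:,.0f} vision tokens "
          f"instead of {text_tokens:,} text tokens")
```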
The release reflected DeepSeek’s steadfast efforts to raise the efficiency of AI models while driving down the costs of building and using them – a principle the company followed in developing its breakthrough open-source models V3 and R1, released in December and January, respectively.
According to the company’s blog post, DeepSeek-OCR consists of two main components: DeepEncoder, a vision encoder that compresses document images into a small number of vision tokens, and DeepSeek3B-MoE-A570M, a mixture-of-experts decoder that reconstructs the text from those tokens.
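In outline, a rendered page goes through the encoder to produce a short sequence of vision tokens, which the decoder then expands back into text. The skeleton below is purely illustrative – the function names, token counts and return values are placeholders, not DeepSeek’s code:

```python
# Illustrative two-stage skeleton of the encoder -> decoder pipeline
# described in the blog post. Every name and value is a placeholder.

def deep_encoder(page_image: bytes) -> list[str]:
    """Stand-in for DeepEncoder: compress a rendered document page
    into a short sequence of vision tokens."""
    return ["<vision_token>"] * 100  # e.g. ~100 tokens for a full page

def moe_decoder(vision_tokens: list[str]) -> str:
    """Stand-in for DeepSeek3B-MoE-A570M: autoregressively reconstruct
    the page's text from the compressed vision tokens."""
    return "recovered page text ..."

page_image = b""  # a real rendered page image would go here
print(moe_decoder(deep_encoder(page_image)))
```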