AI News

DeepSeek Launches Its Debut Multi-modal LLM

DeepSeek AI Unveils Janus With 1.3 Billion Parameters

Hangzhou, China – 18th October 2024 – DeepSeek, a Chinese startup focused on artificial general intelligence (AGI), launched Janus, its novel autoregressive framework designed for multimodal understanding and generation tasks. Janus stands out by addressing limitations in earlier models by decoupling visual encoding into distinct pathways.

DeepSeek’s first multi-modal LLM will be available on Hugging Face, announced by Philipp Schmid, the Tech Lead and LLMs at HuggingFace, with a post on X:

While the visual encoding is separated for each task, Janus utilizes a single, unified transformer architecture for processing. By decoupling the visual encoding pathways, Janus resolves conflicts that visual encoders face: handling both understanding and generation tasks.

The launch of Janus represents the ability to integrate MLLMs seamlessly across various tasks, a significant improvement over its predecessors. It also leads to enhanced flexibility without sacrificing performance.

However, Janus not only surpasses previous models but also exceeds the performances of task-specific models. It improves the handling of multimodal inputs compared to older frameworks, making Janus a frontrunner among the next generation of unified multimodal models.

Janus is built on DeepSeek’s LLM-1.3b-base and is trained on approximately 500B text tokens. It also leverages SigLIP-L as the vision encoder, supporting image input resolutions of 384 x 384. This makes Janus a strong contender for potentially driving innovations in AI-powered content creation, multimedia analysis and more.

DeepSeek’s Janus positions itself as a leading solution in the evolving multimodal LLM landscape by offering decoupled visual pathways but retaining a unified transformer framework. Its flexibility without compromising performance will make it a popular tool among Hugging Face’s community and other AI developers.

What is your reaction?

Excited
0
Happy
0
In Love
0
Not Sure
0
Silly
0
Aman Dasgupta
Aman is an experienced content marketer and strategist with expertise in technology, finance and marketing. With an engineering background, he aims to simplify the latest news and trends in technology for digital audiences. Having worked with leading tech businesses in AI/ML, data science, AR/VR and Web 3.0, Aman helps decision-makers stay up-to-date and informed on everything technology.
You may also like

Leave a reply

Your email address will not be published. Required fields are marked *

More in:AI News