Large Language Models Parameters Icons

Baidu to open-source its Ernie large language model series

Baidu Inc. intends to open-source its Ernie series of large language models later ... had started developing in 2019. The model featured 10 billion parameters and was trained on a 4-terabyte ...

4d on MSN

Mistral's CEO Arthur Mensch tells BI that DeepSeek is a win for the open-source ecosystem

Arthur Mensch told Business Insider that DeepSeek was the "Mistral of China," with its new R1 models a "great moment for open ...

What is Mistral’s Le Chat?

Like its competitors, Mistral’s Le Chat can perform a variety of generative functions, from uploading and analyzing documents, to planning and tracking projects, to generating text and images. It can ...

Micron Redefines Performance for AI PCs, Gamers and Professionals

The Micron 4600 SSD showcases sequential read speeds of 14.5 GB/s and write speeds of 12.0 GB/s. These capabilities allow users to load a large language model (LLM) from the SSD to DRAM in less than ...

20d

DeepSeek is driving demand for Nvidia's H200 chips, some cloud firms say

Cloud providers report a significant increase in demand for Nvidia H200 chips as DeepSeek's AI models gain traction.

Impacts 7d

Why Current LLMs Struggle to Integrate with Complex Data Lakes in Multi-agent Systems

Despite the latest AI advancements, Large Language Models (LLMs) continue to face challenges in their integration into the ...

Diginomica 8d

The ZohoDay AI review - inside Zoho's provocative AI views, and its agentic business model

DeepSeek might have disrupted plenty of AI vendors, but Zoho wasn't one of them. If anything, DeepSeek's cost breakthroughs ...

VentureBeat 28d

No retraining needed: Sakana’s new AI model changes how machines learn

This is the latest in a series of techniques that aim to improve the abilities of large language models ... during which the model is exposed to new examples and its parameters are adjusted.

Hackaday 24d

New Open Source DeepSeek V3 Language Model Making Waves

In the world of large language models ... effort required by competing models, while performing significantly better. The full training of DeepSeek-V3’s 671B parameters is claimed to have ...
