Databricks launches DBRX, a 132-billion-parameter large language model billed as "the most powerful open source AI at this stage"

Databricks recently released DBRX, a general-purpose large language model billed as "the most powerful open source AI at present" and said to surpass "all other open source models."

According to the official press release, DBRX is a Transformer-based large language model that uses the MoE (Mixture of Experts) architecture. It has 132 billion parameters and was pre-trained on 12 trillion tokens of data.
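To make the Mixture of Experts idea concrete, here is a minimal Python sketch of top-k expert routing. It is not DBRX's implementation: the expert count, the number of active experts, the router weights, and the toy experts are all hypothetical values chosen for illustration.

```python
import math
import random

def softmax(scores):
    """Turn raw scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def make_expert(scale):
    """Toy expert that scales its input; a real expert is a feed-forward network."""
    return lambda x: [scale * v for v in x]

def moe_forward(x, router, experts, top_k):
    """Route one token vector x through its top_k highest-scoring experts."""
    # One router score per expert: dot product of the router row with x.
    logits = [sum(w * v for w, v in zip(row, x)) for row in router]
    # Keep only the top_k experts; the others are never evaluated.
    chosen = sorted(range(len(experts)), key=lambda i: logits[i], reverse=True)[:top_k]
    # Normalize the scores of the chosen experts into mixing weights.
    gates = softmax([logits[i] for i in chosen])
    out = [0.0] * len(x)
    for gate, idx in zip(gates, chosen):
        expert_out = experts[idx](x)
        out = [o + gate * y for o, y in zip(out, expert_out)]
    return out, chosen, gates

random.seed(0)
NUM_EXPERTS, TOP_K, DIM = 8, 2, 4  # hypothetical sizes, not taken from the article
router = [[random.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]
experts = [make_expert(i + 1) for i in range(NUM_EXPERTS)]
token = [0.5, -1.0, 0.25, 2.0]

output, chosen, gates = moe_forward(token, router, experts, TOP_K)
print("experts used:", chosen)
print("gate weights:", gates)
```

Only the selected experts run for a given token, which is why a MoE model can have a large total parameter count while using only part of it per token.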

The researchers tested the model and found that it outperformed existing open source models such as LLaMA2-70B, Mixtral, and Grok-1 on benchmarks for language understanding (MMLU), programming (HumanEval), and mathematics (GSM8K). Databricks also said that DBRX surpassed OpenAI's GPT-3.5 on all three benchmarks.

Naveen Rao, vice president of generative AI at Databricks, told TechCrunch that the company spent two months and $10 million training DBRX. Although DBRX performs well overall, the model currently requires four NVIDIA H100 GPUs to run, so there is still room for optimization.
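A rough back-of-the-envelope calculation shows why a model of this size needs several GPUs. The sketch below rests on two assumptions that are not in the article: weights stored in 16 bits (2 bytes per parameter) and 80 GB of memory per H100. It counts the weights only and ignores activations and other runtime overhead.

```python
import math

PARAMS = 132e9        # total parameters, from the article
BYTES_PER_PARAM = 2   # assumption: 16-bit weights
GPU_MEMORY_GB = 80    # assumption: 80 GB per H100

# Memory needed just to hold the weights, in decimal gigabytes.
weights_gb = PARAMS * BYTES_PER_PARAM / 1e9

# Smallest number of GPUs whose combined memory fits the weights.
min_gpus = math.ceil(weights_gb / GPU_MEMORY_GB)

print(f"weights: {weights_gb:.0f} GB, GPUs needed: {min_gpus}")
```

Under these assumptions the weights alone take about 264 GB, which does not fit on three 80 GB cards and is consistent with the four-GPU figure reported above.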
