{"id":16316,"date":"2024-07-24T08:45:11","date_gmt":"2024-07-24T00:45:11","guid":{"rendered":"https:\/\/www.1ai.net\/?p=16316"},"modified":"2024-07-24T08:45:11","modified_gmt":"2024-07-24T00:45:11","slug":"%e9%9c%87%e6%83%8aai%e7%95%8c%ef%bc%81llama-3-1%e6%b3%84%e9%9c%b2%ef%bc%9a4050%e4%ba%bf%e5%8f%82%e6%95%b0%e7%9a%84%e5%bc%80%e6%ba%90%e5%b7%a8%e5%85%bd%e6%9d%a5%e8%a2%ad%ef%bc%81","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/16316.html","title":{"rendered":"Shocking the AI world! Llama 3.1 leaked: an open source behemoth with 405 billion parameters is coming!"},"content":{"rendered":"<p data-pm-slice=\"0 0 []\"><a href=\"https:\/\/www.1ai.net\/en\/tag\/llama3-1\" title=\"[See articles with [Llama3.1] labels]\" target=\"_blank\" >Llama3.1<\/a>Leaked! You heard it right, this one has 405 billion parameters<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90%e6%a8%a1%e5%9e%8b\" title=\"[See articles with [open source model] labels]\" target=\"_blank\" >Open Source Model<\/a>, which has caused a stir on Reddit. This may be the closest thing to GPT-4o yet.<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90\" title=\"[View articles tagged with [open source]]\" target=\"_blank\" >Open Source<\/a>model, and even surpasses it in some aspects.<\/p>\n<p data-track=\"13\"><a href=\"https:\/\/www.1ai.net\/en\/tag\/llama\" title=\"_Other Organiser\" target=\"_blank\" >Llama<\/a>3.1 is composed of<a href=\"https:\/\/www.1ai.net\/en\/tag\/meta\" title=\"[View articles tagged with [Meta]]\" target=\"_blank\" >Meta<\/a>Developed by (formerly Facebook)<a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%a4%a7%e5%9e%8b%e8%af%ad%e8%a8%80%e6%a8%a1%e5%9e%8b\" title=\"[View articles tagged with [large-scale language model]]\" target=\"_blank\" >Large Language Models<\/a>Although it has not been officially released yet, the leaked version has already caused a stir in the community. This model not only includes the base model, but also the benchmark results of 8B, 70B and the maximum parameter 405B.<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-16317\" title=\"get-775\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/07\/get-775.jpg\" alt=\"get-775\" width=\"598\" height=\"280\" \/><\/div>\n<p data-track=\"14\"><strong>Performance comparison: Llama3.1 vs GPT-4o<\/strong><\/p>\n<p data-track=\"15\">Judging from the leaked comparison results, even the 70B version of Llama3.1 has surpassed GPT-4o on multiple benchmarks. This is the first time that an open source model has reached the SOTA (State of the Art) level on multiple benchmarks, which makes people sigh: the power of open source is really strong!<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-16319\" title=\"get-777\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/07\/get-777.jpg\" alt=\"get-777\" width=\"743\" height=\"592\" \/><\/div>\n<p data-track=\"16\"><strong>Model highlights: multi-language support, richer training data<\/strong><\/p>\n<p data-track=\"17\">The Llama 3.1 model is trained using 15T+ tokens from public sources, and the pre-training data cutoff is December 2023. It supports not only English, but also multiple languages including French, German, Hindi, Italian, Portuguese, Spanish, and Thai. This makes it perform well in multilingual conversation use cases.<\/p>\n<div class=\"pgc-img\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-16318\" title=\"get-776\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/07\/get-776.jpg\" alt=\"get-776\" width=\"753\" height=\"487\" \/><\/div>\n<p data-track=\"18\">The Llama3.1 research team attaches great importance to the security of the model. They adopted a multi-faceted data collection method, combining artificially generated data with synthetic data to mitigate potential security risks. In addition, the model also introduced boundary prompts and adversarial prompts to enhance data quality control.<\/p>\n<p data-track=\"19\">Model card source: https:\/\/pastebin.com\/9jGkYbXY#google_vignette<\/p>\n<p>&nbsp;<\/p>","protected":false},"excerpt":{"rendered":"<p>Llama 3.1 has leaked! You heard it right, the open source model with 405 billion parameters has been making waves on Reddit. It's probably the closest open source model to GPT-4o to date, even surpassing it in some ways. Llama 3.1 is a large-scale language model developed by Meta (formerly Facebook). Although it has not been officially released, the leaked version has already caused a stir in the community. This model contains not only the base model, but also benchmark results of 8B, 70B and 405B for the largest parameter. Performance Comparison:Llama3.1 vs GPT-4o From the leaked comparison results, even the 70B version of Llama3.1 outperforms GPT on a number of benchmarks<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[184,3670,297,371,219,862],"collection":[],"class_list":["post-16316","post","type-post","status-publish","format-standard","hentry","category-news","tag-llama","tag-llama3-1","tag-meta","tag-371","tag-219","tag-862"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16316","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=16316"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/16316\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=16316"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=16316"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=16316"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=16316"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}