{"id":7846,"date":"2024-04-13T09:14:08","date_gmt":"2024-04-13T01:14:08","guid":{"rendered":"https:\/\/www.1ai.net\/?p=7846"},"modified":"2024-04-13T09:14:08","modified_gmt":"2024-04-13T01:14:08","slug":"360-%e6%99%ba%e8%84%91-7b-%e5%8f%82%e6%95%b0%e5%a4%a7%e6%a8%a1%e5%9e%8b%e5%bc%80%e6%ba%90%ef%bc%8c%e6%94%af%e6%8c%81-50-%e4%b8%87%e5%ad%97%e9%95%bf%e6%96%87%e6%9c%ac%e8%be%93%e5%85%a5","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/7846.html","title":{"rendered":"360 Intelligent Brain 7B-parameter large model open-sourced, supports long text input of 500,000 words"},"content":{"rendered":"<p data-vmark=\"3254\"><a href=\"https:\/\/www.1ai.net\/en\/tag\/360\" title=\"_Other Organiser\" target=\"_blank\" >360<\/a> recently released the <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90\" title=\"[View articles tagged with [open source]]\" target=\"_blank\" >open source<\/a> 360 Intelligent Brain 7B, a 7-billion-parameter <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%a4%a7%e6%a8%a1%e5%9e%8b\" title=\"[View articles tagged with [large models]]\" target=\"_blank\" >large model<\/a>. It was trained on a corpus of 3.4 trillion tokens, mainly Chinese, English, and code, and is <span class=\"accentTextColor\">available in three context lengths: 4K, 32K, and 360K<\/span>. 
360 said that 360K (about 500,000 words) is the longest context length among current open source models in China.<\/p>\n<p data-vmark=\"3894\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-7847\" title=\"52b7b1bb-725f-4649-a23c-fa5748b6fe7e\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/04\/52b7b1bb-725f-4649-a23c-fa5748b6fe7e.png\" alt=\"52b7b1bb-725f-4649-a23c-fa5748b6fe7e\" width=\"1267\" height=\"994\" \/><\/p>\n<p data-vmark=\"f7da\">360 said that it verified the model&#039;s performance on the mainstream evaluation datasets of OpenCompass, including C-Eval, AGIEval, MMLU, CMMLU, HellaSwag, MATH, GSM8K, HumanEval, MBPP, BBH, and LAMBADA. These cover natural language understanding, knowledge, mathematical calculation and reasoning, code generation, logical reasoning, and more. The 360 model ranked first on four of these datasets and third on average.<\/p>\n<p data-vmark=\"489a\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-7848\" title=\"d113618f-d06c-4208-b356-83a32b721a81\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/04\/d113618f-d06c-4208-b356-83a32b721a81.png\" alt=\"d113618f-d06c-4208-b356-83a32b721a81\" width=\"1024\" height=\"546\" \/><\/p>\n<p data-vmark=\"cd00\">In the LongBench test (a multi-task, bilingual Chinese-English benchmark for evaluating the long text comprehension capabilities of large language models), 360 evaluated the tasks most closely related to Chinese long text applications: Chinese single-document question answering, multi-document question answering, summarization, and few-shot tasks. 
The 360Zhinao-7B-Chat-32K model achieved the highest average score.<\/p>\n<p data-vmark=\"e113\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-7849\" title=\"69ec13fb-5248-42c2-8409-528f1e8a4ca0\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2024\/04\/69ec13fb-5248-42c2-8409-528f1e8a4ca0.png\" alt=\"69ec13fb-5248-42c2-8409-528f1e8a4ca0\" width=\"1268\" height=\"557\" \/><\/p>\n<p data-vmark=\"6c6a\">In the English NeedleInAHaystack test (which inserts key information at different positions in a long text and then asks questions about it to test a large model&#039;s long text ability), 360Zhinao-7B-Chat-360K achieved an accuracy rate of more than 98%. 360 also constructed a Chinese NeedleInAHaystack test based on the SuperCLUE-200K evaluation benchmark, on which the model likewise achieved an accuracy rate of more than 98%.<\/p>\n<p data-vmark=\"08e0\">In addition to the model weights, the model&#039;s fine-tuning training code, inference code, and a full set of tools are also open source, so large model developers can use them &quot;out of the box&quot;.<\/p>\n<p data-vmark=\"80d9\">Zhou Hongyi once said that text length in the large model industry will soon reach 1 million words. &quot;We plan to open source this capability, so there is no need for everyone to reinvent the wheel. The 360K is mainly for the sake of reputation.&quot; He also called himself a &quot;believer in open source&quot; who believes in the power of open source.<\/p>","protected":false},"excerpt":{"rendered":"<p>360 recently open-sourced 360 Intelligent Brain 7B (a 7-billion-parameter model) on GitHub. The model was trained on a corpus of 3.4 trillion tokens, mainly Chinese, English, and code, and is available in three context lengths: 4K, 32K, and 360K. 360 said that 360K (about 500,000 words) is the longest context length among current open source models in China. 
360 said that they verified the model performance on the mainstream evaluation datasets of OpenCompass, including C-Eval, AGIEval, MMLU, CMMLU, HellaSwag, MATH, GSM8K, HumanEval, MBPP, BBH, LA<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[337,216,219],"collection":[],"class_list":["post-7846","post","type-post","status-publish","format-standard","hentry","category-news","tag-337","tag-216","tag-219"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/7846","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=7846"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/7846\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=7846"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=7846"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=7846"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=7846"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}