-
DeepSeek's Late-Night Amplification: 7B Parameters for Everyone's Visual Multimodal Model Janus-Pro-7B Open Source
January 28, 2011 - DeepSeek has announced that it has open-sourced a new visual multimodal model, Janus-Pro-7B, which beat Stable Diffusion and OpenAI's DALL-E 3 in the GenEval and DPG-Bench benchmarks. 1AI is available at: GitHub: click here to go there. HuggingFace: go here The official description of the model is to the following effect: Janus-Pro is an innovative autoregressive...- 3.1k
-
Tencent Hybrid 3D Generation Big Model 2.0 Open Source Release, Simultaneously Launched the "Industry's First One-Stop 3D Content AI Creation Platform"
January 21 news, Tencent today announced that the open source on-line mixed yuan 3D generate big model 2.0. tencent mixed yuan also synchronized on-line mixed yuan 3D AI creation engine, said to be "the industry's first one-stop 3D content AI creation platform". The technology claims that a sentence, a picture, or even a sketch can generate a 3D model, and it can even add movements, change textures, pinch characters, and do animation. Tencent's Mixed 3D-2.0 version mainly upgrades the geometry and texture models in the 3D generation process. The task of the geometry model is to capture 3...- 1.7k
-
Researchers Open Source Sky-T1 Reasoning AI Model, Costs Less Than $450 to Train
January 12, 2011 - This week, NovaSky, a research team from UC Berkeley's Sky Computing Lab, released an inference model called Sky-T1-32B-Preview. The model's performance in a number of key benchmarks is comparable to earlier versions of OpenAI's o1 model. Notably, Sky-T1-32B-Preview appears to be the first truly open-source inference model with a publicly available training dataset and code that allows users to reproduce the...- 1.4k
-
Open Source Media Player VLC Breaks 6 Billion Downloads, Previews Native AI Subtitling/Translation Capabilities
January 10, 2012 - VideoLAN, developer of the open-source media player VLC, is celebrating at CES 2025 as the player passes the 6 billion cumulative download mark. Jean-Baptiste Kempf, president of VideoLAN, said VLC's active user base continues to grow even in this era of streaming services. VideoLAN also previewed new features coming to VLC at CES: offline subtitle generation and translation based on native open-source AI models...- 1.4k
-
Microsoft open-sources Phi-4, a 14 billion parameter small language AI model that outperforms GPT-4o Mini
January 9, 2012 - After its December 12, 2024 release, Microsoft yesterday (January 8) open-sourced the small language model Phi-4 on the Hugging Face platform, allowing interested developers and testers to download, fine-tune, and deploy the AI model. Note: The model has only 14 billion parameters, yet it performs well in several benchmarks, even outperforming the much larger Llama 3.3 70B (nearly five times as many as Phi-4) and OpenAI's GPT-4o Mini; in...- 1.6k
-
World's First: Wizards Robotics Announces Open Source AgiBot World Million-Machine Dataset, Dramatically Outperforms Google's Open X-Embodiment
December 30, 2011 - Wisdom Robotics today announced the launch of AgiBot World, the world's first open source project based on real-world scenarios, an all-around hardware platform, and full quality control of millions of real-machine datasets. Wisdom Robotics said, "This milestone open source project marks the arrival of 'ImageNet time' in the field of body intelligence. ImageNet moment' in the field of Embodied Intelligence." Jiyuan Robotics will upload the data in batches on HuggingFace, Github, and agibot-world.com as planned, with the following addresses: Huggin...- 2.7k
-
Smart Spectrum open-sources GLM-PC base model CogAgent-9B to let AI intelligences "read" the screen
Smart Spectrum Technology Team public yesterday (December 26) published a blog post announcing the open source of GLM-PC's base model CogAgent-9B-20241220, based on GLM-4V-9B training, dedicated to intelligent body (Agent) tasks. Note: The model only needs screenshots as input (no textual representation such as HTML), and can predict the next GUI operation based on any user-specified task, combined with historical operations. Thanks to the universality of screenshots and GUI operations, CogAgent can be widely used...- 2.6k
-
Ali Tongyi Thousand Questions Open Source Visual Reasoning Model QVQ-72B-Preview: Think Like a Physicist
Ali Tongyi Qwen team released a blog post today (December 25th), announcing the launch of QVQ-72B-Preview, an open source visual reasoning model based on the Qwen2-VL-72B build, which is capable of finding solutions through logical reasoning calmly in the face of complex physics problems like a master of physics. Ali Tongyi Thousand Questions team evaluates QVQ-72B-Preview on 4 datasets, and 1AI attaches the relevant introduction as follows: MMMU: A university-level multidisciplinary and multimodal evaluation set designed to examine the model visual...- 2.1k
-
No Questions Asked Core Dome Open Sources World's First End-Side Omnimodal Understanding Model Megrez-3B-Omni, Supports Image, Audio, and Text Understanding
December 16th, 2011 - Unquestionable Core Dome today announced that it has open sourced Megrez-3B-Omni, a full-modal comprehension mini-model from Unquestionable Core Dome's end-side solution, and its language-only model version, Megrez-3B-Instruct. Officially, Megrez-3B-Omni is a full-modal comprehension model for the end-side, with the ability to process three types of modal data: Image, Audio, Text and Audio. Megrez-3B-Omni is a full-modal understanding model for the end, with the ability to process image, audio, and text at the same time: In terms of image understanding, Megrez-3B-Omni is currently the most popular model for OpenCompass, MME, MMMU, O...- 2.1k
-
DeepSeek-VL2 AI visual model open source: support for dynamic resolution, processing scientific research charts, parsing various terrain maps, etc.
DeepSeek's official public website released a blog post yesterday (December 13), announcing the open source DeepSeek-VL2 model, which has achieved very advantageous results in various evaluation indexes, and officially said that its visual model has officially entered the era of Mixture of Experts (MoE). Citing the official press release, 1AI attached the highlights of DeepSeek-VL2 as follows: Data: double the quality of training data compared to the first generation of DeepSeek-VL, and introduction of terrain understanding, visual localization, visual storytelling...- 1.9k
-
Hugging Face Releases SmolVLM Open Source AI Model: 2 Billion Parameters for End-Side Reasoning, Small and Fast
The Hugging Face platform published a blog post yesterday (November 26) announcing the launch of the SmolVLM AI visual language model (VLM), with just 2 billion parameters for device-side reasoning, which stands out from its peers by virtue of its extremely low memory footprint. Officials say the SmolVLM AI model benefits from being small, fast, memory efficient, and completely open source, with all model checkpoints, VLM datasets, training recipes, and tools released under the Apache 2.0 license. SmolVLM AI ...- 2.7k
-
Ali Tongyi Qianqian Releases Qwen2.5-Turbo AI Model: Supports 1 Million Tokens Contexts, Processing Time Reduced to 68 Seconds
November 19th, Ali Tongyi Qianqian released a blog post yesterday (November 18th) announcing the launch of the Qwen2.5-Turbo open source AI model in response to the community's request for a longer Context Length after months of optimization and polishing. Qwen2.5-Turbo extends the context length from 128,000 to 1,000,000 tokens, an improvement equivalent to about 1,000,000 English words or 1,500,000 Chinese characters, and can accommodate 10 complete novels,...- 3.9k
-
Take a look at AI virtual digital people and inventory current open source projects on digital people
Recently, in the AI circle, the digital man is getting prettier and prettier, and each company is launching the "open source strongest" digital man But, there are too many choices, how to know which one is suitable for you? You can't say "me + difficulty = give up", right? I can't! As a fanatic, I can't let you face such a dilemma! That's why I've made a decisive move! For you to one-time share before the digital person related integration package, do an inventory, including the effect of realization, the configuration required, generation time, etc., so that we can take a breath to see the current open source digital person in the end which is strong, together with the best choice of excavator! Digital people fire fire fire! To say A...- 10.3k
-
Ali Tongyi Thousand Questions open source Qwen2.5-Coder full range of models, claiming that the code ability to tie GPT-4o
November 12 news, Ali Tongyi Thousand Questions open source Qwen2.5-Coder full series of models, of which Qwen2.5-Coder-32B-Instruct become the current SOTA open source model, the official claim that the code ability to level with the GPT-4o. Qwen2.5-Coder-32B-Instruct as the open-source flagship model, on several popular code generation benchmarks (such as EvalPlus, LiveCodeBench, BigCodeBench) are...- 2.8k
-
Say Goodbye to Silent Movies: Smart Spectrum Launches New Clear Shadow, Generating 10-Second 4K60 Frame/Self-Audio Videos
Wisdom Spectrum technology team today released and open-sourced the latest version of the video model CogVideoX v1.5, compared to the original model, CogVideoX v1.5 will include 5/10 second, 768P, 16 frame video generation capabilities, I2V model support for any size scale, significantly improve the quality of graphical video and complex semantic understanding. Officially, CogVideoX v1.5 will also be synchronized to the "ClearVideo" platform, and combined with the newly launched CogSound sound model, the "new ClearVideo" will have the following features: Quality Improvement: In the ...- 3.7k
-
Meta Open Source Small-Language AI Models MobileLLM Family: Smartphone Friendly, 125M-1B Version Available
In a press release last week, Meta announced that it has officially open sourced the MobileLLM family of small language models that run on smartphones, and has added three new parameterized versions of the family, 600M, 1B, and 1.5B to the project's GitHub project page (click here to visit). According to Meta researchers, the MobileLLM family of models, built for smartphones, claims to have a lean architecture and introduces "SwiGLU activation functions," "grouped-query attenuation," and a "new language model with a new language model. ... -
Tencent Launches Hunyuan-Large Large Model: 389B Total Parameters, Industry's Largest Transformer-Based MoE Model Open-Sourced
Tencent announced the launch of the Hunyuan-Large model, which is the largest Transformer-based MoE model that has been open sourced in the industry, with 389 billion total parameters (389B) and 52 billion activation parameters (52B). Tencent has open sourced Hunyuan-A52B-Pretrain, Hunyuan-A52B-Instruct, and Hunyuan-A52B-Instruct-FP8 at Hugging Face, and released...- 3.4k
-
ElevenLabs pushes open-source mini-project X-to-Voice: transforming Twitter accounts into personalized avatars with one click mp
Artificial intelligence company ElevenLabs recently released an open-source project called "X-to-Voice," a tool that intelligently analyzes Twitter user profiles and automatically generates digital voices and animated avatars that match a user's personality. The project integrates a number of cutting-edge technologies: ElevenLabs' self-developed voice design API is responsible for voice generation, while the Taedra tool is in charge of dynamic avatar creation. On the technical support side, the project uses Apify for profile and image data collection, Hedra for dynamic avatar...- 5.3k
-
World's first open source AI standard released, developed by Microsoft, Google, Amazon, Meta, Intel, Samsung and other giants
At the ALL THINGS OPEN 2024 conference at the end of this month, the open source organization Open Source Initiative (OSI) officially released the Open Source Artificial Intelligence Definition (OSAID) version 1.0, marking the birth of the world's first open source AI standard. Founded in 1998, OSI is a global non-profit organization that aims to define and "manage" all things open source. The OSAID standard was co-designed by more than 25 organizations, including Microsoft, Google, Amazon, Meta, Intel,...- 5.4k
-
OpenAI Opens New SimpleQA Benchmark to Cure Big Models of "Nonsense"
On October 31, OpenAI announced that it is open-sourcing a new benchmark called SimpleQA, which measures the ability of language models to answer short fact-seeking questions, in order to measure the accuracy of language models. One of the open challenges in AI is how to train models to generate factually correct answers. Current language models sometimes produce incorrect output or unsubstantiated answers, a problem known as "hallucinations". Language models that can generate more accurate and less illusory answers are more reliable and can be used...- 3.3k
-
Google DeepMind opens SynthID Text tool to recognize AI-generated text
Google DeepMind announced on October 23 that it has officially open-sourced its SynthID Text text watermarking tool for free use by developers and businesses. Google launched the SynthID tool in August 2023, which has the ability to create AI content watermarks (declaring that the work was created by AI) and recognize AI-generated content. It can embed digital watermarks directly into AI-generated images, sound, text, and video without compromising the original content, as well as scanning that content for existing digital water... -
Wizen Robotics Announces Global Open Source of Rhinoceros X1, a Startup Program of "Wizen"
October 24th, Zhiyuan Robotics announced today that "Rhinoceros X1" is officially open-sourced for the world, and a full set of drawings and code for the hardware and software are online on GitHub, and the development guide is online on the official website of Zhiyuan Robotics. Zhiyuan Robotics official said, as the industry's first full-stack open source humanoid robot drawings and code company, the open source will unreservedly provide "one-stop" software and hardware technology resources, the total size of the material more than 1.2GB. In the machine structure hardware, open source content includes detailed machine structure drawings, hardware block diagrams and bill of materials (BOM), installation instructions, and the machine. (The open source content includes detailed drawings of the whole machine structure, hardware block diagrams and bill of materials (BOM), and instructions for installing the machine. ...- 5.5k
-
Open Source Venn diagram AI heavyweights are new: Stable Diffusion 3.5 arrives in a bucket, "out-of-the-box" on consumer-grade hardware
In a blog post yesterday (October 22), Stability AI announced the release of Stable Diffusion 3.5, which marks a significant advancement in open source AI graphical models. Stable Diffusion 3.5 is available in Medium (released on October 29), Large and Large Turbo sizes, designed to meet the different needs of scientific researchers, enthusiasts, startups and enterprises, with the following introduction: Stable Dif...- 3.2k
-
Wisdom Spectrum open source CogView3-Plus, related functions on the Wisdom Spectrum Clear Words App
Oct. 14, 2012 - Smart Spectrum's technical team announced today that it has open-sourced the text2img models CogView3 and CogView3-Plus-3B, and the capabilities of this series of models are now available on the Smart Spectrum Clear Words app. According to the introduction, CogView3 is a text2img model based on cascading diffusion. According to the introduction, CogView3 is a text2img model based on cascade diffusion, which consists of three stages as follows: Stage 1: Generate a 512x512 low-resolution image using the standard diffusion process. The second stage: using the relay diffusion process, the implementation of 2 times the super-resolution generation, from 512x512 ...- 4.6k
❯
Search
Scan to open current page
Top
Checking in, please wait
Click for today's check-in bonus!
You have earned {{mission.data.mission.credit}} points today!
My Coupons
-
¥CouponsLimitation of useExpired and UnavailableLimitation of use
before
Limitation of usePermanently validCoupon ID:×Available for the following products: Available for the following products categories: Unrestricted use:Available for all products and product types
No coupons available!
Unverify
Daily tasks completed: