{"id":54182,"date":"2026-06-23T15:46:25","date_gmt":"2026-06-23T07:46:25","guid":{"rendered":"https:\/\/www.1ai.net\/?p=54182"},"modified":"2026-06-23T15:46:25","modified_gmt":"2026-06-23T07:46:25","slug":"%e4%ba%ac%e4%b8%9c%e5%bc%80%e6%ba%90%e5%ae%9e%e6%97%b6%e8%a7%86%e9%a2%91%e8%a7%86%e8%a7%89%e8%af%ad%e8%a8%80%e4%ba%a4%e4%ba%92%e6%a8%a1%e5%9e%8b-joyai-vl-interaction","status":"publish","type":"post","link":"https:\/\/www.1ai.net\/en\/54182.html","title":{"rendered":"JoyAI-VL-Interaction"},"content":{"rendered":"<p>June 23 news: yesterday, <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e4%ba%ac%e4%b8%9c\" title=\"[Sees articles with tags]\" target=\"_blank\" >JD.com<\/a> announced the <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e5%bc%80%e6%ba%90\" title=\"[View articles tagged with [open source]]\" target=\"_blank\" >open-source<\/a> release of its real-time video vision-language <a href=\"https:\/\/www.1ai.net\/en\/tag\/%e4%ba%a4%e4%ba%92%e6%a8%a1%e5%9e%8b\" title=\"[Sees articles with [interactive model] labels]\" target=\"_blank\" >interaction model<\/a>, JoyAI-VL-Interaction. According to JD.com, this is the world's first all-input model and system, with day-0 support from vLLM-Omni.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-54183\" title=\"aa6c3bfej00th2qui003sd000ufzm\" src=\"https:\/\/www.1ai.net\/wp-content\/uploads\/2026\/06\/aa6c3bfej00th2qui003sd000u000fzm.jpg\" alt=\"aa6c3bfej00th2qui003sd000ufzm\" width=\"1080\" height=\"575\" \/><\/p>\n<p>JoyAI-VL-Interaction supports voice input and output, a visual interface, long-term memory, a backend model interface and a vLLM deployment scheme. 
According to JD.com, developers can swap out the ASR, TTS, backend models, external tools and business modules, so the system can be adapted into real-time AI assistants for scenarios such as security monitoring, elderly and child care, live-stream commentary, e-commerce shopping guidance, operational guidance, AI glasses and accessibility aids.<\/p>\n<p>According to official data, in a blind evaluation of 58 real-world cases, JoyAI-VL-Interaction achieved an overall success rate of 77.6%, compared with 87.9% for the Gemini video call assistant.<\/p>\n<p>GitHub: github.com\/jd-opensource\/JoyAI-VL-Interaction<\/p>\n<p>Hugging Face: huggingface.co\/jdopensource\/JoyAI-VL-Interaction-Preview<\/p>","protected":false},"excerpt":{"rendered":"<p>June 23 news: yesterday, JD.com announced the open-source release of its real-time video vision-language interaction model, JoyAI-VL-Interaction. According to JD.com, this is the world's first all-input model and system, with day-0 support from vLLM-Omni. JoyAI-VL-Interaction supports voice input and output, a visual interface, long-term memory, a backend model interface and a vLLM deployment scheme. 
According to JD.com, developers can swap out the ASR, TTS, backend models, external tools and business modules, so the system can be adapted into real-time AI assistants for scenarios such as security monitoring, elderly and child care, live-stream commentary, e-commerce shopping guidance, operational guidance, AI glasses and accessibility aids.<\/p>","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[146],"tags":[8610,1624,219],"collection":[],"class_list":["post-54182","post","type-post","status-publish","format-standard","hentry","category-news","tag-8610","tag-1624","tag-219"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/54182","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/comments?post=54182"}],"version-history":[{"count":0,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/posts\/54182\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/media?parent=54182"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/categories?post=54182"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/tags?post=54182"},{"taxonomy":"collection","embeddable":true,"href":"https:\/\/www.1ai.net\/en\/wp-json\/wp\/v2\/collection?post=54182"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}