{"id":58709,"date":"2024-10-02T11:01:54","date_gmt":"2024-10-02T10:01:54","guid":{"rendered":"https:\/\/dataconomy.ru\/?p=58709"},"modified":"2024-10-02T11:01:54","modified_gmt":"2024-10-02T10:01:54","slug":"open-source-nvidia-nvlm-1-0-models","status":"publish","type":"post","link":"https:\/\/dataconomy.ru\/2024\/10\/02\/open-source-nvidia-nvlm-1-0-models\/","title":{"rendered":"Nvidia introduces open-source NVLM 1.0 models"},"content":{"rendered":"
Nvidia has officially entered the ring with a powerful open-source AI model, NVLM 1.0, challenging industry giants like OpenAI and Google.<\/p>\n
The company\u2019s new NVLM 1.0 family of large multimodal language models promises to deliver cutting-edge capabilities across both visual and text-based tasks.<\/p>\n
Leading the pack is the 72 billion parameter NVLM-D-72B, a model designed to perform at the highest level, making a massive impact on vision-language tasks while improving traditional text-based outputs.<\/p>\n
The release of NVLM 1.0<\/strong> marks a notable shift in the AI ecosystem, which proprietary models have largely dominated. Nvidia\u2019s decision to make these model weights publicly available\u2014and eventually release the training code\u2014offers researchers and developers access to tools that rival the likes of GPT-4<\/strong>. This is a rare move in an industry where most advanced models remain under lock and key, tightly controlled by tech giants.<\/p>\n As Nvidia stated in their research paper<\/a>, “NVLM 1.0 achieves state-of-the-art results on vision-language tasks, rivaling both proprietary and open-access models.”<\/strong><\/p>\n What this means for developers is a new frontier in AI accessibility<\/strong>, much like what Meta did with Llama 3.2<\/a>, giving smaller labs and independent researchers a chance to work with top-tier AI tools without having to navigate the often prohibitive costs or corporate restrictions.<\/p>\n The open-source release of NVLM 1.0<\/strong> has generated excitement across the AI research community. One prominent researcher highlighted the significance of the model on social media, stating:<\/p>\n Wow nvidia just published a 72B model with is ~on par with llama 3.1 405B in math and coding evals and also has vision \ud83e\udd2f pic.twitter.com\/c46DeXql7s<\/a><\/p>\n — Phil (@phill__1) October 1, 2024<\/a><\/p><\/blockquote>\n\n