{"id":18063,"date":"2024-11-21T00:30:49","date_gmt":"2024-11-21T00:30:49","guid":{"rendered":"https:\/\/gpt.m2mbeta.com\/?p=18063"},"modified":"2024-11-21T00:30:49","modified_gmt":"2024-11-21T00:30:49","slug":"uh-oh-xs-grok-ai-can-now-understand-images-2","status":"publish","type":"post","link":"https:\/\/gpt.m2mbeta.com\/?p=18063","title":{"rendered":"Uh-oh, X&#8217;s Grok AI can now understand images"},"content":{"rendered":"<div id=\"article\">\n<p>Elon Musk&#8217;s AI chatbot <a href=\"https:\/\/www.engadget.com\/the-latest-version-of-xais-grok-can-process-images-120025782.html\" target=\"_blank\" data-ga-click=\"1\" data-ga-label=\"$text\" data-ga-item=\"text-link\" data-ga-module=\"content_body\" title=\"(opens in a new window)\">can now &#8220;understand&#8221; images<\/a>, including information-riddled diagrams and charts. Sorry, doesn&#8217;t everyone use the platform once known as Twitter for multi-disciplinary research and optimizing their workflows??<\/p>\n<p>Introduced as <a href=\"https:\/\/x.ai\/blog\/grok-1.5v\" target=\"_blank\" data-ga-click=\"1\" data-ga-label=\"$text\" data-ga-item=\"text-link\" data-ga-module=\"content_body\" title=\"(opens in a new window)\">Grok-1.5V<\/a> \u2014 or Grok 1.5 &#8220;Vision,&#8221; the company&#8217;s &#8220;first-generation multimodal model&#8221; \u2014 the bot will be able to not only respond to your uploaded pictures and screenshots but also reason through complex documents, science diagrams, charts, screenshots, and photographs, the company says. Additionally, Grok-1.5V will gain &#8220;real-world spatial understanding&#8221; to better understand the physical world depicted in the images uploaded by its users. <\/p>\n<p>&#8220;Advancing both our multimodal understanding and generation capabilities are important steps in building beneficial AGI that can understand the universe,&#8221; the company wrote in its announcement. 
&#8220;In the coming months, we anticipate to make significant improvements in both capabilities, across various modalities such as images, audio, and video.&#8221;<\/p>\n<div class=\"flex mx-auto mt-8 w-full max-w-3xl font-sans text-lg leading-normal md:text-xl md:leading-7\">\n        <span class=\"font-bold text-primary-400\">SEE ALSO:<\/span><br \/>\n        <a href=\"https:\/\/mashable.com\/article\/x-twitter-elon-musk-grok-ai-generated-user-tweets\" class=\"flex items-center text-secondary-300\"><br \/>\n            <span class=\"ml-1\">If you&#8217;re a paying X user, Elon Musk wants his Grok AI to write your posts for you, report says<\/span><br \/>\n            <svg class=\"ml-1 w-4 h-4 font-normal fill-current\"><use href=\"\/images\/icons\/spritemap.svg#sprite-arrow-right-thin\"\/><\/svg><br \/>\n        <\/a>\n    <\/div>\n<p>Example use cases include translating a diagram into Python code, turning a child&#8217;s drawing into a bedtime story, pinpointing the largest object among a group of many, and telling a driver whether they have enough space to drive around an obstacle. <\/p>\n<p>Grok-1.5V is being released alongside xAI&#8217;s <a href=\"https:\/\/data.x.ai\/realworldqa.zip\" target=\"_blank\" data-ga-click=\"1\" data-ga-label=\"$text\" data-ga-item=\"text-link\" data-ga-module=\"content_body\" title=\"(opens in a new window)\">RealWorldQA<\/a>, an image and prompt dataset designed to test other GenAI models against Grok&#8217;s real-world reasoning. 
<\/p>\n<p>Competition is the least of Grok&#8217;s worries, however. Despite xAI&#8217;s continued investment, <a href=\"https:\/\/mashable.com\/article\/what-is-grok-xai-chatbot\" target=\"_self\" data-ga-click=\"1\" data-ga-label=\"$text\" data-ga-item=\"text-link\" data-ga-module=\"content_body\">Grok<\/a> has yet to stick with early users and staff \u2014 a new report alleges its own developers struggle to use the slow xAI API. That same report, published by <em>Fortune<\/em> this week, highlighted X employee concerns about Musk suggesting Grok <a href=\"https:\/\/mashable.com\/article\/x-twitter-elon-musk-grok-ai-generated-user-tweets\" target=\"_self\" data-ga-click=\"1\" data-ga-label=\"$text\" data-ga-item=\"text-link\" data-ga-module=\"content_body\">write paid users&#8217; posts for them<\/a>, despite warnings from developers and staff. Last week, Grok came under fire for <a href=\"https:\/\/mashable.com\/article\/elon-musk-x-twitter-ai-chatbot-grok-fake-news-trending-explore\" target=\"_self\" data-ga-click=\"1\" data-ga-label=\"$text\" data-ga-item=\"text-link\" data-ga-module=\"content_body\">generating fake news headlines<\/a> from an alternate reality where Iran had assailed Tel Aviv with a military arsenal \u2014 <a href=\"https:\/\/lifehacker.com\/tech\/grok-is-making-up-fake-news-on-x\" target=\"_blank\" data-ga-click=\"1\" data-ga-label=\"$text\" data-ga-item=\"text-link\" data-ga-module=\"content_body\" title=\"(opens in a new window)\">not its first time<\/a>. <\/p>\n<p>While GenAI chatbots hallucinating realities and generating fake news is par for the course, Grok&#8217;s gaffe is indicative of yet another sitewide issue. 
The bot, an unsurprising response to ChatGPT from Musk, is being integrated into a platform that has slowly whittled away at its defenses against AI gone bad. Combined with X&#8217;s all-around poor reputation for moderation and the CEO&#8217;s own refusal to address misinformation in aid of the site&#8217;s &#8220;citizen journalists,&#8221; Grok occupies a precarious spot in the platform&#8217;s besieged information ecosystem. <\/p>\n<p>Grok-1.5V will be available to early testers and select users soon. <\/p>\n<\/div>\n<hr style=\"border-top: 2px solid #ccc; margin-top: 20px;\">\n<p><em>Source: <\/em> <em><a 
href=\"https:\/\/mashable.com\/article\/x-twitter-grok-ai-can-understand-images\">mashable.com\u2026<\/a><\/em><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Elon Musk&#8217;s AI chatbot can now &#8220;understand&#8221; images, including information-riddled diagrams and charts. Sorry, doesn&#8217;t everyone use the platform once known as Twitter for multi-disciplinary research and optimizing their workflows?? Introduced as Grok-1.5V \u2014 or Grok 1.5 &#8220;Vision,&#8221; the company&#8217;s &#8220;first-generation multimodal model&#8221; \u2014 the bot will be able to not only respond to [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-18063","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"_links":{"self":[{"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=\/wp\/v2\/posts\/18063","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=18063"}],"version-history":[{"count":0,"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=\/wp\/v2\/posts\/18063\/revisions"}],"wp:attachment":[{"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=18063"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=18063"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/gpt.m2mbeta.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=18063"}],"curies":[{"name"
:"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}