
xAI has rolled out a new visual AI feature for Grok called ‘Grok Vision,’ allowing users to point their smartphone camera at objects—like signs, products, or documents—and ask the chatbot questions about them. The functionality mirrors real-time vision features in Gemini and ChatGPT. For now, it’s only available in the iOS Grok app, with Android support expected later.
GROK CAN SEE WHAT YOU SEE—LITERALLY
— Mario Nawfal (@MarioNawfal) April 20, 2025
Grok’s voice mode comes with camera access, letting users point their phone at something and ask, “What am I looking at?”
The Vision feature on iOS allows the chatbot to analyze real-world objects, text, and environments through your… https://t.co/cmtINP8yp6 pic.twitter.com/N1b6pcYZOi
New Grok features—multilingual audio and real-time voice search—are now available, though only for Android users on the $30/month SuperGrok plan.
Grok has been gaining new features at a steady clip. Earlier this month, xAI added a “memory” component to Grok that lets the bot pull on details from past conversations. Grok also got a canvas-like tool for creating docs and apps.