- WakeTheAI
- Posts
- 👁️ AI that can 'see': Grok Vision
👁️ AI that can 'see': Grok Vision
PLUS: The Washington Post x OpenAI

Hola, AI fam 🤖
In today’s WakeTheAI edition:
xAI releases Grok Vision
The Washington Post partners with OpenAI
Prompt: Deep Work Philosophy Transformation Plan
Fireflies releases 200+ AI mini apps
OpenAI told a judge it would buy Chrome from Google
Apple removed the “available now“ label as per the NAD’s instruction
Google pays Samsung an “enormous sum“ to make Gemini the default AI
xAI
👁️ xAI releases Grok Vision

Lazy Bits: xAI has launched Grok Vision, giving its chatbot the power to “see” through your smartphone camera, making it capable of identifying and responding to real-world visual input, much like ChatGPT and Gemini.
In-Depth Details:
Introducing Grok Vision, multilingual audio, and realtime search in Voice Mode. Available now.
Grok habla español
Grok parle français
Grok Türkçe konuşuyor
グロクは日本語を話す
ग्रोक हिंदी बोलता है— Ebby Amir (@ebbyamir)
11:16 PM • Apr 22, 2025
Multimodal Model: Grok-1.5V is xAI’s first vision-capable model, designed to process both images and text, including documents, charts, and real-world scenes.
Real-Time Visual Understanding: Grok Vision allows iOS users to point their camera at objects, signs, or documents and get contextual answers instantly.
Voice & Language Enhancements: Grok now supports multilingual audio and real-time search in voice mode—features currently limited to Android SuperGrok users.
Spatial Reasoning Benchmark: Grok-1.5V scored 68.7% on the RealWorldQA test, outperforming GPT-4V, Claude 3, and Gemini Pro 1.5 in real-world reasoning.
Persistent Memory: A recently added memory system enables Grok to recall details from previous conversations for more context-aware interactions.
Creative Capabilities: Grok also includes a canvas-like tool to help users build documents and lightweight apps directly within the interface.
Lazy Conclusion: With Grok Vision, xAI is no longer just competing in the chatbot race; it’s setting a new standard for AI that can see, reason, and act in the real world. Vision is no longer a luxury for AI models, it’s the new baseline.

Lazy Bits: The Washington Post has partnered with OpenAI to make its journalism directly accessible in ChatGPT. Users will now see summaries, quotes, and links to original Post articles in AI-generated responses.
In-Depth Details:
Trusted News in ChatGPT: ChatGPT will now include content from The Washington Post in response to relevant user queries, spanning politics, world news, tech, and more.
Always Attributed: Every mention includes clear attribution and direct links to full articles for deeper context.
Media Partnerships Grow: The Post joins over 20 publishers contributing to ChatGPT, expanding AI access to 160+ trusted news outlets.
WaPo's AI Ambitions: The Post has been building its own AI tools like “Ask The Post AI” and “Climate Answers,” while remaining model-agnostic in its newsroom experiments.
Lazy Conclusion: As AI becomes a primary information gateway, this partnership ensures reliable journalism stays visible, verifiable, and front-and-center in the ChatGPT era.

Deep Work Philosophy Transformation Plan
You can ask ChatGPT to act as your Deep Work implementation strategist, creating a step-by-step plan to transition from distraction-driven habits to a focus-first workflow.
ChatGPT will identify attention leaks, design a structured deep work schedule, and suggest keystone habits to build long-term focus.
The output will be organized in a clear markdown table, offering actionable, measurable changes to help you reclaim mental clarity and produce high-value work consistently.
Prompt:
Act as a Deep Work philosophy implementation strategist with expertise in high-performance focus systems. Your task is to design a structured, step-by-step transformation plan that shifts a professional’s daily routine from distraction-prone habits to a Deep Work–driven workflow.
The goal is to eliminate mental clutter, reinforce long-form thinking, and reclaim attention for high-priority output. Your plan should guide the user through identifying weak points in their current routine and systematically replacing them with intentional, deep work practices.
Key Focus Areas:
1. Identifying Cognitive Leaks – Map out distractions, inefficiencies, and habits that prevent sustained attention.
2. Designing a Deep Work Schedule – Recommend a realistic daily structure that includes protected time blocks and shutdown rituals.
3. Creating a Focus-Friendly Environment – Suggest workspace modifications and digital constraints that support deep concentration.
4. Building Momentum with Keystone Habits – Introduce routines that train the mind for depth over time.
5. Tracking and Iterating – Propose simple metrics to track consistency, focus duration, and task output.
Key Information to Include:
High-Priority Tasks: [Insert your top cognitive tasks that demand focus]
Current Work Environment: [Describe your workspace, routines, or setting]
Biggest Distractions: [List key interruptions or habits that break concentration]
Typical Daily Schedule: [Outline your standard workday]
Productivity Goals: [Define what outcomes or habits you aim to build]
Output Requirements:
• Present the implementation plan in a markdown table with two columns:
• Current Practices – Briefly describe behaviors, routines, or habits that dilute focus.
• Deep Work Practices – Recommend targeted changes aligned with Deep Work principles.
• Each row should address one specific area: time management, environment, task prioritization, mental clarity, etc.
• Use clear, brief language that makes the plan actionable and measurable.
Result:



OpenAI told a judge it would buy Chrome if Google is forced to sell it, as part of an antitrust case aiming to restore competition in online search.
Motorola launches SVX, a device with a body cam, mic, and AI assistant to speed up emergency response, cut report time, and integrate with 911 systems.
Apple removed the “available now” label from its AI features after NAD said it was misleading, as many tools like Siri upgrades weren't yet fully released.
Fireflies releases 200+ AI mini apps to auto-extract meeting insights, boosting productivity and integrating with Slack, Salesforce, and more.
Google pays Samsung an “enormous sum” and ad revenue share to make Gemini the default AI on Galaxy S25 amid DOJ scrutiny and rival bids from Microsoft and Perplexity.

✍️ NoteX: Transform lengthy content into smart insights
🧑💻 SideJot: AI-powered task planner for focused productivity
✈️ MagicTrips: Generate custom travel itineraries in seconds
📊 Edraw AI: AI-powered diagramming and visualization tool
🌐 FetchFox: Scrape any data from any website with AI



Did you like & enjoy today's newsletter?Your feedback will help us improve the newsletter for you. |