• WakeTheAI
  • Posts
  • 🗣️ Claude's Voice Mode is finally here

🗣️ Claude's Voice Mode is finally here

PLUS: OpenAI launches Stargate in UAE

Hola, AI fam 🤖

In today’s WakeTheAI edition:

  • Claude’s Voice Mode is here

  • OpenAI launches Stargate in UAE

  • Prompt: Write Engaging Video Scripts

  • Meta splits its AI team into two

  • Opera unveils Neon, an AI browser

  • OpenAI is testing the “Sign in with ChatGPT“ feature

  • xAI will pay Telegram $300M to deploy its Grok chatbot

  • Nick Clegg says requiring artists’ consent to train AI would kill the AI industry

ANTHROPIC
🗣️ Claude’s Voice Mode is here

Lazy Bits: Anthropic has rolled out Voice Mode for Claude’s mobile app, enabling full voice conversations with the AI. Users can talk to Claude, hear natural voice replies, and switch between text and voice seamlessly. It’s currently in beta and rolling out in English across iOS and Android.

In-Depth Details:

  • Voice Mode lets users speak naturally to Claude and receive audio responses — ideal for moments when typing isn’t convenient.

  • As Claude speaks, key points from the response appear on-screen in real-time, helping users follow along without missing context.

  • From daily planning and creative thinking to interview prep and learning while on the go, Voice Mode makes Claude more accessible in everyday life.

  • You can jump between text and voice within the same conversation, with full context retained across both modes.

  • Users on paid plans can connect Google Calendar, Gmail, and (for Enterprise users) Docs to access information via voice commands.

  • Free users can send 20–30 voice messages per session. Paid users get significantly higher limits for extended voice conversations.

  • Claude avoids voice cloning by offering only preset voices and includes safeguards to prevent impersonation and ensure policy compliance.

Lazy Conclusion: With OpenAI and Google already in the voice game, Anthropic’s move feels less like a leap ahead and more like catching up, but doing it right. Claude’s Voice Mode stands out with real-time key point displays and seamless mode-switching, pushing toward a more thoughtful, utility-driven voice experience.

TOGETHER WITH COSINE

Lazy Bits: Genie is your AI-powered senior software engineer. It reads your codebase, picks up tasks from Jira or Linear, and writes production-ready code—so you can focus on product, not pull requests.

Genie simplifies software development with features like:

  • Autonomous task execution: Assign issues directly from Linear or Jira. Genie branches out, implements the changes, tests the code, and submits the PR, no babysitting required.

  • Real-time collaborative coding: Watch Genie write code live. Intervene anytime and collaborate like you're pair-programming with a top engineer.

  • Context-aware reasoning: Genie reads your entire repo before making a move, so every PR is aligned with your architecture and stack.

  • Built for large codebases: Whether it’s 50k or 500k+ lines, Genie handles complexity like a pro. Multi-service systems, legacy code, or monorepos, it’s got it covered.

  • Seamless integration: It connects with GitHub, Linear, Slack, and Vercel, so it fits right into your existing workflow without friction.

Whether you're a startup CTO, dev team lead, or indie hacker, Genie helps you move faster, clean up technical debt, and ship with confidence.

Let your AI teammate handle the code →

Lazy Bits: OpenAI launched Stargate UAE, its first international AI infrastructure project under the new OpenAI for Countries program. Backed by the US government and UAE investment, it aims to build sovereign AI capability, scale compute globally, and bring ChatGPT to an entire nation.

In-Depth Details:

  • A 1GW compute cluster is being built in Abu Dhabi, with 200MW coming online in 2026 to support large-scale AI workloads.

  • UAE will be the first country to deploy ChatGPT nationwide, enabling access across sectors like education, healthcare, and transportation.

  • The UAE is also investing in US-based Stargate projects as part of a broader $1.4 trillion commitment to boost American tech and jobs.

  • Developed with support from Oracle, NVIDIA, Cisco, G42, and SoftBank, in coordination with the US government.

  • Stargate UAE will offer compute within a 2,000-mile radius, potentially serving half the global population.

  • This is the first of 10 planned partnerships to help nations build sovereign AI infrastructure aligned with democratic values.

Lazy Conclusion: By becoming the first to host a national Stargate, the UAE is claiming early leadership in AI infrastructure. It sets a precedent for how countries can secure strategic advantage in the AI age, not through regulation alone, but by owning the compute

Write Engaging Video Scripts

You can ask ChatGPT to act as your video content strategist and expert scriptwriter, creating a professional-quality script designed to maximize viewer engagement and narrative clarity.

ChatGPT will structure the video using a dependency-driven format, ensuring smooth logical flow, audience alignment, and brand consistency.

With clear timestamps, visual direction, and narration cues, the final script will be ready for seamless production and optimized viewer retention.

Prompt:

Act as a video content strategist and expert scriptwriter with a specialty in audience engagement. Your task is to develop a structured, professional-quality video script using a dependency-driven format that ensures flow, clarity, and impact.

The script should be tailored to the specified topic, target audience, and brand tone, while aligning with the core goal of the video. The structure must support visual planning, production timing, and smooth narrative delivery.

Key Focus Areas:

1. Topic Breakdown & Audience Intent
– Analyze the topic deeply to understand key value propositions
– Consider audience pain points, interests, and expectations

2. Script Structuring with Dependency Grammar
– Build the outline so each section logically flows into the next
– Ensure foundational concepts precede advanced ones

3. Narrative Planning with Time Segments
– Allocate approximate timestamps to each script section
– Provide pacing suggestions for delivery

4. Visual & Audio Layering
– Suggest visual elements (on-screen text, cut scenes, motion graphics)
– Align narration/dialogue with visuals for maximum impact

5. Brand Voice Integration
– Tailor tone, language, and storytelling to reflect the brand identity
– Use hooks and rhetorical devices aligned with the audience’s emotional triggers

Key Information To Include:

• Topic: [Describe the central theme or subject of the video]
• Target Audience: [Define the viewers – demographics, interests, pain points]
• Goal: [Clarify the core objective – educate, convert, entertain, inspire]
• Video Length: [Specify desired total runtime]
• Brand Voice: [Specify tone – e.g., friendly, bold, humorous, analytical]

Output Requirements:

• Present the script in a clearly formatted layout with the following elements:
- Timestamped Sections
- Headings and Subheadings
- Visual Direction (on-screen text, animations, cutaways)
- Audio/Narration (dialogue, monologue, voiceover cues)
• Ensure the structure reflects logical progression, engaging pacing, and room for visual interpretation.
• All instructions should be easy to follow for video editors, voiceover artists, or on-screen presenters.

Result:

TOGETHER WITH PACASO

He’s already IPO’d once – this time’s different

Spencer Rascoff grew Zillow from seed to IPO. But everyday investors couldn’t join until then, missing early gains. So he did things differently with Pacaso. They’ve made $110M+ in gross profits disrupting a $1.3T market. And after reserving the Nasdaq ticker PCSO, you can join for $2.80/share until 5/29.

This is a paid advertisement for Pacaso’s Regulation A offering. Please read the offering circular at invest.pacaso.com. Reserving a ticker symbol is not a guarantee that the company will go public. Listing on the NASDAQ is subject to approvals. Under Regulation A+, a company has the ability to change its share price by up to 20%, without requalifying the offering with the SEC.

  • Meta splits its AI team into two: one for consumer AI products and another for AGI research, aiming to speed up development and stay competitive in the AI race.

  • Opera unveils Neon, a new AI browser that handles coding, shopping, and form-filling, streamlining online tasks with features like Chat, Do, and Make, currently on a waitlist.

  • Nick Clegg says requiring artists’ consent to train AI would “kill” the UK’s AI industry, as Parliament debates new rules on transparency and copyright use.

  • Elon Musk’s xAI will pay Telegram $300M to deploy its Grok chatbot, aiming to reach 1B+ users and gain data to train its models. Telegram gets 50% of sales.

  • OpenAI is testing the â€śSign in with ChatGPT” feature for third-party apps, aiming to rival Apple, Google, and Microsoft’s sign-in services. Devs can apply now.

  1. 📚 EasilyLearn: AI learning assistant

  2. 📊 Hoox: Create short video ads with AI

  3. 🎥 Flow: Google’s AI-powered filmmaking tool

  4. 🧑‍🧑‍🧒‍🧒 Remento: Capture family memories using AI

Did you like & enjoy today's newsletter?

Your feedback will help us improve the newsletter for you.

Login or Subscribe to participate in polls.