🥳 ChatGPT Finally Gets Video Recognition

PLUS: Gemini 2.0 Better Than ChatGPT?

December is full of new AI surprises 🎁 

Length: 5 minutes

  • ChatGPT gets video recognition (& Santa surprise) 🤖 

  • Everything you need to know about Gemini 2.0  

  • All the new features of iOS 18.2 📱 

  • ChatGPT almost caused a global panic 😲 

  • Loom + ChatGPT = 🔥🔥🔥

  • Top AI Tools of the week! 🛠️ 

Your daily AI dose

Mindstream is the HubSpot Media Network’s hottest new property. Stay on top of AI, learn how to apply it… and actually enjoy reading. Imagine that.

Our small team of actual humans spends their whole day creating a newsletter that’s loved by over 150,000 readers. Why not give us a try?

BREAKING AI NEWS

ChatGPT can finally understand videos! 📽️ 

OpenAI has finally released the real-time video capabilities for ChatGPT that it demoed nearly seven months ago.

This new feature, part of the Advanced Voice Mode, allows users to point their phones at objects and have ChatGPT respond in near real-time. It can also understand what's on a device's screen via screen sharing, making it a versatile tool for various applications 📱 

To access this feature, users subscribed to ChatGPT Plus, Team, or Pro can tap the voice icon next to the ChatGPT chat bar, then tap the video icon on the bottom left to start video. For screen sharing, users can tap the three-dot menu and select "Share Screen".

However, ChatGPT Enterprise and Edu subscribers will have to wait until January for this feature, and there is no timeline for users in the EU, Switzerland, Iceland, Norway, or Liechtenstein 👀 

OpenAI has also introduced a festive "Santa Mode" for ChatGPT, which adds a jolly Santa voice to the list of preset voices. It is available to all users and will remain active until the end of December 🎅 

First o1, and now this. OpenAI is not disappointing with their "ship-mas" releases!

Google reveals Gemini 2.0  🤖 

Google has revealed Gemini 2.0, its latest flagship AI model, designed for the "agentic era." The model can generate text, images, and speech. Here’s everything you need to know 👇️ 

  • Gemini 2.0 can natively generate images and audio in addition to text. This includes creating photorealistic visuals and narrating text with customizable voices 🗣️ 

  • The model can use tools such as Google Search, as well as third-party apps and services, to execute tasks and provide more comprehensive responses.

  • Gemini 2.0 introduces agentic AI features, allowing it to independently accomplish tasks with adaptive decision-making. This includes automating tasks like shopping or scheduling appointments based on user prompts 🤔 

  • Google says the model is twice as fast as Gemini 1.5 Pro and offers improved reasoning and understanding in areas like coding and image analysis 🧑‍💻 

  • Google has implemented SynthID technology to watermark all audio and images generated by Gemini 2.0, ensuring outputs are flagged as synthetic to prevent misuse.

Gemini 2.0 is available through the Gemini API and Google’s AI developer platforms, AI Studio and Vertex AI.

An experimental release is currently accessible to developers, with a wider rollout planned for January 📆 

Do you think Gemini 2.0 will be more powerful than OpenAI’s o1 model? 🤔 

Here’s everything that’s new with iOS 18.2 📱 

Apple has officially released iOS 18.2, bringing a set of new features and enhancements to iPhone users. Here are some of the key updates 👇️ 

  • Image Playground: A new app that allows users to generate playful images using descriptions or elements from their photo library. This feature supports animation and illustration styles and is integrated into Messages.

  • Genmoji: Create custom emojis directly from the keyboard. These Genmoji can be synced across devices via iCloud, adding a personalized touch to your chats.

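  • ChatGPT Integration: Siri can now hand requests off to ChatGPT for more comprehensive responses. These requests are sent to OpenAI rather than processed on-device; Apple says Siri asks for permission before sharing anything and that OpenAI does not store requests from users who are not signed in to a ChatGPT account.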

  • Visual Intelligence: Instantly learn about objects or places by pointing your camera. This feature enhances the Camera Control experience.

  • Two-stage Shutter: Lock focus and exposure with precision.

  • Mail app redesign: The Mail app has been redesigned to categorize emails into four tabs: Primary, Transactions, Updates, and Promotions.

  • Layered Recording: Create multi-track recordings with ease, ideal for creative users. This feature is available only on the iPhone 16 Pro and 16 Pro Max.

  • Default App Settings: Customize default apps for messaging and calling globally.

After Apple’s promise of bringing AI to the iPhone, this is the first significant update that is actually useful and caters to a wide audience. Whether it’s the fun of creating Genmoji, the practicality of the new Mail app, or the smarter Siri with ChatGPT, there’s something for everyone.

How ChatGPT almost caused a global panic 👀

OpenAI experienced a significant outage affecting ChatGPT, Sora, and its developer-facing API on December 11, 2024.

The outage began around 3 PM PT and lasted until approximately 9 PM PT. During this period, users encountered error messages and were unable to access these services (including me 😭)

OpenAI quickly acknowledged the issue on social media, stating that they had identified the problem and were working on a fix. The outage coincided with the launch of ChatGPT's integration with Apple's iOS 18.2, leading some users to speculate about a connection, although OpenAI clarified that the outage was unrelated 👀 

This AI will research for you 🤖 

Google has introduced a new AI tool called Deep Research, which uses its Gemini bot to conduct comprehensive web-based research on behalf of users.

When a user inputs a query, Deep Research generates a multi-step research plan that can be edited or approved by the user 🧑‍💻 

The tool performs multiple related searches to gather and refine information, ensuring thorough analysis.

Once the research is complete, Deep Research provides a detailed report of its key findings, including links to the original sources.

Users can export the AI-generated research to Google Docs for further use or sharing.

Try Deep Research Here ◀️ 

YOUR OWN AI CLONE

Delegate tasks like a pro using Loom and ChatGPT 🤖📽️

Now you can use Loom and ChatGPT to create effective Standard Operating Procedures (SOPs) using just a simple prompt ⌨️

It’s pretty helpful if you spend hours creating documentation for onboarding, meetings, or any other use case that involves official documents 📃 

Here’s how to get started (for FREE) 👇️ 

  1. Log into Loom and record an instructional video of something you want to delegate 📽️

    Make sure to copy the transcript of the whole video from the “Transcript” tab.

  2. Open ChatGPT and paste this prompt:

    "Transform this transcript from an instructional video into a comprehensive set of actionable steps to create an SOP (standard operating procedure), that will be easy for a team member to understand and execute. The transcript includes [briefly describe the task or process covered]. You should highlight key points, cautionary notes, and tips for efficiency. The goal outcome is a step-by-step guide I can use for effective delegation, that minimizes misunderstandings and errors and maximizes productivity and accuracy in task execution: [Include the transcript]"

  3. Review the response and export the SOP in your preferred format.

  4. Share the video and the new SOP with your team member or supplier.
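
If you write SOPs often, filling in the prompt from step 2 by hand gets old fast. Here is a minimal Python sketch that does it for you. The function name `build_sop_prompt` and its two parameters are our own illustration, not part of Loom or ChatGPT, and the script does not call either service: it only builds the text you then paste into ChatGPT.

```python
# Prompt text copied from step 2, with the two bracketed placeholders
# turned into named fields.
PROMPT_TEMPLATE = (
    "Transform this transcript from an instructional video into a "
    "comprehensive set of actionable steps to create an SOP (standard "
    "operating procedure), that will be easy for a team member to "
    "understand and execute. The transcript includes {task_description}. "
    "You should highlight key points, cautionary notes, and tips for "
    "efficiency. The goal outcome is a step-by-step guide I can use for "
    "effective delegation, that minimizes misunderstandings and errors and "
    "maximizes productivity and accuracy in task execution: {transcript}"
)


def build_sop_prompt(transcript: str, task_description: str) -> str:
    """Return the step 2 prompt with the transcript and task filled in.

    Hypothetical helper for illustration only.
    """
    # Collapse line breaks and repeated spaces left over from copying
    # the transcript out of the Transcript tab.
    cleaned_transcript = " ".join(transcript.split())
    cleaned_description = task_description.strip()

    if not cleaned_transcript:
        raise ValueError("transcript is empty")
    if not cleaned_description:
        raise ValueError("task_description is empty")

    return PROMPT_TEMPLATE.format(
        task_description=cleaned_description,
        transcript=cleaned_transcript,
    )


if __name__ == "__main__":
    # Made-up example values; swap in your own transcript.
    prompt = build_sop_prompt(
        transcript="First open the dashboard.\nThen click Export.",
        task_description="exporting the weekly report",
    )
    print(prompt)
```

Run it, copy the printed prompt into ChatGPT, and carry on with the remaining steps as usual.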

Drop a comment and tell us which guide you'd like to see next 🤝

The best place on the Internet to learn about AI! 🌐 🤖 

I just started my own X page where I provide AI news, guides, hacks and more (for FREE)!

We don’t miss a day, which means you get the most recent AI news, practical workflows, expert guides and premium content every day.

TOOLS OF THE WEEK

These AI tools are going VIRAL 🐝 

Remy AI - Charismatic AI sleep coach that takes care of tracking sleep metrics, circadian rhythms, evening routines, and sleep environment 😴 

Magic Clips - Turn long videos into viral shorts instantly with AI

AgentPlace - Create AI-driven websites and apps through simple text instructions 🤖 

Magic Roll - Create viral shorts in one click with B-roll, motion graphics, and AI-powered captions 📱 

Warm regards,
Leo Grundström / Founder of Daily AI Edge
