😲 OpenAI Introduces "Images in ChatGPT"

+ Sam Altman Pivots His Role in OpenAI 👀

Today’s menu 🍽️👇️ 

Length: 5 minutes

  • OpenAI introduces "Images in ChatGPT” 🖼️ 

  • Sam Altman pivots his OpenAI role 👀 

  • Google rolls out Gemini’s real-time AI video features 📹️

  • Perplexity wants to buy TikTok 😲 

  • Otter’s new AI agent can speak up in meetings 🗣️

  • Top AI Tools of the week! 🛠️ 

BREAKING AI NEWS

 

ChatGPT’s image-generation feature gets an upgrade 📈 

OpenAI has introduced image generation capabilities powered by its advanced GPT-4o model directly into ChatGPT. This feature, called "Images in ChatGPT," allows users to create high-quality images within the chatbot itself. It is available across all subscription tiers, including Free, Plus, Pro, and Team  

The GPT-4o model represents a significant upgrade over previous image-generation tools like DALL-E 3. It is a multimodal AI system capable of handling text, images, and code within the same framework. Key improvements include:

  • Enhanced Object Handling: GPT-4o can accurately render up to 15-20 objects in a scene, compared to the 5-8 object limit of earlier models.

  • Interactive Image Editing: Users can refine images dynamically through real-time chat, adjusting elements like backgrounds and objects.

  • Context-Aware Modifications: The model can seamlessly edit existing images, including those with people, integrating changes to foreground and background elements.

OpenAI has implemented safeguards to prevent misuse, such as blocking explicit or harmful imagery requests and embedding metadata for transparency.

This rollout is expected to empower creators, designers, and developers with more precise and versatile tools for visual content creation  

OpenAI shuffles its leadership structure 👀 

OpenAI has announced a significant leadership reshuffle, with CEO Sam Altman shifting his focus to the company's technical direction. Altman will now concentrate on guiding OpenAI's research and product development efforts, while COO Brad Lightcap takes on expanded responsibilities, including overseeing day-to-day operations, international expansion, and partnerships with major tech companies like Microsoft and Apple  

The company has also promoted Mark Chen to Chief Research Officer and Julia Villagra to Chief People Officer, reflecting its growing ambitions and organizational scale. These changes come amidst a broader restructuring at OpenAI, which has seen high-profile departures, including former CTO Mira Murati, who left to start her own AI venture.

OpenAI remains committed to its mission of advancing frontier AI research and delivering products that benefit humanity. This aims to streamline operations and accelerate innovation in the competitive AI landscape.

Google rolls out Gemini’s real-time AI video features 📹️ 

Google has started rolling out real-time AI video features for its Gemini platform. These features, part of the broader "Project Astra" initiative, allow Gemini to analyze smartphone screens and live camera feeds in real time 😲 

Users can now ask Gemini questions about what’s displayed on their screens or through their camera lenses, receiving contextual and immediate responses.

Currently, these capabilities are available to select Google One AI Premium subscribers, with plans to expand access gradually. Demonstrations have shown Gemini assisting users with tasks like choosing paint colors for pottery by analyzing live video feeds. This rollout reinforces Google's leadership in AI-driven virtual assistants, as competitors like Amazon and Apple are still preparing similar updates for their platforms 👀 

Perplexity wants to buy TikTok and open-source its algorithm 😲 

Perplexity AI has proposed acquiring TikTok and transforming its algorithm into an open-source system. This comes amidst mounting pressure on TikTok's parent company, ByteDance, to divest its U.S. operations due to national security concerns.

Perplexity's plan includes:

  • Rebuilding TikTok's recommendation algorithm from scratch in U.S.-based data centers under American oversight.

  • Making the "For You" feed transparent and open-source.

  • Integrating Perplexity's AI-powered search engine with TikTok's video library.

  • Enhancing personalization for users who connect their Perplexity and TikTok accounts.

  • Adding multilingual capabilities through automatic translation.

The startup also aims to introduce features like real-time citations for videos to combat misinformation and foster trust. However, ByteDance has shown reluctance to sell TikTok's U.S. operations 👀 

Otter’s new AI agent can speak up in meetings 🗣️ 

Otter.ai has introduced a new feature: a voice-activated AI Meeting Agent that actively participates in meetings. Unlike its previous transcription-focused tools, this agent can now answer questions, schedule follow-ups, and draft emails using natural voice interaction  

It draws on a company's historical meeting data to provide relevant responses, ensuring confidentiality by limiting access to authorized participants only.

Currently, the AI Meeting Agent is compatible with Zoom, with plans to expand to Microsoft Teams and Google Meet soon. Otter has also launched two other AI agents: the Sales Agent, which provides real-time coaching during sales calls, and the SDR Agent, capable of conducting autonomous product demos.

CREATE CUSTOMISED ILLUSTRATIONS

Create customized illustrations in your own style 🎨 🖌️ 

With Freepik, you can develop a unique branded illustration style by uploading reference images to ensure uniform visuals throughout your content

Here’s how to get started 👇️ 

  1. Open Freepik AI's “Create” tab and hit the plus icon in the Style section.

  2. Upload your reference images (10–50 images suggested).

  3. Select your preferred quality (Ultra, High, or Medium).

  4. Now reate illustrations by choosing your style and entering a prompt.

After you input all the photos and information needed, simply generate your illustration!

Drop a comment and tell us which guide you'd like to see next 🤝

Everything you need to know about AI… in one place!

I just started my own X page where I provide AI news, guides, hacks and more (for FREE)!

We don’t miss a day, that means you get the most recent AI news, practical workflows, expert guides and premium content everyday.

TOP AI TOOLS OF THE WEEK

These AI tools are going VIRAL 🐝 

Gemma 3 - Google’s multimodal, multilingual, 128k context AI model family 🤖 

Cube 3D - Roblox’s new open-source text-to-3D object generator 🧊

Zoom AI Companion - Agentic AI for meeting productivity, and other tasks  

ReCamMaster - Edit camera angles and movement in existing videos 📹️ 

Warm regards,
Leo Grundström / Founder of Daily AI Edge

Reply

or to participate.