- Microsoft Copilot now includes native image generation using GPT-4o.
- The app is testing Agent Actions for managing daily tasks.
- Visual updates include evolving Copilot characters and full-screen voice mode.
- New features reflect Microsoft’s push for a unified AI productivity platform.
Copilot Integrates Native GPT-4o Image Creation
Microsoft’s Copilot app has taken a significant step by embedding native image generation powered by OpenAI’s GPT-4o model. This upgrade replaces the earlier DALL-E 3 integration, allowing users across platforms to generate high-quality visuals directly within the app without relying on third-party services or external tools. The move highlights Microsoft’s strategy to streamline creative workflows inside its expanding AI ecosystem.
Agent Actions: A Glimpse into Automated Task Handling
One of the most intriguing developments under the hood is the “Action” feature, which recently surfaced in the app’s code. Though still marked “coming soon” in the Labs tab, Agent Actions are designed to let Copilot autonomously handle everyday computing tasks during short, five- to ten-minute sessions. While currently limited to early internal tests, the feature appears to be aimed specifically at Windows users, in line with Microsoft’s broader ecosystem-centric approach. Access is expected to roll out gradually, possibly prioritizing Copilot Pro subscribers or select testers.
Visual Identity Evolves with Copilot Characters
Alongside functional upgrades, Microsoft is refining Copilot’s visual identity. In voice mode, Copilot’s AI characters now take over the full screen, marking a shift from the app’s previous, more compact conversational interface. A still-unnamed fourth character, resembling bubblegum or a cloud, has received design enhancements, following a similar evolution of the character Erin, whose appearance changed from lava-like to mushroom-like forms. These characters serve both as a branding element and as potential interactive avatars, though their final implementation remains under development.
Positioning Copilot at the Core of Microsoft’s AI Ambitions
These updates signal Microsoft’s continuing ambition to unify productivity, assistance, and personality in the Copilot platform. By blending functional tools like Agent Actions with creative enhancements such as GPT-4o image generation and evolving AI characters, the company is shaping Copilot into a central pillar of its future-facing AI strategy.
Source: Microsoft

