
OpenAI pushes agents from ephemeral assistants to persistent workers with memory, shells, and Skills
Recommended for you

MiniMax’s M2.5 slashes AI costs and reframes models as persistent workers
Shanghai startup MiniMax unveiled M2.5 in two flavors, claiming near-state-of-the-art accuracy while cutting consumption costs dramatically and enabling sustained, low-cost agent deployments. The release couples a sparse Mixture-of-Experts design and a proprietary RL training loop with aggressive pricing, but licensing and weight availability remain unresolved.

Anthropic’s Claude Code Adds Persistent Tasks to Turn Agents into Project Managers
Anthropic updated Claude Code with a persistent Task primitive that moves project state out of ephemeral chat and onto durable, filesystem-backed artifacts, enabling cross-session coordination, CI-friendly runs, and stronger dependency enforcement. The change arrives alongside rising integration work (examples include Asana-style connectors that bind agents to real project data and permission models), making agent durability and governance primitives timely for teams adopting AI-driven pipelines.
OpenAI Internal Data Assistant Scales Analytics Across Teams
OpenAI built an internal natural-language data assistant that turns prompts into charts, dashboards and written analyses in minutes. Two engineers shipped the tool in three months using roughly 70% Codex-generated code, and the company now uses it broadly to compress analyst workflows. The project both exemplifies and benefits from emerging platform primitives (persistent state, hosted runtimes, Skills) that enable agentic workflows, but realizing the productivity gains at scale requires disciplined data governance, provenance, and runtime safety to avoid errors, leakage, or vendor lock-in.
OpenAI Acquires Promptfoo to Harden AI-Agent Security
OpenAI bought Promptfoo to embed prompt- and agent-testing into its Frontier and agent orchestration tooling, accelerating in-house validation while heightening concerns about shrinking vendor-neutral red-team capacity and multi-vendor procurement dynamics in enterprise and defense.

Apple integrates agentic AI into Xcode 26.3 with Anthropic and OpenAI support
Apple’s Xcode 26.3 Release Candidate embeds agent-capable workflows that let MCP-compatible agents from Anthropic and OpenAI operate inside the IDE, inspecting projects, editing code and running tests while developers keep visibility and control. The move arrives alongside vendor launches (notably OpenAI’s new Codex macOS client) that preserve long-running agent context and modular skills — underscoring a market shift toward orchestration, UX and governance as the decisive factors for adoption.
GitHub expands Agent HQ to host Anthropic’s Claude and OpenAI’s Codex inside developer workflows
GitHub has added Anthropic’s Claude and OpenAI’s Codex as selectable coding agents inside Copilot interfaces for Copilot Pro Plus and Enterprise subscribers, integrating agent choice directly into issues, PRs and editor workflows. The move aligns with a broader industry shift toward embeddable agent orchestration (Copilot SDK, MCP-enabled tooling and native clients) and raises new operational priorities around billing, grounding, auditability and vendor comparison.
OpenAI debuts Frontier to integrate AI agents across enterprise systems
OpenAI launched Frontier, a platform that lets AI agents access and act across internal corporate systems and data to simplify enterprise deployment and management. The move mirrors an industry shift toward multi-agent, platform-level orchestration — but adoption will hinge on clear governance, security guarantees and pricing.

OpenAI Builds Bidirectional Audio Model to Power Voice Assistants
OpenAI has developed a bidirectional audio model that listens and replies within a single conversational turn, aiming to reduce latency for voice assistants and enable on‑device deployment. The work comes as competitors, strategic cloud partners and defense customers all jockey for access, distribution and governance, raising questions about licensing, privacy and hardware integration.