News from the AI & ML world

DeeperML - #softwareengineering

@www.marktechpost.com //
AI agents are rapidly transforming software engineering workflows, offering increased efficiency and accessibility. Mistral AI has launched its Agents API, a platform designed to enable developers to integrate autonomous, generative AI capabilities into existing applications. This API allows for the creation of AI agents capable of performing tasks such as running Python code securely, generating images, and performing retrieval-augmented generation (RAG). These agents can access real-time information from the web and utilize user-provided document libraries, significantly enhancing their ability to provide accurate and up-to-date responses.

Designed to complement Mistral’s existing Chat Completion API, the Agents API focuses on agentic orchestration, built-in connectors, and persistent memory. The API is equipped with several built-in connectors, including Code Execution, Image Generation, Document Library, and Web Search. This flexibility allows for the coordination of multiple AI agents to tackle complex tasks, surpassing the limitations of traditional language models by enabling them to perform real-world tasks and maintain conversational context over time.

The rise of AI agents is also changing the economics of software engineering. The emergence of "cheap SWE agents" is enabling teams with more millions in ARR than employees. Tools like GitHub Copilot, Claude Code, and OpenAI Codex are democratizing the field, making software development accessible to individuals without a technical background. These agents are also improving developer productivity and code quality.

Recommended read:
References :
  • AI News | VentureBeat: Mistral launches API for building AI agents that run Python, generate images, perform RAG and more
  • www.marktechpost.com: Mistral Launches Agents API: A New Platform for Developer-Friendly AI Agent Creation

Sean Michael@AI News | VentureBeat //
Windsurf has launched SWE-1, a family of AI models specifically designed for the entire software engineering process, marking a departure from traditional AI coding tools. The company aims to accelerate software development by 99% by optimizing for the complete engineering workflow, encompassing tasks beyond just code generation. According to Windsurf co-founder Anshul Ramachandran, the SWE-1 initiative was born from the realization that "Writing code is just a fraction of what engineers do. A ‘coding-capable’ model won’t cut it." The SWE-1 family includes SWE-1, SWE-1-lite, and SWE-1-mini, each tailored for different use cases within the software development lifecycle.

SWE-1 represents Windsurf's entry into frontier model development, boasting performance comparable to Claude 3.5 Sonnet in key human-in-the-loop tasks. Internal benchmarks indicate that SWE-1 demonstrates higher engagement, better retention, and more trusted outputs compared to Windsurf's previous Cascade Base model. SWE-1-lite is replacing Cascade Base for all users, while SWE-1-mini powers the predictive Windsurf Tab experience. The models are already live inside Windsurf’s dev surfaces and are available to users.

Windsurf emphasizes "flow awareness" as a key innovation, enabling the AI system to understand and operate within the complete timeline of development work. This stems from the company’s experience with its Windsurf Editor, which facilitates collaboration between humans and AI. By owning every layer of the software development process, from model inference to interface design, Windsurf aims to provide cost savings and improved performance to its users. The company's approach highlights a fundamental shift in AI assistance for developers, focusing on the entire software engineering workflow rather than just coding tasks.

Recommended read:
References :
  • devops.com: Windsurf Launches SWE-1: AI Models Built for the Entire Software Engineering Process
  • AI News | VentureBeat: Software engineering-native AI models have arrived: What Windsurf’s SWE-1 means for technical decision-makers
  • Maginative: Windsurf Launches SWE-1, Homegrown AI Models for Software Engineering

Ross Kelly@Latest from ITPro //
OpenAI has launched Codex, a cloud-based AI agent designed to revolutionize software engineering. Integrated within ChatGPT for Pro, Team, and Enterprise users, Codex allows developers to delegate tasks such as writing features, fixing bugs, and suggesting pull requests. This agentic AI tool leverages codex-1, a specialized version of OpenAI's o3 reasoning model, optimized for coding tasks. The aim is to align its outputs closely with human coding preferences and standards, resulting in cleaner, more efficient code generation.

Codex distinguishes itself by operating in parallel cloud sandboxes, preloaded with the user's codebase. This enables developers to run multiple tasks simultaneously without disrupting their local environments. Each task is processed independently, allowing for efficient delegation of coding operations. The tool can read and edit files, run tests, execute commands like linters and type checkers, and log its results, with task completion typically ranging from 1 to 30 minutes depending on complexity.

The AI agent also provides verifiable evidence of its actions through citations of terminal logs and test outputs, allowing developers to trace each step taken during task completion. Users can review the results, request revisions, open a GitHub pull request, or directly integrate the changes into their local environment. OpenAI highlights that internal use of Codex has already significantly reduced project timelines for its API team, suggesting a substantial boost in productivity and a shift towards a new paradigm in software development. The company trained Codex using reinforcement learning on real-world software development tasks, aiming to replicate the human coding style and preferences for pull requests.

Recommended read:
References :
  • Latest from ITPro in News: OpenAI just launched 'Codex', a new AI agent for software engineering
  • Analytics Vidhya: Meet OpenAI Codex: Cloud-based Software Engineering Agent
  • MarkTechPost: AI Agents Now Write Code in Parallel: OpenAI Introduces Codex, a Cloud-Based Coding Agent Inside ChatGPT
  • THE DECODER: OpenAI launches Codex: Autonomous AI agents for software development
  • AI News | VentureBeat: OpenAI has launched a research preview of its new coding assistant, called  . A tool designed to help seasoned developers hand off repetitive programming chores to an AI that not only writes usable code but also explains each decision it makes in the process.
  • www.marktechpost.com: OpenAI has introduced Codex, a cloud-native software engineering agent integrated into ChatGPT, signaling a new era in AI-assisted software development.
  • TestingCatalog: OpenAI rolled out Codex for automated coding tasks in ChatGPT Pro
  • the-decoder.com: OpenAI launches Codex: Autonomous AI agents for software development

Sean Michael@AI News | VentureBeat //
Windsurf, an AI coding startup reportedly on the verge of being acquired by OpenAI for a staggering $3 billion, has just launched SWE-1, its first in-house small language model specifically tailored for software engineering. This move signals a shift towards software engineering-native AI models, designed to tackle the complete software development workflow. Windsurf aims to accelerate software engineering with SWE-1, not just coding.

The SWE-1 family includes models like SWE-1-lite and SWE-1-mini, designed to perform tasks beyond generating code. Unlike general-purpose AI models adapted for coding, SWE-1 is built to address the entire spectrum of software engineering activities, including reviewing, committing, and maintaining code over time. Built to run efficiently on consumer hardware without relying on expensive cloud infrastructure, the models offer developers the freedom to adapt them as needed under a permissive license.

SWE-1's key innovation lies in its "flow awareness," which enables the AI to understand and operate within the complete timeline of development work. Windsurf users have given the company feedback that existing coding models tend to do well with user guidance, but over time tend to miss things. The new models aim to support developers through multiple surfaces, incomplete work states and long-running tasks that characterize real-world software development.

Recommended read:
References :
  • Shelly Palmer: Windsurf, the AI coding startup that is reportedly in the process of being acquired by OpenAI for $3 billion, just launched SWE-1: its first in-house small language model designed specifically for software engineering.
  • AI News | VentureBeat: Windsurf's new SWE-1 AI models tackle the complete software engineering workflow, potentially reducing development cycles and technical debt.
  • Maginative: Windsurf launches SWE-1, its in-house, vertically integrated model family built specifically for software engineering—not just coding.
  • devops.com: Windsurf has unveiled its first family of specialized models designed to transform developers’ work in a significant development for AI-assisted software engineering.
  • shellypalmer.com: Windsurf, the AI coding startup that is reportedly in the process of being acquired by OpenAI for $3 billion, just launched SWE-1: its first in-house small language model designed specifically for software engineering.
  • MarkTechPost: Windsurf Launches SWE-1: A Frontier AI Model Family for End-to-End Software Engineering
  • www.marktechpost.com: Windsurf Launches SWE-1: A Frontier AI Model Family for End-to-End Software Engineering
  • computational-intelligence.blogspot.com: Windsurf Launches SWE-1, Homegrown AI Models for Software Engineering
  • TestingCatalog: Discover Windsurf's new Wave 9 SWE-1 AI model, optimised for real-time, on-device applications. Enjoy low-latency performance on mobile.

Ross Kelly@Latest from ITPro //
OpenAI has launched Codex, a new AI agent designed for software engineering, integrated within ChatGPT. This cloud-based coding agent represents a significant advancement in AI-assisted software development, going beyond simple code completion to autonomously perform various programming tasks. Codex is built upon codex-1, a fine-tuned version of OpenAI's reasoning model, specifically optimized for software engineering workflows. It enables users to delegate tasks such as writing features, fixing bugs, answering questions about the codebase, and proposing pull requests, with each task running in its own cloud sandbox environment preloaded with the repository.

The Codex agent is accessible through the ChatGPT interface and is available to Pro, Team, and Enterprise users, with broader access planned. Developers can interact with Codex by typing simple prompts, and the agent will handle the coding behind the scenes, surfacing results for review and feedback. This integration allows for parallel tasking, enabling users to delegate different coding operations without disrupting their local development environment. The activities of the tool can also be monitored in real-time and upon completion, Codex provides verifiable evidence of its actions, including citations of terminal logs and test outputs.

Sam Altman, OpenAI's CEO, has expressed an ambition for OpenAI to become the "Microsoft of AI," envisioning a subscription-based operating system built on ChatGPT. The company could develop a core AI subscription, featuring ChatGPT's user experience, as well as surfaces like future devices, similar to operating systems. According to one user who has used Codex internally for a few months, Codex has significantly reduced the time it takes to complete projects, stating that "software engineering will truly never be the same".

Recommended read:
References :
  • bsky.app: i’ve used codex internally for a few months and have cut days or weeks off several projects on the API team. software engineering will truly never be the same https://openai.com/index/introducing-codex/
  • Latest from ITPro in News: OpenAI just launched 'Codex', a new AI agent for software engineering
  • AI News | VentureBeat: OpenAI's new coding agent, Codex, is available as a research preview for ChatGPT Pro, Enterprise, and Team users.
  • MarkTechPost: OpenAI introduces Codex, a cloud-based coding agent inside ChatGPT, signaling a new era in AI-assisted software development.
  • AI News | VentureBeat: OpenAI brings GPT-4.1 and 4.1 mini to ChatGPT — what enterprises should know
  • github.com: The OpenAI's Codex product documentation.
  • www.analyticsvidhya.com: OpenAI released Codex, a cloud‑native software agent designed to work alongside developers. Codex is not a single product but a family of agents powered by codex‑1, OpenAI’s […] The post appeared first on .
  • Latent.Space: ChatGPT Codex is here - the first cloud hosted Autonomous Software Engineer (A-SWE) from OpenAI. Josh Ma and Alexander Embiricos tell us how to WHAM every codebase like a power user.
  • www.marktechpost.com: OpenAI Introduces Codex, a Cloud-Based Coding Agent Inside ChatGPT
  • BetaNews: Codex, OpenAI's new coding agent, is now available in ChatGPT.
  • THE DECODER: OpenAI is rolling out Codex, a cloud-based AI agent for software development that automates tasks like bug fixes and feature implementation.
  • Analytics Vidhya: OpenAI released Codex, a cloud‑native software agent designed to work alongside developers.
  • the-decoder.com: The Decoder's report on OpenAI's Codex launch.
  • SiliconANGLE: OpenAI updates ChatGPT with coding-optimized Codex AI agent
  • Last Week in AI: Last Week in AI #309 - OpenAI keeps non-profit & launches Codex, AlphaEvolve, and more!
  • Maginative: Meet Codex: OpenAI’s New Software Engineering AI Agent
  • TestingCatalog: Discover OpenAI Codex, a cloud-based AI agent for automating coding tasks. Available for ChatGPT Pro, Team and Enterprise users now.
  • TestingCatalog: OpenAI prepares SWE Agent that answers code questions and drafts PR
  • pub.towardsai.net: AI-assisted code generation can help improve efficiency and reduce errors in the development process, but experts warn that it is not a replacement for human programmers.
  • The Tech Basic: OpenAI’s New Codex AI Helps Write Code Faster in ChatGPT
  • Runtime: Article about OpenAI's coding tool.
  • devops.com: OpenAI's Codex transforms software development with cloud-based AI agents that can tackle multiple coding tasks simultaneously, enhancing developer productivity.
  • Ars OpenForum: OpenAI introduces Codex, its first full-fledged AI agent for coding. It replicates your development environment and takes up to 30 minutes per task.
  • www.eweek.com: OpenAI’s Codex agent helps developers write code, fix bugs, and test features—all from ChatGPT. Early adopters include Cisco, Temporal, and Superhuman.
  • www.infoworld.com: OpenAI has announced the release of Codex, an AI coding agent it said was designed to help software engineers write code, fix bugs, and run tests.
  • eWEEK: OpenAI Debuts Codex AI Agent for Developers: ‘Like a Remote Teammate’
  • www.infoq.com: OpenAI Launches Codex Software Engineering Agent Preview
  • Ken Yeung: The New GitHub Copilot Agent Doesn’t Just Help You Code—it Codes for You
  • pub.towardsai.net: TAI #153: AlphaEvolve & Codex — AI Breakthroughs in Algorithm Discovery & Software Engineering

@www.pcworld.com //
References: Entrepreneur , PCWorld
Microsoft CEO Satya Nadella revealed that a significant portion of the company's new code is being written by artificial intelligence. Nadella stated at Meta's LlamaCon conference that approximately 20% to 30% of Microsoft's code is now generated by AI. This marks a significant shift in software development practices, with AI tools becoming increasingly integrated into the coding process. He also noted the increased usage of AI agents for code reviews, highlighting AI's role in enhancing efficiency and productivity within the company's software engineering workflows.

This trend isn't exclusive to Microsoft. Google CEO Sundar Pichai has also indicated that a substantial percentage of Google's new code is being written by AI, exceeding 30%. Meta, under the leadership of Mark Zuckerberg, is also aggressively pursuing AI-driven software development, with plans to have AI handle half of its software development within the next year. These developments signify a broader industry movement towards leveraging AI to automate and augment coding tasks, transforming how software is created.

The rise of AI in code development and review has significant implications for software engineers. While some experts, like Microsoft CTO Kevin Scott, predict that AI could write as much as 95% of code within the next five years, others view it as an opportunity to enhance productivity and focus on more complex, creative tasks. The transition to AI-assisted coding requires software engineers to adapt and learn how to effectively collaborate with AI tools. These technologies, like Microsoft's Copilot, powered by OpenAI's ChatGPT, are increasingly vital for businesses aiming to drive innovation and efficiency through AI-driven solutions.

Recommended read:
References :
  • Entrepreneur: AI Is Already Writing About 30% of Code at Microsoft and Google. Here's What It Means for Software Engineers.
  • PCWorld: Microsoft CEO claims 30% of its new code is written by AI