
Redazione RHC : 3 November 2025 11:23
OpenAI has unveiled Aardvark, an autonomous assistant based on the GPT-5 model , designed to automatically find and fix vulnerabilities in software code. This AI tool, dubbed a “security research agent,” is already in beta testing and will enable rapid identification and resolution of potential issues in repositories.
According to the description, Aardvark monitors source code changes in real time, analyzes commits, and determines which ones might pose a threat. Based on this analysis, the system creates its own threat model, aligned with the project’s architecture and goals.
It then analyzes the change history and tests new fragments, attempting to reproduce potential vulnerabilities in an isolated environment. If a problem is confirmed, a patch is generated using the OpenAI Codex and then submitted for human review.
Aardvark is based on GPT-5, introduced in August 2025, a model with improved logic and modular routing that allows it to dynamically select the appropriate operating mode based on the task. This allows the agent to not only identify vulnerabilities, but also assess their severity, prioritize them, and recommend targeted solutions.
According to OpenAI, internal testing on its own repositories and those of several partners has already yielded results: at least ten CVEs have been identified in open source projects. This demonstrates the tool’s potential as a constant gatekeeper during development, preventing delays in the release of new features.
Aardvark ‘s development is paralleled by similar initiatives from other companies. For example, Google recently announced CodeMender , an artificial intelligence system capable of identifying, rewriting, and fixing vulnerable code to prevent further attacks. This technology will be integrated into major open source projects with the participation of their developers.
Amid the competition between solutions like Aardvark, CodeMender, and XBOW , a new class of tools is emerging for continuous code review, exploitability testing, and patch generation. The release of Aardvark also coincided with the recent launch of the gpt-oss-safeguard models, specifically trained for security-based classification tasks.
Redazione