Home Blog Reducing Time-to-Protect with Cato’s Self-Evolving Vulnerability Protection Agent

June 1, 2026 9m read

Reducing Time-to-Protect with Cato’s Self-Evolving Vulnerability Protection Agent

Dr. Guy Waizel , Nir Manor , Roei Kriger , Matan Mittelman

Wondering where to begin your SASE journey?

We've got you covered!

Listen to post:

Getting your Trinity Audio player ready...

TL;DR: In the age of frontier AI models, vulnerability discovery and exploit development are scaling faster than human defenders can manually respond. Security teams already face growing CVE volumes, shorter exploitation windows, and manual workflows for researching vulnerabilities, creating protections, validating them, and preparing them for deployment. As attackers weaponize vulnerabilities faster than organizations can patch them, time-to-protect is becoming a critical security metric.

At Cato, we built the Multi-Modal Vulnerability Protection Self-Evolving Agent to accelerate the path from CVE disclosure to customer protection. Across two pilot phases, the agent generated usable protections across multiple vulnerability classes with consistent results. In the first phase, multiple generated protections matched or exceeded protections already deployed in production. After workflow improvements, the second phase produced usable signatures for all six evaluated CVEs, with the fastest completed in 45 minutes. In this blog, we share the methodology, results, and key insights from evaluating a model-agnostic agentic workflow that improves through validation feedback, audit logs, and researcher review.

CVE Protection Agent Workflow

The goal of the workflow is to transform a newly disclosed vulnerability into a validated protection as quickly as possible while keeping security researchers in control of final decisions.

At a high level, the workflow consists of five stages and including learning and refining loop:

Vulnerability intelligence collection
Exploit analysis
Protection generation
Validation
Researcher review

The workflow combines multiple AI models with the ability to analyze different types of security artifacts, including vendor advisories, CVE records, reputable private and public repositories, screenshots, and technical diagrams. This allows the agent to build a broader understanding of the vulnerability before generating and validating a protection. Figure 1 presents a high-level view of the agentic CVE protection workflow, from vulnerability analysis through validation and researcher review.

Figure 1. The Multi-Modal CVE Protection high-level Agent Workflow

The workflow is both multi-model and multi-modal. It is multi-model because it can use different AI models depending on task complexity, and multi-modal because it can analyze different types of security artifacts, including advisories, source code, screenshots, technical diagrams, and proof-of-concept exploits.

One of the key design goals was flexibility. Different stages of vulnerability research require different reasoning capabilities, making it inefficient to use the same model for every task. During our evaluation, we primarily used Claude Sonnet 4.6 and Claude Opus 4.7, while also beginning evaluations with Claude Opus 4.8 and OpenAI GPT-5.5-Cyber shortly after their release.

Tasks such as metadata extraction, advisory parsing, and workflow orchestration can be handled efficiently by lower-cost models. More complex activities, such as exploit analysis, signature generation, and validation refinement, benefit from stronger reasoning models. The architecture is intentionally model-agnostic, allowing improvements in foundation models to benefit the workflow without redesigning the underlying system.

Behind the scenes, the agent is managed by an orchestration layer that coordinates 16 steps and sub-agents across the end-to-end workflow. Each sub-agent can also trigger additional sub-agents in parallel to improve efficiency. Integrity gates validate whether each step was completed as expected, and if a step does not meet the required criteria, it is automatically re-initiated until it passes validation.

Signature generation is not performed by a generic model alone. The workflow uses specialized agents trained on Cato’s accumulated security research knowledge, detection logic, IPS signature patterns, evasion techniques, and validation experience. Each agent is designed to specialize in a specific vulnerability class, such as remote code execution, authentication bypass, path traversal, denial of service, information disclosure, or generic exploitation. This class-specific specialization helps the workflow produce stronger protections for each vulnerability type instead of relying on a one-size-fits-all approach.

The workflow is also designed to analyze information from multiple sources and formats, including CVE advisories, vendor disclosures, security research blogs, reputable private and public repositories, technical diagrams, images, and screenshots. Combining these inputs allows the agent to build a richer understanding of vulnerability behavior before generating candidate protections.

Validation is a core part of the workflow, not an afterthought. Generated protections are tested for both false positives and false negatives against recent real-world traffic data from Cato’s data warehouse. This helps verify that protections detect relevant attack permutations without blocking legitimate traffic.

Finally, the workflow improves through feedback loops generated from validation outcomes, researcher reviews, execution traces, and audit logs. These insights are used to refine prompts, skills, workflow logic, specialized agents, and model routing decisions over time.

Self-Evolving Through Operational Feedback

A key design principle of the Multi-Modal CVE Protection Agent is continuous improvement. Each time the agent completes an end-to-end workflow and generates a pull request, security researchers review the proposed protection and provide feedback when refinements are needed. Corrections can be applied manually by researchers or automatically through supporting agents.

The feedback process does not end with a single pull request. On a recurring basis, an automated learning workflow analyzes completed pull requests, reviewer comments, and resolved issues. By comparing the original agent-generated protection with the final approved version, the system identifies recurring mistakes, missing logic, and improvement opportunities. These insights are then used to update the skills, workflows, validation logic, and knowledge used by the agent, helping reduce similar errors in future runs.

As a result, the agent does not remain static. Each review cycle becomes an opportunity to improve protection quality, accelerate future investigations, and continuously refine the end-to-end vulnerability research process.

Research Method

To evaluate the effectiveness of the Multi-Modal CVE Protection Agent, we conducted a pilot study across multiple vulnerability categories. The evaluation included 20 real-world vulnerabilities spanning six major attack classes. In the first phase, we evaluated 14 real-world vulnerabilities spanning six major attack classes, using CVEs for which protections already existed. This allowed us to compare the agent-generated protections against existing production protections. In the second phase, after improving the workflow based on the first phase, we evaluated six additional CVEs without existing protections in our environment. These second-phase CVEs came from some of the same vulnerability classes, including RCE, authentication bypass, and path traversal, but were not intended to cover every class one-by-one.

Table 1 summarizes the vulnerability categories included in the pilot and the number of CVEs tested in each category.

Table 1. Vulnerability Categories Included in the Pilot Evaluation

Vulnerability Category	Count of CVEs
Remote Code Execution	5
Authentication Bypass	3
Path Traversal	5
Denial of Service	2
Information Disclosure	1
General Exploitation	4

Evaluation Process

For each vulnerability, the agent analyzed the available vulnerability information, generated candidate protections, and validated the results against available telemetry and testing workflows.

The evaluation differed slightly between the two pilot phases. In the first phase, existing protections were removed from the testing environment, allowing us to compare the agent-generated protections against protections already available in production. In the second phase, the workflow was tested against vulnerabilities without pre-existing protections, allowing us to evaluate the improved workflow under fresh, real-world conditions.

Across both phases, the evaluation measured time-to-protect, protection quality, and the level of researcher intervention required.

Results

Figure 2 summarizes the first phase of the pilot evaluation, covering 14 vulnerabilities across multiple attack classes. The figure highlights execution time, lowest time per CVE, protection quality, and workflow success rates.

Figure 2. First Phase Agentic CVE Pipeline Pilot Results

The pilot results show that the agentic workflow was able to generate protections across a diverse CVE dataset with consistent time, and quality characteristics.

Table 2 summarizes the key pilot evaluation metrics of the first phase.

Table 2: First phase Pilot Evaluation Summary

Metric	Result
Vulnerabilities Evaluated	14
Pull Requests Generated	14
Successful End-to-End Runs	12
Lowest Time-to-Protect	45min

Second Pilot Results

After refining the workflow based on the first pilot phase, we evaluated the agent on six vulnerabilities without pre-existing protections.

Figure 3 summarize the results: In the second round 6-CVE pilot, our CVE agent produced usable vulnerability signatures for every case and the lowest time per CVE was 45 min.

Figure 3. Second phase Agentic CVE Pipeline results

Key Findings

Fast and Efficient

Across both pilot phases, the workflow generated protections in hours rather than days. This is significant because manual protection creation typically requires hours of expert security researcher time, including vulnerability analysis, detection logic design, validation, review, and refinement. In the first phase, the fastest time-to-protect was 45min. After workflow improvements, the second phase produced usable signatures for all six evaluated CVEs, with multiple protections generated in under an hour and the fastest completed was 45 minutes as well.

High Success Rate

The workflow successfully generated usable protections for nearly all evaluated vulnerabilities. In the first phase, multiple generated protections matched or exceeded protections already deployed in production. In the second phase, the agent produced usable vulnerability signatures for all six newly disclosed CVEs.

Consistent and Predictable

Results across both phases showed repeatable behavior across multiple vulnerability classes, with execution time remaining within practical ranges rather than depending on isolated successful runs.

Safety Controls Functioned as Intended

In one case, the workflow correctly halted progress because sufficient public exploit information was unavailable. This demonstrated the importance of policy enforcement and guardrails within autonomous security workflows.

Table 3 presents the full pilot evaluation dataset and results.

Table 3: First Phase Pilot Evaluation Dataset and Results

CVE	Category	Model	Time	Outcome
CVE-2019-5418	Information Disclosure	Sonnet 4.6	4h	Production-quality protection
CVE-2022-25369	Authentication Bypass	Sonnet 4.6	1.5h	Improved detection coverage
CVE-2019-11253	Denial of Service	Sonnet 4.6	2h	Production-quality protection
CVE-2021-22054	SSRF	Sonnet 4.6	1h	Improved existing protection
CVE-2024-34351	SSRF	Sonnet 4.6	3h	Production-quality protection
CVE-2021-3374	Path Traversal	Sonnet 4.6	1h 50m	Protection generated
CVE-2022-24632	Path Traversal	Opus 4.7	45m	Improved detection coverage
CVE-2020-3452	File Disclosure	Opus 4.7	1h 25m	Improved detection coverage
CVE-2021-39226	Authentication Bypass	Opus 4.7	50m	Improved existing protection
CVE-2024-9264	Remote Code Execution	Opus 4.7	1h 45m	Production-quality protection
CVE-2020-22079	Denial of Service	Opus 4.7	1h 15m	Improved existing protection
CVE-2024-6670	SQL Injection	Opus 4.7	1h 50m	Partial chain coverage
CVE-2025-5777	Information Disclosure	Opus 4.7	1h 20m	Production-quality protection
CVE-2026-20079	Remote Code Execution	Opus 4.7	2h 53m	Protection generated

Table 4 summarizes the results from the second pilot phase.

Table 4: Second Phase Results Using the Improved Workflow

CVE-ID	Product	Time for PR	Model
CVE-2024-36420	Flowise	55 min	Opus 4.7 & Sonnet 4.6
CVE-2024-3848	MLflow	50 min	Opus 4.7 & Sonnet 4.6
CVE-2026-41940	cPanel	58 min	Opus 4.8 & Sonnet 4.6
CVE-2025-69971	FUXA	45 min	Opus 4.7 & Sonnet 4.6
CVE-2025-29635	D-Link	75 min	Opus 4.7 & Sonnet 4.6
CVE-2026-23744	MCPJam	58 min	Opus 4.7 & Sonnet 4.6

Protection Quality Analysis

Beyond speed, the evaluation focused on whether the generated protections were precise, efficient, and suitable for researcher review. Across the pilot, multiple generated protections reached production-quality levels, with some requiring only limited refinement before approval.

The agent was able to account for important detection considerations such as attack variations, encoding differences, signature specificity, and false-positive reduction. This helped ensure that the generated protections were not only fast to produce, but also practical for real-world enforcement.

These findings suggest that agentic vulnerability research can contribute to faster and more efficient protection engineering while keeping researchers in control of final validation and approval.

Early Evaluation of Claude Opus 4.8

One of the advantages of a model-agnostic architecture is the ability to rapidly adopt new foundation models as they become available.

Shortly after Claude Opus 4.8 became available on May 28, 2026, we evaluated it using CVE-2026-41940, a cPanel vulnerability we previously discussed. Table 5 shows the results of this early Claude Opus 4.8 evaluation.

Table 5: Early Evaluation of Claude Opus 4.8

CVE	Model	Time	Outcome
CVE-2026-41940	Claude Opus 4.8	58m	High-quality protection on first run

The workflow generated a high-quality protection on the first run in less than an hour.

Why the Platform Matters

The effectiveness of agentic vulnerability protection is not determined solely by the underlying AI model. A model can help analyze a vulnerability or generate a candidate protection, but the real value comes from the architecture around it, orchestration, validation, feedback loops, and the ability to deliver protections quickly and safely at scale.

Cato’s cloud-native architecture enables the Multi-Modal CVE Protection Agent to move from vulnerability analysis to customer protection without requiring customer action. New protections can be validated against real-world traffic data, deployed across Cato’s global PoP network, and delivered through a unified security platform without maintenance windows, appliance updates, or manual intervention. This reinforces a broader principle we discussed in The Mythos Moment: long-term AI value comes not only from the foundation model, but from the architecture, orchestration, and feedback loops built around it.

The Mythos Moment | Read the blog

Discussion and Conclusion

The pilot and follow-up research demonstrated that agentic vulnerability research can meaningfully reduce time-to-protect across multiple vulnerability classes. The Multi-Modal CVE Protection Agent consistently generated protections with predictable execution times, reasonable costs, and strong quality outcomes. Just as important, the research showed that the bottleneck is shifting: generating protections is no longer the hardest part, validating and operationalizing them safely at scale is where platform architecture, real-world telemetry, and researcher oversight become critical.

This is no longer only a pilot. Cato researchers are already using this workflow to help deliver faster and stronger protections to customers. Human expertise remains central to the process, researchers review outcomes, validate complex cases, and make final detection decisions when needed, while the agent automates repetitive research and engineering tasks at machine scale. As foundation models continue to evolve, model-agnostic architectures like this provide a practical path for translating AI advances into faster customer protection.

Dr. Guy Waizel

Tech Evangelist

Dr. Guy Waizel is a Tech Evangelist at Cato Networks and a member of Cato CTRL. As part of his role, Guy collaborates closely with Cato's researchers, developers, and tech teams to bridge and evangelize tech by researching, writing, presenting, and sharing key insights, innovations, and solutions with the broader tech and cybersecurity community. Prior to joining Cato in 2025, Guy led and evangelized security efforts at Commvault, advising CISOs and CIOs on the company’s entire security portfolio. Guy also worked at TrapX Security (acquired by Commvault) in various hands-on and leadership roles, including support, incident response, forensic investigations, and product development. Guy has more than 25 years of experience spanning across cybersecurity, IT, and AI, and has held key roles at tech startups acquired by Philips, Stanley Healthcare, and Verint. Guy holds a PhD with magna cum laude honors from Alexandru Ioan Cuza University, his research thesis focused on the intersection of marketing strategies, cloud adoption, cybersecurity, and AI; an MBA from Netanya Academic College; a B.Sc. in technology management from Holon Institute of Technology; and multiple cybersecurity certifications.

Nir Manor

Research Engineer

Nir Manor is a Research Engineer at Cato Networks, where he conducts cybersecurity research, investigates emerging threats and CVEs, and develops protections against them. Prior to joining Cato, Nir worked at Check Point as a Cyber Security Analyst. Nir holds a B.Sc. in Information Systems, specializing in AI and Data Science. He is currently pursuing an M.Sc. in Information Systems at the University of Haifa, where his research focuses on AI-driven approaches to cybersecurity.

Roei Kriger

Security Engineer

Roei Kriger is a security engineer at Cato Networks and member of Cato CTRL. He analyzes, researches, and develops protections against emerging threats and CVEs. Roei brings more than 3 years of experience in cybersecurity threat protection. Prior to joining Cato in 2023, Roei worked at IBM Trusteer as a cyber software developer. Roei holds a Bachelor of Science in Information Systems from Haifa University.

Matan Mittelman

Matan Mittelman is a Threat Prevention Team Leader at Cato Networks and member of Cato CTRL. He's responsible for analyzing, researching, and developing protections against emerging threats and CVEs. Matan brings nearly 10 years of experience leading cybersecurity teams. Matan holds a Master's degree in Clinical Neuropsychology from The Hebrew University of Jerusalem and a Bachelor's degree in Psychology from Ben-Gurion University of the Negev.

AI Security

Cato CTRL™ Threat Research: Threat Actors Abuse Simplified AI to Steal Microsoft 365 Credentials

Dr. Guy Waizel

Tech Evangelist

Executive Summary AI marketing platforms have exploded in popularity, becoming everyday tools for creative teams in enterprises worldwide. Platforms like Simplified AI offer marketers the ability to generate content, clips, and campaigns at scale. For CISOs and IT leaders, approving such services often seems straightforward: allow...

Read

AI Security

Cato CTRL™ Threat Research: PoC Attack Targeting Atlassian’s Model Context Protocol (MCP) Introduces New “Living Off AI” Risk

Dr. Guy Waizel

Tech Evangelist

Executive Summary Most organizations assume a clear boundary between external users, who submit support tickets or service requests, and internal users, who handle them using privileged access. However, when an internal user triggers an AI action from a model context protocol (MCP) tool, such as summarizing...

Read

AI Security

Cato CTRL™ Threat Research: When OpenClaw, Your AI Personal Assistant, Becomes the Backdoor

Vitaly Simonovich

Senior Security Researcher

TL;DR Executive Summary Cato CTRL’s Vitaly Simonovich (senior security researcher) has identified a threat actor selling root shell access to a UK-based automation company through a compromised AI personal assistant based on OpenClaw. The listing, which was posted on February 22, 2026 by a user called...

Read

Join the fastest-growing SASE channel ecosystem

Reducing Time-to-Protect with Cato’s Self-Evolving Vulnerability Protection Agent

Table of Contents

Wondering where to begin your SASE journey?

CVE Protection Agent Workflow

Self-Evolving Through Operational Feedback