June 1, 2026 9m read

Reducing Time-to-Protect with Cato’s Self-Evolving Vulnerability Protection Agent

Dr. Guy Waizel
Nir Manor
Roei Kriger
Matan Mittelman
Dr. Guy Waizel , Nir Manor , Roei Kriger , Matan Mittelman

Table of Contents

Wondering where to begin your SASE journey?

We've got you covered!
Listen to post:
Getting your Trinity Audio player ready...

TL;DR: In the age of frontier AI models, vulnerability discovery and exploit development are scaling faster than human defenders can manually respond. Security teams already face growing CVE volumes, shorter exploitation windows, and manual workflows for researching vulnerabilities, creating protections, validating them, and preparing them for deployment. As attackers weaponize vulnerabilities faster than organizations can patch them, time-to-protect is becoming a critical security metric.

At Cato, we built the Multi-Modal Vulnerability Protection Self-Evolving Agent to accelerate the path from CVE disclosure to customer protection. Across two pilot phases, the agent generated usable protections across multiple vulnerability classes with consistent results. In the first phase, multiple generated protections matched or exceeded protections already deployed in production. After workflow improvements, the second phase produced usable signatures for all six evaluated CVEs, with the fastest completed in 45 minutes. In this blog, we share the methodology, results, and key insights from evaluating a model-agnostic agentic workflow that improves through validation feedback, audit logs, and researcher review.

CVE Protection Agent Workflow

The goal of the workflow is to transform a newly disclosed vulnerability into a validated protection as quickly as possible while keeping security researchers in control of final decisions.

At a high level, the workflow consists of five stages and including learning and refining loop:

  • Vulnerability intelligence collection
  • Exploit analysis
  • Protection generation
  • Validation
  • Researcher review


The workflow combines multiple AI models with the ability to analyze different types of security artifacts, including vendor advisories, CVE records, reputable private and public repositories, screenshots, and technical diagrams. This allows the agent to build a broader understanding of the vulnerability before generating and validating a protection. Figure 1 presents a high-level view of the agentic CVE protection workflow, from vulnerability analysis through validation and researcher review.

Multi_Modal_CVE

Figure 1. The Multi-Modal CVE Protection high-level Agent Workflow

Multi-Model and Multi-Modal Design

The workflow is both multi-model and multi-modal. It is multi-model because it can use different AI models depending on task complexity, and multi-modal because it can analyze different types of security artifacts, including advisories, source code, screenshots, technical diagrams, and proof-of-concept exploits.

One of the key design goals was flexibility. Different stages of vulnerability research require different reasoning capabilities, making it inefficient to use the same model for every task. During our evaluation, we primarily used Claude Sonnet 4.6 and Claude Opus 4.7, while also beginning evaluations with Claude Opus 4.8 and OpenAI GPT-5.5-Cyber shortly after their release.

Tasks such as metadata extraction, advisory parsing, and workflow orchestration can be handled efficiently by lower-cost models. More complex activities, such as exploit analysis, signature generation, and validation refinement, benefit from stronger reasoning models. The architecture is intentionally model-agnostic, allowing improvements in foundation models to benefit the workflow without redesigning the underlying system.

Behind the scenes, the agent is managed by an orchestration layer that coordinates 16 steps and sub-agents across the end-to-end workflow. Each sub-agent can also trigger additional sub-agents in parallel to improve efficiency. Integrity gates validate whether each step was completed as expected, and if a step does not meet the required criteria, it is automatically re-initiated until it passes validation.

Signature generation is not performed by a generic model alone. The workflow uses specialized agents trained on Cato’s accumulated security research knowledge, detection logic, IPS signature patterns, evasion techniques, and validation experience. Each agent is designed to specialize in a specific vulnerability class, such as remote code execution, authentication bypass, path traversal, denial of service, information disclosure, or generic exploitation. This class-specific specialization helps the workflow produce stronger protections for each vulnerability type instead of relying on a one-size-fits-all approach.

The workflow is also designed to analyze information from multiple sources and formats, including CVE advisories, vendor disclosures, security research blogs, reputable private and public repositories, technical diagrams, images, and screenshots. Combining these inputs allows the agent to build a richer understanding of vulnerability behavior before generating candidate protections.

Validation is a core part of the workflow, not an afterthought. Generated protections are tested for both false positives and false negatives against recent real-world traffic data from Cato’s data warehouse. This helps verify that protections detect relevant attack permutations without blocking legitimate traffic.

Finally, the workflow improves through feedback loops generated from validation outcomes, researcher reviews, execution traces, and audit logs. These insights are used to refine prompts, skills, workflow logic, specialized agents, and model routing decisions over time.

Self-Evolving Through Operational Feedback

A key design principle of the Multi-Modal CVE Protection Agent is continuous improvement. Each time the agent completes an end-to-end workflow and generates a pull request, security researchers review the proposed protection and provide feedback when refinements are needed. Corrections can be applied manually by researchers or automatically through supporting agents.

The feedback process does not end with a single pull request. On a recurring basis, an automated learning workflow analyzes completed pull requests, reviewer comments, and resolved issues. By comparing the original agent-generated protection with the final approved version, the system identifies recurring mistakes, missing logic, and improvement opportunities. These insights are then used to update the skills, workflows, validation logic, and knowledge used by the agent, helping reduce similar errors in future runs.

As a result, the agent does not remain static. Each review cycle becomes an opportunity to improve protection quality, accelerate future investigations, and continuously refine the end-to-end vulnerability research process.

Research Method

To evaluate the effectiveness of the Multi-Modal CVE Protection Agent, we conducted a pilot study across multiple vulnerability categories. The evaluation included 20 real-world vulnerabilities spanning six major attack classes. In the first phase, we evaluated 14 real-world vulnerabilities spanning six major attack classes, using CVEs for which protections already existed. This allowed us to compare the agent-generated protections against existing production protections. In the second phase, after improving the workflow based on the first phase, we evaluated six additional CVEs without existing protections in our environment. These second-phase CVEs came from some of the same vulnerability classes, including RCE, authentication bypass, and path traversal, but were not intended to cover every class one-by-one.

Table 1 summarizes the vulnerability categories included in the pilot and the number of CVEs tested in each category.

Table 1. Vulnerability Categories Included in the Pilot Evaluation

Vulnerability Category Count of CVEs
Remote Code Execution 5
Authentication Bypass 3
Path Traversal 5
Denial of Service 2
Information Disclosure 1
General Exploitation 4

Evaluation Process

For each vulnerability, the agent analyzed the available vulnerability information, generated candidate protections, and validated the results against available telemetry and testing workflows.

The evaluation differed slightly between the two pilot phases. In the first phase, existing protections were removed from the testing environment, allowing us to compare the agent-generated protections against protections already available in production. In the second phase, the workflow was tested against vulnerabilities without pre-existing protections, allowing us to evaluate the improved workflow under fresh, real-world conditions.

Across both phases, the evaluation measured time-to-protect, protection quality, and the level of researcher intervention required.

Results

Figure 2 summarizes the first phase of the pilot evaluation, covering 14 vulnerabilities across multiple attack classes. The figure highlights execution time, lowest time per CVE, protection quality, and workflow success rates.

First_Phase_Agentic_CVE_Pipeline

Figure 2. First Phase Agentic CVE Pipeline Pilot Results

The pilot results show that the agentic workflow was able to generate protections across a diverse CVE dataset with consistent time, and quality characteristics.

Table 2 summarizes the key pilot evaluation metrics of the first phase.

Table 2: First phase Pilot Evaluation Summary

Metric Result
Vulnerabilities Evaluated 14
Pull Requests Generated 14
Successful End-to-End Runs 12
Lowest Time-to-Protect 45min

Second Pilot Results

After refining the workflow based on the first pilot phase, we evaluated the agent on six vulnerabilities without pre-existing protections.

Figure 3 summarize the results: In the second round 6-CVE pilot, our CVE agent produced usable vulnerability signatures for every case and the lowest time per CVE was 45 min.

Second_phase_Agentic_CVE_Pipeline_results

Figure 3. Second phase Agentic CVE Pipeline results

Key Findings

Fast and Efficient

Across both pilot phases, the workflow generated protections in hours rather than days. This is significant because manual protection creation typically requires hours of expert security researcher time, including vulnerability analysis, detection logic design, validation, review, and refinement. In the first phase, the fastest time-to-protect was 45min. After workflow improvements, the second phase produced usable signatures for all six evaluated CVEs, with multiple protections generated in under an hour and the fastest completed was 45 minutes as well.

High Success Rate

The workflow successfully generated usable protections for nearly all evaluated vulnerabilities. In the first phase, multiple generated protections matched or exceeded protections already deployed in production. In the second phase, the agent produced usable vulnerability signatures for all six newly disclosed CVEs.

Consistent and Predictable

Results across both phases showed repeatable behavior across multiple vulnerability classes, with execution time remaining within practical ranges rather than depending on isolated successful runs.

Safety Controls Functioned as Intended

In one case, the workflow correctly halted progress because sufficient public exploit information was unavailable. This demonstrated the importance of policy enforcement and guardrails within autonomous security workflows.

Table 3 presents the full pilot evaluation dataset and results.

Table 3: First Phase Pilot Evaluation Dataset and Results

CVE Category Model Time Outcome
CVE-2019-5418 Information Disclosure Sonnet 4.6 4h Production-quality protection
CVE-2022-25369 Authentication Bypass Sonnet 4.6 1.5h Improved detection coverage
CVE-2019-11253 Denial of Service Sonnet 4.6 2h Production-quality protection
CVE-2021-22054 SSRF Sonnet 4.6 1h Improved existing protection
CVE-2024-34351 SSRF Sonnet 4.6 3h Production-quality protection
CVE-2021-3374 Path Traversal Sonnet 4.6 1h 50m Protection generated
CVE-2022-24632 Path Traversal Opus 4.7 45m Improved detection coverage
CVE-2020-3452 File Disclosure Opus 4.7 1h 25m Improved detection coverage
CVE-2021-39226 Authentication Bypass Opus 4.7 50m Improved existing protection
CVE-2024-9264 Remote Code Execution Opus 4.7 1h 45m Production-quality protection
CVE-2020-22079 Denial of Service Opus 4.7 1h 15m Improved existing protection
CVE-2024-6670 SQL Injection Opus 4.7 1h 50m Partial chain coverage
CVE-2025-5777 Information Disclosure Opus 4.7 1h 20m Production-quality protection
CVE-2026-20079 Remote Code Execution Opus 4.7 2h 53m Protection generated

Table 4 summarizes the results from the second pilot phase.

Table 4: Second Phase Results Using the Improved Workflow

CVE-ID Product Time for PR Model
CVE-2024-36420 Flowise 55 min Opus 4.7 & Sonnet 4.6
CVE-2024-3848 MLflow 50 min Opus 4.7 & Sonnet 4.6
CVE-2026-41940 cPanel 58 min Opus 4.8 & Sonnet 4.6
CVE-2025-69971 FUXA 45 min Opus 4.7 & Sonnet 4.6
CVE-2025-29635 D-Link 75 min Opus 4.7 & Sonnet 4.6
CVE-2026-23744 MCPJam 58 min Opus 4.7 & Sonnet 4.6

Protection Quality Analysis

Beyond speed, the evaluation focused on whether the generated protections were precise, efficient, and suitable for researcher review. Across the pilot, multiple generated protections reached production-quality levels, with some requiring only limited refinement before approval.

The agent was able to account for important detection considerations such as attack variations, encoding differences, signature specificity, and false-positive reduction. This helped ensure that the generated protections were not only fast to produce, but also practical for real-world enforcement.

These findings suggest that agentic vulnerability research can contribute to faster and more efficient protection engineering while keeping researchers in control of final validation and approval.

Early Evaluation of Claude Opus 4.8

One of the advantages of a model-agnostic architecture is the ability to rapidly adopt new foundation models as they become available.

Shortly after Claude Opus 4.8 became available on May 28, 2026, we evaluated it using CVE-2026-41940, a cPanel vulnerability we previously discussed. Table 5 shows the results of this early Claude Opus 4.8 evaluation.

Table 5: Early Evaluation of Claude Opus 4.8

CVE Model Time Outcome
CVE-2026-41940 Claude Opus 4.8 58m High-quality protection on first run

The workflow generated a high-quality protection on the first run in less than an hour.

Why the Platform Matters

The effectiveness of agentic vulnerability protection is not determined solely by the underlying AI model. A model can help analyze a vulnerability or generate a candidate protection, but the real value comes from the architecture around it, orchestration, validation, feedback loops, and the ability to deliver protections quickly and safely at scale.

Cato’s cloud-native architecture enables the Multi-Modal CVE Protection Agent to move from vulnerability analysis to customer protection without requiring customer action. New protections can be validated against real-world traffic data, deployed across Cato’s global PoP network, and delivered through a unified security platform without maintenance windows, appliance updates, or manual intervention. This reinforces a broader principle we discussed in The Mythos Moment: long-term AI value comes not only from the foundation model, but from the architecture, orchestration, and feedback loops built around it.

The Mythos Moment | Read the blog

Discussion and Conclusion

The pilot and follow-up research demonstrated that agentic vulnerability research can meaningfully reduce time-to-protect across multiple vulnerability classes. The Multi-Modal CVE Protection Agent consistently generated protections with predictable execution times, reasonable costs, and strong quality outcomes. Just as important, the research showed that the bottleneck is shifting: generating protections is no longer the hardest part, validating and operationalizing them safely at scale is where platform architecture, real-world telemetry, and researcher oversight become critical.

This is no longer only a pilot. Cato researchers are already using this workflow to help deliver faster and stronger protections to customers. Human expertise remains central to the process, researchers review outcomes, validate complex cases, and make final detection decisions when needed, while the agent automates repetitive research and engineering tasks at machine scale. As foundation models continue to evolve, model-agnostic architectures like this provide a practical path for translating AI advances into faster customer protection.

Related Topics

Wondering where to begin your SASE journey?

We've got you covered!
Dr. Guy Waizel

Dr. Guy Waizel

Tech Evangelist

Dr. Guy Waizel is a Tech Evangelist at Cato Networks and a member of Cato CTRL. As part of his role, Guy collaborates closely with Cato's researchers, developers, and tech teams to bridge and evangelize tech by researching, writing, presenting, and sharing key insights, innovations, and solutions with the broader tech and cybersecurity community. Prior to joining Cato in 2025, Guy led and evangelized security efforts at Commvault, advising CISOs and CIOs on the company’s entire security portfolio. Guy also worked at TrapX Security (acquired by Commvault) in various hands-on and leadership roles, including support, incident response, forensic investigations, and product development. Guy has more than 25 years of experience spanning across cybersecurity, IT, and AI, and has held key roles at tech startups acquired by Philips, Stanley Healthcare, and Verint. Guy holds a PhD with magna cum laude honors from Alexandru Ioan Cuza University, his research thesis focused on the intersection of marketing strategies, cloud adoption, cybersecurity, and AI; an MBA from Netanya Academic College; a B.Sc. in technology management from Holon Institute of Technology; and multiple cybersecurity certifications.

Read More
Nir Manor

Nir Manor

Research Engineer

Nir Manor is a Research Engineer at Cato Networks, where he conducts cybersecurity research, investigates emerging threats and CVEs, and develops protections against them. Prior to joining Cato, Nir worked at Check Point as a Cyber Security Analyst. Nir holds a B.Sc. in Information Systems, specializing in AI and Data Science. He is currently pursuing an M.Sc. in Information Systems at the University of Haifa, where his research focuses on AI-driven approaches to cybersecurity.

Read More
Roei Kriger

Roei Kriger

Security Engineer

Roei Kriger is a security engineer at Cato Networks and member of Cato CTRL. He analyzes, researches, and develops protections against emerging threats and CVEs. Roei brings more than 3 years of experience in cybersecurity threat protection. Prior to joining Cato in 2023, Roei worked at IBM Trusteer as a cyber software developer. Roei holds a Bachelor of Science in Information Systems from Haifa University.

Read More
Matan Mittelman

Matan Mittelman

Matan Mittelman is a Threat Prevention Team Leader at Cato Networks and member of Cato CTRL. He's responsible for analyzing, researching, and developing protections against emerging threats and CVEs. Matan brings nearly 10 years of experience leading cybersecurity teams. Matan holds a Master's degree in Clinical Neuropsychology from The Hebrew University of Jerusalem and a Bachelor's degree in Psychology from Ben-Gurion University of the Negev.

Read More