CyberCane: Privacy-Preserving Phishing Detection with Ontology

CyberCane: A Breakthrough in Privacy-Preserving Phishing Detection

In an era where cybersecurity is paramount, the need for effective phishing detection systems has never been more pressing. The introduction of CyberCane, a neuro-symbolic framework designed for privacy-critical domains, promises to tackle the complexities of phishing detection while adhering to stringent privacy regulations. This innovative approach was detailed in a recent paper published on arXiv (arXiv:2604.23563v1) and presents a compelling solution for organizations grappling with the dual challenges of false positives and sensitive data exposure.

Challenges in Current Phishing Detection Systems

Phishing detection systems face several contradictory constraints that complicate their effectiveness:

Near-zero false positives: Preventing disruption in workflow is critical, particularly in environments with non-technical staff.
Transparent explanations: Non-expert staff must understand detection mechanisms to trust and comply with them.
Regulatory compliance: Many sectors are bound by regulations preventing sensitive data from being exposed to external APIs.
Robustness against AI-generated attacks: As phishing techniques evolve, so must detection systems.

Traditional rule-based systems struggle with novel phishing campaigns due to their rigid nature. Conversely, large language model (LLM)-based detectors often violate privacy regulations by transmitting unredacted data, highlighting the urgent need for a more sophisticated approach.

The CyberCane Framework

CyberCane addresses these challenges through a neuro-symbolic framework that combines deterministic symbolic analysis with privacy-preserving retrieval-augmented generation (RAG). This dual-phase pipeline operates as follows:

Lightweight Symbolic Rules: The first phase applies symbolic rules to email metadata, filtering out obvious phishing attempts.
Semantic Classification: For borderline cases, the system escalates to a semantic classification stage utilizing RAG, which includes automated sensitive data redaction.
Phishing-only Corpus Retrieval: Information is retrieved from a dedicated phishing-only corpus, minimizing the risk of exposing sensitive data.

Introducing PhishOnt

To enhance its detection capabilities, CyberCane introduces PhishOnt, an OWL ontology that facilitates verifiable attack classification through formal reasoning chains. This addition not only strengthens the detection process but also provides organizations with a robust framework for understanding and classifying phishing threats.

Evaluation and Results

CyberCane has been rigorously tested on the DataPhish2025 dataset, which comprises 12,300 emails that include a mix of human-generated and AI-generated content. The results indicate a significant improvement in performance:

Recall Gain: CyberCane demonstrated a 78.6-point recall gain over traditional symbolic-only detection methods, particularly against AI-generated threats.
High Precision: The framework achieved a precision rate exceeding 98%, with a false positive rate (FPR) as low as 0.16%.

These results suggest that CyberCane is not only effective but also reliable, making it well-suited for deployment in sensitive environments such as healthcare, where the projected return on investment (ROI) is estimated at 542 times the initial investment.

Conclusion and Open Source Implementation

CyberCane represents a significant advancement in the field of phishing detection, effectively balancing the need for security, privacy, and compliance. With its open-source implementation available at GitHub, organizations can explore this innovative solution and tailor it to their specific risk tolerances, paving the way for a safer digital landscape.

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

CyberCane: Privacy-Preserving Phishing Detection with Ontology

CyberCane: A Breakthrough in Privacy-Preserving Phishing Detection

Challenges in Current Phishing Detection Systems

The CyberCane Framework

Introducing PhishOnt

Evaluation and Results

Conclusion and Open Source Implementation

Related AI Insights

Subscribe

More like thisRelated

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!

More like this
Related