Tag: agentic AI safety

Browse our exclusive articles!

Jailbreak Attacks on Large Reasoning Models Using Semantic Triggers

AI News

Lazarus Omolua - April 20, 2026

Explore novel jailbreak attacks on large reasoning models via semantic triggers and psychological framing, revealing key vulnerabilities and defense needs.

Symbolic Guardrails for Safer Domain-Specific AI Agents

AI News

Lazarus Omolua - April 20, 2026

Discover how symbolic guardrails improve safety and security in domain-specific AI agents without compromising their utility or performance.

HarmfulSkillBench: Detecting Dangerous Skills in AI Agents

AI News

Lazarus Omolua - April 20, 2026

Discover how HarmfulSkillBench identifies and measures harmful skills in AI agents, enhancing safety in large language model ecosystems.

Subliminal Transfer of Unsafe Behaviors in AI Distillation

AI News

Lazarus Omolua - April 20, 2026

Explore how unsafe behaviors subliminally transfer in AI agent distillation, revealing risks beyond explicit data sanitation in model training.

IatroBench: Evidence of AI Safety Risks in Medical Advice

AI News

Lazarus Omolua - April 18, 2026

IatroBench reveals AI safety measures causing harm by withholding critical medical info, highlighting risks in AI-generated healthcare guidance.

1...293031...66 Page 30 of 66

Popular

RichlyAI Blog AI Guide, Tutorials, Industrial Insights, & more!

Company

Tag: agentic AI safety

Browse our exclusive articles!

Subscribe

About us

Company

The latest

Subscribe

RichlyAI Blog
AI Guide, Tutorials, Industrial Insights, & more!