Designing, deploying, and defending
and defending
autonomous AI systems
Research at the intersection of agentic AI architecture, enterprise security, and human-centered system design.
Read Latest PaperAI Systems, Agents and Security
Primary technical research axis focused on agentic architectures, security red-teaming, and runtime enforcement frameworks.
Core Question
"How do we design, deploy, and defend autonomous AI systems operating in real organizations?"
Representative Topics
- Agentic architectures and LLM orchestration
- Agent-to-agent influence vectors
- Prompt injection and policy puppetry
- Runtime enforcement frameworks
- MITRE ATLAS operationalization
Output Types
Selected Work
Agentic Binary Reverse Engineering: State of the Art, Architecture, Benchmarks, Failure Modes, and Research Agenda
Agentic Patch Validation in Automated Vulnerability Repair
Sandboxing and Capability Control for Tool-Using Autonomous Agents
Tool-using LLM agent security and prompt-injection defenses
Research Integration Model
Active Research Threads
Multi-Agent Prompt Injection Chains
Indirect Injection Propagation
Orchestrator Policy Enforcement
Glitch Token Mining
Enhanced Token Validation
Embedding Cluster Analysis
Email Extraction Failure Modes
Responsible Disclosure Workflow
Multi-Agent Prompt Injection Pillar Page
Glitch Token Glossary Cluster
AI Referral Measurement Setup
llms.txt + Sitemap Foundation
Triage Rubric v1
Selected Papers
Preserving Learning in Generative AI Tutoring Systems: Pedagogical Safety, Cognitive Effort, and Adaptive Scaffolding
Agentic Binary Reverse Engineering: State of the Art, Architecture, Benchmarks, Failure Modes, and Research Agenda
Agentic Patch Validation in Automated Vulnerability Repair
Generative AI Tutors and Personalized Adaptive Learning Systems
Effects of AI Assistance on Critical Thinking and Cognitive Offloading
Tool-use reliability, function-calling robustness, and structured output enforcement
Compound AI systems and orchestration patterns for multi-step automation
Sandboxing and Capability Control for Tool-Using Autonomous Agents
Tool-using LLM agent security and prompt-injection defenses
Hardening Multi-Agent Systems Against Prompt Injection
NOW9000: A Voice-Based AI Jailbreak Game
Full-Vocabulary Glitch Token Census and ASR Validation Methodology Correction
Auditing Glitcher's ASR Validation and Mining Coverage: Deterministic Decoding Bugs and Candidate Generation Gaps in Glitch Token Discovery
Prompt Injection, Tool Hijacking, and Data Exfiltration Defenses in RAG/Agent Systems
Glitcher: Mining and Classifying Glitch Tokens in Large Language Models
Harnessing Large Language Models for Enhanced Malware Reverse Engineering
Fund Independent Research
This is self-funded, independent security research. Contributions directly support compute costs, API access, and open publication.
not configured
Background
Richards.AI is an independent research practice focused on the security, reliability, and human impact of autonomous AI systems. The work spans academic research, enterprise consulting, and open-source tooling.
Current primary focus: agent architecture security, with particular emphasis on multi-agent influence vectors, runtime enforcement frameworks, and operationalizing threat models like MITRE ATLAS for enterprise deployments.
"The three pillars are not separate silos. Security asks can we control it?Applied intelligence asks can we make it useful? And human learning askscan it genuinely improve lives? Each informs and strengthens the others."