§ 02 — Projects

Tools & Research

Open-source security tools, research projects, and offensive methodologies. Everything here is built to be used, not just demonstrated.

01
Featured

Cosmos

AI-powered application security testing product at Bishop Fox. Autonomous agents that test entire application portfolios at scale, finding vulnerabilities across web apps that manual testing misses. Shipped to production, used by enterprise customers.

AI productappsecagentic
P01
02

Arbiter

AI judge agent for NEBULA:FOG 2026. Watches hackathon demos via Gemini Live, scores with a multi-model ensemble (Claude + Gemini + Groq), defends against prompt injection in 7 languages. Judged 25 live demos. 1,451 tests passing.

AI judgeprompt injectionClaude
P02
03

Starlog

Expert-curated deep dives on offensive security tools and AI agents. CLI pipeline powered by Claude that ingests GitHub stars, analyzes repos, and generates long-form articles autonomously. Live at starlog.is.

publicationClaudeagentic
P03
04

LLM Testing Findings

Open-source templates for documenting vulnerabilities in LLM integrations. Curated list of every open-source LLM testing tool. 74 stars, community standard.

AI/LLMmethodologyopen-source
P04
05

METR Cybersecurity Benchmark

Contributed the 'rcrce' cybersecurity challenge to METR's HCAST benchmark for evaluating autonomous AI agents. A 2.8-hour PHP race condition exploit task requiring RCE and flag retrieval. Zero AI agents have solved it. Built to the METR Task Standard for measuring frontier model capabilities.

AI safetybenchmarksMETR
P05
06

my-precious-pii

AI safety research: GPT-2 model trained on fake PII to study data leakage from language models. CTF-style challenge where participants extract synthetic PII using prompt injection and NLP techniques. Demonstrates real risks in model memorization.

AI safetyPII leakagemodel security
P06
07

SmogCloud

Find cloud assets that no one wants exposed. Discovers internet-facing AWS resources across 14 services. 348 stars. Used by security engineers and pentesters worldwide.

AWScloudGo
P07
08

Google Hacking Diggity Project

The search engine hacking toolkit. GoogleDiggity, BingDiggity, SHODANDiggity, and more. Created the Bing Hacking Database. Started it all.

OSINTreconclassic
P08