Autonomous Video Hunter AI Agents for Real Time OSINT
Kevin Dela Rosa
Recon Village @ DEF CON 33 · Day 1 · Recon Village
In an era where video content dominates digital landscapes, a vast amount of intelligence remains "trapped" within these visual and auditory streams, largely inaccessible to traditional analytical methods. Kevin Dela Rosa, founder of Cloud Glue, presented a groundbreaking talk at Recon Village, introducing the **Autonomous Video Hunter**, an AI agent designed to revolutionize **Real-Time OSINT (Open-Source Intelligence)** extraction from video. This innovative system leverages the latest advancements in multimodal AI to transform overwhelming video data into actionable insights, addressing a critical challenge for intelligence professionals, security researchers, and even general users seeking to monitor specific events or individuals.
AI review
Competent proof-of-concept demo of LLM-orchestrated video OSINT tooling, showing real working code and sensible architecture choices. The speaker clearly built the thing himself, which puts it ahead of most AI talks, but the underlying components — Deepface, SIFT+RANSAC, Owl-ViT, Langraph — are all well-documented off-the-shelf pieces, and the orchestration layer isn't doing anything architecturally novel. Fine for Recon Village, won't make anyone's year-end list.