Webclaw
Web extraction engine for LLMs using TLS fingerprinting to bypass bot detection, output as markdown.
🛡️ AgentReady threat assessment
MAESTRO 7-layer threat model + OWASP AIVSS risk score for Webclaw, derived from its capabilities.
AIVSS 8.5 · High
View MAESTRO 7-layer threat model →Overview
Webclaw is a web-extraction engine for LLMs that uses TLS fingerprinting to evade bot detection and returns clean markdown for agent consumption. It fetches pages from arbitrary URLs. Bot-detection evasion is a dual-use capability, and the returned page content is untrusted input that can carry prompt injection.
Key features
- TLS-fingerprint bot evasion
- Clean markdown output
- LLM-optimized extraction
- Arbitrary URL fetching
Use cases
- Extracting content from protected sites
- Feeding web content to LLMs
- Agentic research