Define a JSON schema, send a URL. Get clean, typed data back — powered by your own LLM key. No more brittle CSS selectors.
Define what you want. Purify + your LLM extracts it from any page.
curl -X POST https://purify.verifly.pro/api/v1/extract \ -H "Authorization: Bearer YOUR_API_KEY" \ -H "Content-Type: application/json" \ -d '{ "url": "https://news.ycombinator.com", "schema": { "type": "array", "items": { "type": "object", "properties": { "title": { "type": "string" }, "url": { "type": "string" }, "points": { "type": "number" }, "comments": { "type": "number" } } } }, "llm_api_key": "sk-your-openai-key" }'{ "success": true, "data": [ { "title": "Show HN: Purify – Web to Markdown API", "url": "https://github.com/Easonliuliang/purify", "points": 342, "comments": 89 }, { "title": "Why LLMs need clean input data", "url": "https://example.com/llm-data", "points": 256, "comments": 67 } ], "processing_time_ms": 1240}From RAG pipelines to autonomous agents — structured extraction powers your data layer.
Extract structured content from documentation, articles, and knowledge bases for your retrieval-augmented generation system.
Give your agents the ability to understand and extract data from any website, not just APIs with JSON endpoints.
Augment your CRM, product database, or research corpus with structured web data at scale.
Everything you need to turn unstructured web pages into structured data.
Define your JSON schema, Purify + your LLM extracts exactly the fields you need.
Use your own OpenAI, Anthropic, or any LLM API key. We never store or proxy your key.
Submit multiple URLs in one request. Get structured JSON arrays back.
Same schema, same structure every time. No more parsing brittle HTML selectors.
Feed structured data directly into your vector store or knowledge graph.
Built-in MCP server lets your agents extract structured data in a single tool call.
Structured extraction requires an LLM to interpret the schema. Pass your own OpenAI, Anthropic, or any compatible API key in the request. We never store, log, or proxy your key — it goes directly to your LLM provider.
Free tier includes 1,000 extraction requests/month. No credit card required.