Article Content Extraction API

Extract clean article body, title, author, and publish date from any blog or news page — without the ads and clutter.

The Agenty Content API extracts the main article body from any blog or news URL, automatically removing navigation, ads, sidebars, and footers. Get the title, author, publish date, hero image, and a clean HTML or plain-text body — ready for aggregators, newsletters, or LLM pipelines.

Features

Use cases

API examples

Extract article content with cURL (bash)
curl -X GET "https://api.agenty.ai/v1/content?url=https://example.com/blog/post" \
  -H "Authorization: Bearer YOUR_API_KEY"
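Extract article content in Node.js (javascript)
// URLSearchParams percent-encodes the target article URL for the query string.
const params = new URLSearchParams({ url: 'https://example.com/blog/post' });
const res = await fetch(`https://api.agenty.ai/v1/content?${params}`, {
  headers: { Authorization: 'Bearer YOUR_API_KEY' },
});
// Fail early on HTTP errors instead of parsing an error body as an article.
if (!res.ok) throw new Error(`Request failed with HTTP ${res.status}`);
const article = await res.json();
console.log(article.title, article.text);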
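Extract article content in Python (python)
import requests

# Request the article; requests URL-encodes the `url` query parameter.
res = requests.get(
    "https://api.agenty.ai/v1/content",
    headers={"Authorization": "Bearer YOUR_API_KEY"},
    params={"url": "https://example.com/blog/post"},
    timeout=30,
)
res.raise_for_status()  # stop on HTTP errors instead of parsing an error body
article = res.json()
print(article["title"], article["text"])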

How Agenty compares

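| Feature | Agenty | Readability | Mercury | Postlight |
| --- | --- | --- | --- | --- |
| Automatic content detection | Yes | Yes | Yes | Yes |
| Author & date extraction | Yes | Limited | Yes | Yes |
| Multi-language (50+) | Yes | Limited | Yes | Yes |
| Image & video extraction | Yes | No | Yes | Yes |
| Hosted API + free tier | Yes | Self-host | Self-host | Yes |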

Frequently asked questions

What is the Article Content Extraction API?

The Agenty Content API automatically identifies and extracts the main article on any web page. It returns clean structured data: title, author, publish date, article body, and embedded media URLs.
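As a rough illustration of consuming that structured data, the sketch below maps a decoded JSON response onto a small Python dataclass. Only `title` and `text` appear in the examples on this page; the other key names (`author`, `publishDate`, `media`) and the `Article` and `parse_article` names are assumptions made for illustration and should be checked against a real response.

```python
from dataclasses import dataclass, field
from typing import Any, Optional


@dataclass
class Article:
    """Subset of the fields the Content API is described as returning."""
    title: Optional[str] = None
    text: Optional[str] = None
    author: Optional[str] = None                     # assumed key: "author"
    publish_date: Optional[str] = None               # assumed key: "publishDate"
    media: list[str] = field(default_factory=list)   # assumed key: "media"


def parse_article(payload: dict[str, Any]) -> Article:
    """Build an Article from a decoded JSON response, tolerating missing keys."""
    return Article(
        title=payload.get("title"),
        text=payload.get("text"),
        author=payload.get("author"),
        publish_date=payload.get("publishDate"),
        media=list(payload.get("media") or []),
    )


# Usage with a hand-written sample payload (not real API output).
sample = {"title": "Hello", "text": "Body text."}
article = parse_article(sample)
print(article.title, article.author)
```

Reading keys with `.get()` keeps the parser from raising when a page has no detectable author or date.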

Can I get plain text instead of HTML?

Yes. Set outputFormat: "text" to receive plain text with paragraphs preserved. The default is "html", which returns clean semantic HTML.
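A minimal sketch of requesting plain text, assuming `outputFormat` is sent as a query parameter next to `url` (this page does not say how the option is passed, so treat that as an assumption). The helper `build_content_url` is hypothetical; it only builds the request URL and makes no network call.

```python
from urllib.parse import urlencode

API_ENDPOINT = "https://api.agenty.ai/v1/content"  # endpoint used in the API examples


def build_content_url(article_url: str, output_format: str = "html") -> str:
    """Return the request URL for an article (hypothetical helper).

    Assumption: outputFormat travels as a query parameter alongside url.
    """
    if output_format not in ("html", "text"):
        raise ValueError('output_format must be "html" or "text"')
    # urlencode percent-encodes the article URL so it survives as one parameter.
    query = urlencode({"url": article_url, "outputFormat": output_format})
    return f"{API_ENDPOINT}?{query}"


print(build_content_url("https://example.com/blog/post", output_format="text"))
```

The resulting URL can be sent with the same `Authorization` header shown in the API examples.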

Does it work with paywalled content?

Yes. Pass cookies or auth headers via the headers parameter. We also support session-based authentication for platforms like Medium and Substack.
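The sketch below shows one way such a request might be assembled. How the `headers` parameter is encoded is not specified on this page; the code assumes a JSON-encoded object sent as a query parameter, which is purely illustrative. The cookie value and the helper name `build_paywall_params` are placeholders.

```python
import json


def build_paywall_params(article_url: str, forward_headers: dict[str, str]) -> dict[str, str]:
    """Return query parameters for a page that requires authentication.

    Assumption: the API accepts `headers` as a JSON-encoded object.
    """
    return {
        "url": article_url,
        "headers": json.dumps(forward_headers, sort_keys=True),
    }


params = build_paywall_params(
    "https://example.com/members/post",
    {"Cookie": "session=YOUR_SESSION_COOKIE"},
)
print(params["headers"])
```

If the assumption holds, `params` could be passed to `requests.get(..., params=params)` in the same way as the Python example on this page.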

Is there a free tier?

Yes. All accounts include a free tier. Visit our pricing page for details.

Web scraping with AI

Start scraping data from any website using Agenty's AI-powered web scraping agents.

No credit card required
14-day free trial
Cancel anytime