Web Article Extractor
Extracts key information, summaries, and relevant data from web articles based on user prompts.
Install on your platform
Run in terminal (recommended)
claude mcp add web-article-extractor npx -- -y @trustedskills/web-article-extractor
Or manually add to ~/.claude/settings.json
{
  "mcpServers": {
    "web-article-extractor": {
      "command": "npx",
      "args": [
        "-y",
        "@trustedskills/web-article-extractor"
      ]
    }
  }
}
Requires Claude Code (claude CLI). Run claude --version to verify your install.
About This Skill
The web-article-extractor skill retrieves and processes full text from specified URLs, converting raw HTML into clean, readable article content for AI agents. It is designed to strip away navigation menus, ads, and boilerplate code to deliver only the core narrative or data within a webpage.
When to use it
- Research synthesis: Quickly gather context from multiple news articles or research papers before summarizing them.
- Data extraction: Pull specific text blocks from documentation sites or blogs to feed into analysis workflows.
- Content auditing: Retrieve full versions of web pages to verify accuracy against cached or summarized versions.
- Cross-referencing: Fetch related articles from different domains to build a comprehensive knowledge base for complex queries.
Key capabilities
- Fetches content directly from provided URLs.
- Parses HTML to remove non-essential elements like headers, footers, and scripts.
- Returns clean text suitable for further processing by AI agents.
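The capabilities above can be illustrated with a minimal sketch. It is not the skill's actual implementation, which is not shown on this page. It uses only the Python standard library, and the names `SKIP_TAGS`, `ArticleTextParser`, `fetch_html`, and `extract_text` are hypothetical, as is the choice of which tags count as non-essential.

```python
from html.parser import HTMLParser
from urllib.request import urlopen

# Tags whose contents are treated as non-essential (an assumption for this sketch).
SKIP_TAGS = {"script", "style", "nav", "header", "footer", "aside"}


class ArticleTextParser(HTMLParser):
    """Collects text that is not nested inside any SKIP_TAGS element."""

    def __init__(self):
        super().__init__()
        self.skip_depth = 0  # greater than 0 while inside a skipped element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self.skip_depth == 0:
            self.chunks.append(text)


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Fetch a publicly accessible page and decode it as UTF-8 (hypothetical helper)."""
    with urlopen(url, timeout=timeout) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_text(html: str) -> str:
    """Return the text left after dropping the skipped elements, one chunk per line."""
    parser = ArticleTextParser()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


sample = (
    "<html><head><script>var x = 1;</script></head><body>"
    "<nav>Home | About</nav>"
    "<article><h1>Title</h1><p>Body text.</p></article>"
    "<footer>Copyright</footer></body></html>"
)
print(extract_text(sample))
```

A tag blocklist like this is the simplest approach. Real pages often need heuristics such as text density or readability scoring, which this sketch leaves out.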
Example prompts
- "Extract the main article text from https://example.com/blog/latest-updates and summarize the key points."
- "Use web-article-extractor to get the full content of this documentation page so I can answer a user's technical question accurately."
- "Retrieve the text from these three URLs about market trends and compare their perspectives on inflation."
Tips & gotchas
Make sure the target URL is publicly accessible: the tool cannot bypass authentication, so it will not work on pages behind a login wall. For best results, provide direct links to specific article pages rather than homepage URLs, so that less irrelevant content is extracted.
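As a rough illustration of the second tip, a caller could filter out homepage-style URLs before passing them to the skill. The helper `looks_like_article_url` below is hypothetical and not part of the skill. It checks only the URL's shape, so it cannot detect a login wall, which would require actually requesting the page.

```python
from urllib.parse import urlparse


def looks_like_article_url(url: str) -> bool:
    """Heuristic: an http(s) URL whose path goes beyond the site root."""
    parts = urlparse(url)
    if parts.scheme not in ("http", "https") or not parts.netloc:
        return False
    # A homepage has an empty path or just "/".
    return parts.path.strip("/") != ""


candidates = [
    "https://example.com/blog/latest-updates",
    "https://example.com/",
]
article_urls = [u for u in candidates if looks_like_article_url(u)]
print(article_urls)
```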
TrustedSkills Verification
Unlike other registries that point to live repositories, TrustedSkills pins every skill to a verified commit hash. This protects you from malicious updates — what you install today is exactly what was reviewed and verified.
Security Audits
| Gen Agent Trust Hub | Pass |
| Socket | Pass |
| Snyk | Pass |
🌐 Community
Passed automated security scans.