Web Scraping Resources
56 resources available
Web Scraping MCP Servers
MCP YouTube Extract
A Model Context Protocol (MCP) server for YouTube operations, demonstrating core MCP concepts including tools and logging.
Firecrawl Self-Hosted on Railway
This repository contains a complete Firecrawl setup optimized for Railway deployment with GitHub integration.
Electric scraper
This project collects tools for scraping data of electric compoents from the web. The project includes a MCP server, to let AI perform the scraping.
Extractous MCP Server
A Model Context Protocol (MCP) server for text extraction using the extractous library.
Unsplash MCP Server
An MCP server to download images from Unsplash.
全网短视频去水印链接提取 MCP服务
FastMCP
BrowserLoop
A Model Context Protocol (MCP) server for taking screenshots and reading console logs from web pages using Playwright. This tool allows AI agents to automatically capture screenshots and monitor browser console output for debugging, testing, and development tasks.
mcp-n8n-firecrawl
MCP Server and Client with n8n and Firecrawl A comprehensive Model Context Protocol (MCP) implementation using n8n workflows, featuring an intelligent scraping agent client powered by Google Gemini and a robust server utilizing Firecrawl API for web content extraction.
Puppeteer
A Model Context Protocol server that provides browser automation capabilities using Puppeteer. This server enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment.
YouTube Transcript MCP Server
A simple MCP server that fetches YouTube video transcripts and saves them to Notion.
Flight Search MCP Server
The Flight Search MCP Server enables Large Language Models (LLMs) to retrieve detailed flight search and information without requiring an API key. Using web scraping, it collects comprehensive data on flights, including prices, schedules, airlines, baggage allowances, and more. This server integrates seamlessly with MCP-compatible clients (e.g., Claude Desktop) to enhance flight search and analysis capabilities.
Secondhand Malta MCP Server
A Model Context Protocol (MCP) server for interacting with secondhand.com.mt, Malta's classified ads marketplace.
Dafty MCP Server
This is an independent, open-source project and is not affiliated with, endorsed, or sponsored by Daft.ie. This tool is provided for educational and experimental purposes only. The data is scraped from a publicly available website, and its use is subject to the terms of service of that website. The author assumes no liability for the use or misuse of this software. Please use it responsibly and ethically.
AlsoAsked MCP Server
A Model Context Protocol (MCP) server for the AlsoAsked API, providing access to Google's "People Also Ask" data for SEO research and content optimization.
MCP Web-Finder
Un servicio MCP para bsqueda y anlisis web avanzado.
Rightmove MCP Server
A Model Context Protocol (MCP) server for accessing Rightmove.co.uk property data. This server provides tools to search properties, get detailed property information, and retrieve area statistics from the UK's largest property portal.
MCP Web Scraper
A production-ready Model Context Protocol (MCP) server for intelligent web scraping with advanced cookie consent handling. Built with TypeScript and Playwright, supporting 30+ languages and 25+ consent management platforms.
WebSage 🌐
Welcome to the WebSage repository! This project is an AI agent designed to scrape websites, crawl pages, and extract structured data in real-time. It can help you answer complex queries efficiently and effectively.
🚇 MCP Traffic - Tokyo Traffic Data Collection System
Real-time Tokyo transportation data collection and visualization system using ODPT API
@just-every/mcp-read-website-fast
Fast, token-efficient web content extraction for AI agents - converts websites to clean Markdown.
Web Scraper MCP Server
A web scraping server that implements the Minecraft Control Protocol (MCP) using FastAPI. This server provides tools for extracting content and links from web pages in a structured way.
YouTube Transcript MCP Server
A Model Context Protocol (MCP) server that fetches transcripts from YouTube videos. This server can be deployed remotely and integrated with Claude to provide YouTube transcript fetching capabilities directly in your conversations.
Dealwatcher AI
Dealwatcher AI is an automated price monitoring agent built with Pydantic AI, Logfire, and an MCP server. It tracks price fluctuations across e-commerce platforms by scraping product pages, extracting key information (title, description, price), storing data in a structured database, and triggering notifications when price changes are detected.
Dirsearch MCP - Intelligent Directory Scanner with AI Integration
A powerful, multi-threaded directory and file enumeration tool enhanced with MCP (Machine Coordination Protocol) intelligence and AI agent integration for advanced web application scanning.
Social Media Scraper - Custom MCP Server
A comprehensive Model Context Protocol (MCP) server that provides social media scraping capabilities for LinkedIn, Facebook, Instagram, and Google search functionality.
scrapbox-cosense-mcp
English |
Crawl4AI RAG MCP Server
Web Crawling and RAG Capabilities for AI Agents and AI Coding Assistants
Bract - MCP Browser Automation Server
Bract is a Go implementation of a Model Context Protocol (MCP) server that enables browser automation through a Chrome extension. It provides a standardized interface for AI assistants and automation tools to control web browsers programmatically.
Real Estate Aggregator MX
Consolidated Mexican Real Estate Aggregator platform combining the best features from multiple repositories.
CLG-MCP: Cyndi's List Genealogy MCP Server
A Model Context Protocol (MCP) server that provides comprehensive genealogy resource discovery capabilities through web scraping of Cyndi's List. This server is designed to run on Cloudflare Workers free tier.
DocsScraper MCP Server
An MCP server that connects to the DocsScraper web API to provide semantic search capabilities through documentation chunks.
MCP Browser Automation
Algonius Browser is an open-source MCP (Model Context Protocol) server that provides browser automation capabilities to external AI systems. It exposes a comprehensive set of browser control tools through the MCP protocol, enabling AI assistants and other tools to navigate websites, interact with DOM elements, and extract web content programmatically.
Subdomain Screenshot Tool
This is a Gradio application that allows users to input a web address and get screenshots of its subdomains. The tool uses Selenium with Chrome in headless mode to capture screenshots and BeautifulSoup to crawl for subdomains.
PlayMCP Browser Automation Server
A comprehensive MCP (Model Context Protocol) server for browser automation using Playwright. This server provides powerful tools for web scraping, testing, and automation.
Facebook Ads Library MCP 🚀
Welcome to the Facebook Ads Library MCP repository! This project provides a server that connects to Facebook's Ads Library, allowing users to get instant answers and insights from Facebook's extensive advertising data.
PuppeteerMCP Server
Developing website UI's with MCP just got a lot easier. A Model Context Protocol (MCP) server that provides screenshot tools for AI assistants using Puppeteer. This server integrates with MCP-compatible hosts like Cursor to enable AI agents to capture and analyze web page screenshots, console logs, errors, and warnings.
Scrapbox MCP
English |
YggTorrent MCP Server & Wrapper
This repository provides a Python wrapper for the YggTorrent website and an MCP (Model Context Protocol) server to interact with it programmatically. This allows for easy integration of YggTorrent functionalities into other applications or services.
LeadScraper Agent
This is a modular, AI-powered lead generation bot for scraping builder job posts and auto-notifying high-value opportunities.
Discord MCP Server
A Model Context Protocol (MCP) server that lets LLMs read messages, discover channels, send messages, and monitor Discord communities using web scraping.
ScrapeMCP
cloudbrowser mcp server
Puppeteer Vision MCP Server - Specify4IT Configuration
This is a configured version of the Puppeteer Vision MCP server specifically set up for scraping specify4it.com. It includes custom settings and scripts for handling the site's specific structure, animations, and interactive elements.
YouTube Transcript Server
A Model Context Protocol server that enables retrieval of transcripts from YouTube videos. This server provides direct access to video captions and subtitles through a simple interface.
Twitter MCP Server
An MCP (Model Context Protocol) server that provides tools for interacting with Twitter using the agent-twitter-client library.
X/Twitter MCP Server
A Model Context Protocol (MCP) server that provides unofficial X/Twitter API access through browser automation using Playwright. This server enables AI agents and applications to interact with X/Twitter programmatically for content creation, scraping, and social media automation.
Curate-MCP
Curate-MCP is an automation tool designed to help users curate and customize their resumes based on specific job descriptions (JD), with a focus on LinkedIn job postings. It leverages browser automation and scraping to extract job requirements and provides tools for tailoring resumes to better match targeted roles.
Fetch MCP Server
This MCP server provides functionality to fetch web content in various formats, including HTML, JSON, plain text, and Markdown.
WebScraping.AI MCP Server
A Model Context Protocol (MCP) server implementation that integrates with WebScraping.AI for web data extraction capabilities.
conduit-mcp 🐱
The purr-fect MCP server for feline-fast file operations, web prowling, and data hunting!
MCPBrowserServer
A comprehensive Model Context Protocol (MCP) server for browser automation using Selenium WebDriver. This server provides AI agents with powerful web browsing capabilities including session management, tab control, navigation, and element interaction.
crawl_medical_news
Medical Data Crawler MCP Server
Bright Data MCP
Welcome to the official Bright Data Model Context Protocol (MCP) server, enabling LLMs, agents and apps to access, discover and extract web data in real-time. This server allows MCP clients, such as Claude Desktop, Cursor, Windsurf and others, to seamlessly search the web, navigate websites, take action and retrieve data - without getting blocked - perfect for scraping tasks.
MCP-Scrape: SEAL Team Six-Grade Web Scraping MCP Server
MCP-Scrape is a military-grade web scraping MCP server that combines the most powerful scraping capabilities from multiple battle-tested tools. Built with SEAL Team Six principles: Precision, Reliability, Adaptability, and No Failure.
🕷️ Decodo Website Scraper
A Model Context Protocol (MCP) server that provides website scraping capabilities using the Decodo scraping API. This tool allows you to extract text content from specific HTML elements on web pages.
web-app-mcp
An MCP server using FastMCP and Playwright to interact with web applications.
Page 2 of 5