Web Scraping Resources

56 resources available

Web Scraping MCP Servers

MCP Browser Control Server

The World's First Complete Media Testing Platform with Revolutionary Audio & Video Capabilities

Playwright MCP Automation

An AI-powered browser automation platform that converts natural language test steps into executable browser automation using the Model Context Protocol (MCP) and Playwright.

Grasp

Grasp is an open-source, self-hosted agentic browser. With built-in support for MCP and A2A, it can integrate with other AI apps and agents.

WaterCrawl MCP

A Model Context Protocol (MCP) server for WaterCrawl, built with FastMCP. This package provides AI systems with web crawling, scraping, and search capabilities through a standardized interface.

YouTube Transcript MCP Server

This project implements a Model Context Protocol (MCP) server that provides a tool for fetching YouTube video transcripts in various formats. Leveraging the youtube-transcript-api, the server allows Large Language Models (LLMs) to access YouTube transcripts securely and efficiently.

MCP Network Capture

Capture real browser network requests (URLs, methods, status, headers, optional bodies/traces) and save them locally as newline-delimited JSON (JSONL). This repo wires Playwright MCP (browser automation) with a Filesystem MCP (safe writes) and a small headless Python client.

Stock Analysis MCP Service (股票分析 MCP 服务)

mcp-stock-scanner

MCP Server Playwright

A Model Context Protocol server that provides browser automation capabilities using Playwright. It enables LLMs to interact with web pages, take screenshots, and execute JavaScript in a real browser environment.

BrowserTools MCP

Make your AI tools 10x more aware and capable of interacting with your browser

MCP app to get clean markdown from a url

I wanted to try MCP and used the repository MCP-searxng by SecretiveShell as a starting point.

Thordata MCP Server

Built on a 195+ country proxy network, Thordata MCP breaks through web data barriers, delivering pure, structured, globally unlimited real-time information streams to AI models

Safari Screenshot

A Node.js MCP Server for capturing screenshots using Safari on macOS.

MCP Chrome Integration

A protocol that enables AI models to control Chrome browser and perform web automation.

Web Scraper MCP Server

An MCP (Model Context Protocol) server that can scrape web pages and extract content using CSS selectors. Built with deno-dom for fast HTML parsing.

Apple RAG Collector

Cloudflare Worker-based batch processing system for Apple Developer Documentation with intelligent content comparison and automated scheduling.

🤖 BotBrowser

The Cross-Platform Browser That Actually Works in Authorized Testing Environments. 99.7% success rate, zero configuration, perfect mobile emulation. Windows profiles run flawlessly on macOS; Android simulation on desktop.

Firecrawl MCP Server

A Model Context Protocol (MCP) server implementation that integrates with Firecrawl for web scraping capabilities.

Fetch MCP Server

A Model Context Protocol server that provides web content fetching capabilities. This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.

🧪 MCP AI Travel Agent (Demo Project)

A simple CLI project that combines Bright Data's Browser API (MCP) with OpenAI to scrape hotel listings and generate a 3-day travel itinerary using AI. The itinerary is only the example used; the same approach works for any scraping task.

🤖 AIPex - AI-Powered Browser Automation Extension

Automate your browser with natural language commands - The open source browser-use solution

Government Project Announcement Scraper

This project is a web scraper designed to collect information about government support projects from various websites.

mcp-tavily-extract

An MCP server that gives clients the ability to extract a web page.

WebDriverIO MCP Server

A Model Context Protocol (MCP) server that enables Claude Desktop to interact with web browsers using WebDriverIO. This allows Claude to perform web automation tasks like clicking elements, filling forms, taking screenshots, and more.

MCP-GetWeb

A Model Context Protocol (MCP) server that provides web search and content extraction capabilities.

mcp.apify.com

The Apify Model Context Protocol (MCP) server at mcp.apify.com enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and automation tools available on the Apify Store.

Chrome MCP Browser Launcher


Open Crawler MCP Server

A Model Context Protocol (MCP) server for web crawling and content extraction from web pages with multiple output formats.

LCBro

The Cool Browser Automation MCP Server

🕷️ crawl4ai-mcp-server - Simple Setup for Web Scraping Tools

The crawl4ai-mcp-server is a lightweight server that allows you to access web scraping and crawling tools easily. It provides similar capabilities to Firecrawl's API but offers a self-hosted and free option. This server integrates seamlessly with AI frameworks like OpenAI Agents SDK, Cursor, and Claude Code. You can use it for various AI workflows, making it a valuable resource for anyone needing web data.

LLM Researcher

A lightweight MCP (Model Context Protocol) server for LLM orchestration that provides efficient web content search and extraction capabilities. This CLI tool enables LLMs to search DuckDuckGo and extract clean, LLM-friendly content from web pages.

Docfork MCP - Up-to-date Docs for Devs and AI Agents in a Single Tool Call

Docfork MCP pulls @latest documentation and code examples straight from the source - and adds them right into your context.

Daily Tech Digest Agent

This repository hosts an AI agent that runs daily and collects the most interesting English-language articles on:

mastr-mcp: An MCP server example.

Fetches data from the German Marktstammdatenregister (marktstammdatenregister.de).

mcp-scraper-main1

URL-Context-MCP MCP Server

The URL-Context-MCP MCP Server provides a tool to analyze and summarize the content of URLs using Google Gemini's URL Context capability via the Gemini API.

Sponsors

Easy, effortless Web Scraping as it should be!

YouTube Transcript Remote MCP Server

The first remote Model Context Protocol (MCP) server that enables Claude AI to extract transcripts from YouTube videos. This server offers zero-setup access for users on any platform including mobile devices.

🌐 geo-ai-agent - Optimize Your Website Content Effortlessly

Welcome to the geo-ai-agent! This tool helps you audit and enhance your website content. It uses AI to crawl your URLs, analyze H1 tags, and provide GEO recommendations. With this tool, you can improve your site's SEO and user engagement without any technical knowledge.

Chrome MCP Server

A Model Context Protocol (MCP) server for controlling and automating Google Chrome browser operations using Puppeteer. This server provides tools for web automation, element interaction, and browser management.

MCP YouTube Extract

A Model Context Protocol (MCP) server for YouTube operations, demonstrating core MCP concepts including tools and logging.

YouTube MCP Server

A Model Context Protocol (MCP) server for interacting with YouTube data. This server provides resources and tools to query YouTube videos, channels, comments, and transcripts through a stdio interface.

Supadata MCP Server

A Model Context Protocol (MCP) server implementation that integrates with Supadata for video & web scraping capabilities.

xss-mcp-tester

For an explanation and proof of concept of what this MCP server does, see my article on Medium: Mypost. In short, it is an MCP server for performing XSS tests with AI.

MCP Puppeteer Server

A Model Context Protocol (MCP) server that provides Claude Code with comprehensive browser automation capabilities through Puppeteer. This server allows Claude to interact with web pages, take screenshots, execute JavaScript, and perform various browser automation tasks.

🔥 MediaCrawler_MCP_Server - MCP for MediaCrawler 🕷️

https://github.com/NanmiCoder/MediaCrawler

🚀 HeroForge.ai Competitor Analysis System

A comprehensive competitor analysis tool built for HeroForge.ai to analyze Aloa.co and other competitors, with advanced MCP integration, context management, and future SEO capabilities.

CodeDox - Documentation Code Extraction & Search

A powerful system for crawling documentation websites, extracting code snippets, and providing fast search capabilities via MCP (Model Context Protocol) integration.

Xenopus MCP

Xenopus MCP is a tool that provides an interface to the Screaming Frog SEO Spider. It allows you to automate crawling and data extraction from websites.

BrowserMCP Enhanced

Enhanced Model Context Protocol (MCP) server for browser automation with improved element selection, tab management, and token optimization. Built on top of the original BrowserMCP with significant improvements for AI-powered browser interaction.

GoLogin MCP Server

Manage your GoLogin browser profiles and automation directly through AI conversations. This MCP server connects to the GoLogin API, letting you create, configure, and control browser profiles using natural language.

BookMinder

A tool to extract content and highlights from Apple Books for LLM analysis.

Browser Agent MCP

Advanced web scraping with AI-powered challenge solving and stealth capabilities.

🌐 The Most Advanced Web Fetching MCP Server

The most feature-rich, production-ready web fetching MCP server available

yutu

yutu is a fully functional MCP server and CLI for YouTube to automate your YouTube workflows. It can manipulate almost all YouTube resources, like videos, playlists, channels, comments, captions, and more.

🤖 Puppeteer MCP Server

A self-hosted Puppeteer MCP (Model Context Protocol) server with remote SSE access, API key authentication, and Docker deployment. This server provides 16 comprehensive Puppeteer tools including advanced mouse interactions and authentication cookie management, with enhanced security, monitoring, and production-ready features.
Page 1 of 5