web-search-mcp
Health Uyari
- License — License: MIT
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Low visibility — Only 9 GitHub stars
Code Gecti
- Code scan — Scanned 4 files during light audit, no dangerous patterns found
Permissions Gecti
- Permissions — No dangerous permissions requested
Bu listing icin henuz AI raporu yok.
A Model Context Protocol (MCP) server that provides web search functionality using a headless Chrome browser to scrape Google, DuckDuckGo and Bing search results.
Web Search MCP
A Model Context Protocol (MCP) server that provides web search functionality by scraping Google, DuckDuckGo, and Bing search results through a headless browser.
Features
- Multi-Engine Search: Searches Google, DuckDuckGo, and Bing with automatic fallback when an engine is blocked or returns nothing
- Structured Results: Titles, URLs, domains, snippets, and rankings — with redirect URLs unwrapped to their real destinations
- Web Page Content: Fetch and extract text content from any webpage
- Pluggable Browser Backends: Selenium + headless Chrome (default) or the lightweight Obscura headless browser, selected via an environment variable
- MCP Compatible: Fully compatible with Claude Desktop and other MCP clients
Tools Available
search_web
Search the web across multiple engines (Google → DuckDuckGo → Bing) with automatic fallback and return structured results.
Parameters:
query(string): The search query stringmax_results(int, optional): Maximum number of results to return (default: 10, max: 100)include_snippets(bool, optional): Whether to include text snippets (default: true)
Returns:
- List of search results with:
title: Page titleurl: Full URL (redirect wrappers resolved to the real destination)domain: Domain namesnippet: Text snippet (if enabled)rank: Search result rankingsource_engine: Which engine produced the result (google,duckduckgo, orbing)
get_webpage_content
Fetch and return the text content of a webpage.
Parameters:
url(string): The URL of the webpage to fetchmax_length(int, optional): Maximum content length (default: 5000, max: 20000)
Returns:
- Dictionary with:
url: The requested URLtitle: Page titlecontent: Extracted text contentlength: Content length in characters
get_search_engine_status
Report the active browser backend and the availability of each search engine.
Returns:
- Dictionary with a
backendkey (seleniumorobscura) plus each engine name mapped toavailableorblocked.
reset_search_engines
Clear the list of engines marked as blocked during the session (useful if an engine recovers).
Returns:
- A status message string.
Installation
Install dependencies:
# Using uv (recommended) uv sync # Or using pip pip install -e .Install a browser backend (pick one):
Selenium (default) — requires Chrome:
- On macOS:
brew install --cask google-chrome - On Ubuntu:
sudo apt-get install google-chrome-stable - On Windows: Download from the Google Chrome website
ChromeDriver is downloaded and managed automatically by webdriver-manager.
- On macOS:
Obscura — no Chrome required. See the Browser Backends section below.
Usage
Running the MCP Server
# Run directly
python main.py
# Or using the installed script
web-search-mcp
The server will start and listen for MCP connections.
Using with Claude Desktop
Add this configuration to your Claude Desktop MCP settings:
{
"mcpServers": {
"web-search-mcp": {
"command": "uvx",
"args": ["git+https://github.com/pranavms13/web-search-mcp"]
}
}
}
Example Usage in Claude
Once connected, you can use the tools like this:
Search for "python web scraping tutorials" and show me the top 5 results.
Get the content from this webpage: https://example.com/article
Configuration
The web searcher uses these Chrome options by default:
- Headless mode (no visible browser window)
- Window size: 1920x1080
- User agent: Modern Chrome browser
- Security flags for running in containers
Browser Backends
The server can render pages with one of two interchangeable backends, selected via the BROWSER_BACKEND environment variable:
| Backend | Value | Notes |
|---|---|---|
| Selenium | selenium (default) |
Headless Chrome via Selenium + webdriver-manager. |
| Obscura | obscura |
Obscura headless browser via its CLI. Lightweight, stealthy, no Chrome needed. |
Both backends produce fully JS-rendered HTML, which is then parsed with BeautifulSoup, so search results are identical in shape regardless of backend.
Using Obscura
Install the Obscura binary (see its releases), e.g. on macOS:
curl -LO https://github.com/h4ckf0r0day/obscura/releases/latest/download/obscura-x86_64-macos.tar.gz tar xzf obscura-x86_64-macos.tar.gzRun the server with Obscura selected:
BROWSER_BACKEND=obscura web-search-mcp
If obscura is not on your PATH, point to it explicitly with OBSCURA_BIN=/full/path/to/obscura. When the binary cannot be found, the server logs a warning and falls back to the Selenium backend automatically.
The active backend is reported by the get_search_engine_status tool under the backend key.
Error Handling
The tool includes comprehensive error handling for:
- Network timeouts
- WebDriver failures
- Page parsing errors
- Invalid URLs
Errors are logged and graceful fallbacks are provided.
Requirements
- Python 3.10+
- A browser backend: Chrome (Selenium) or the Obscura binary
- Internet connection
Dependencies
fastmcp: MCP server frameworkselenium: Web browser automation (Selenium backend)beautifulsoup4: HTML parsingwebdriver-manager: Chrome driver management (Selenium backend)lxml: XML/HTML parser
The Obscura backend has no Python dependency — it is invoked as an external CLI binary.
Limitations
- Search engines may rate-limit or block automated access; the multi-engine fallback mitigates but does not eliminate this
- Google in particular often blocks non-stealth headless traffic — DuckDuckGo and Bing typically remain available as fallbacks
- Results may vary based on location and each engine's ranking
- The Selenium backend requires Chrome; the Obscura backend requires the Obscura binary
Development
To modify or extend the functionality:
- Clone the repository
- Install in development mode:
uv syncorpip install -e . - Make your changes
- Run the test suite:
uv run pytest test.py -q - Smoke-test the server:
python main.py(orBROWSER_BACKEND=obscura OBSCURA_BIN=/path/to/obscura python main.py)
License
This project is licensed under MIT License. You can check it out at - LICENSE
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Yorumlar (0)
Yorum birakmak icin giris yap.
Yorum birakSonuc bulunamadi