google-news-api
Health Pass
- License — License: MIT
- Description — Repository has a description
- Active repo — Last push 0 days ago
- Community trust — 12 GitHub stars
Code Pass
- Code scan — Scanned 12 files during light audit, no dangerous patterns found
Permissions Pass
- Permissions — No dangerous permissions requested
No AI report is available for this listing yet.
Unofficial Python client for Google News RSS with sync, async, URL decoding, caching, and MCP support.
Google News API Client for Python
Unofficial Google News API client for Python. Search Google News RSS feeds,
fetch top stories, filter by date or recency, decode Google News article URLs,
and use the same news tools from an MCP server.
This package uses Google News RSS feeds and compatible provider modes. It is not
an official Google API.
Quickstart
pip install google-news-api
from google_news_api import GoogleNewsClient
with GoogleNewsClient(language="en", country="US") as client:
articles = client.search("artificial intelligence", when="24h", max_results=5)
for article in articles:
print(article["title"])
print(article["source"])
print(article["link"])
Each article contains:
{
"title": "Article title",
"link": "https://news.google.com/rss/articles/...",
"published": "Tue, 16 Jun 2026 12:00:00 GMT",
"summary": "Article summary",
"source": "Publisher name",
}
Why Use This Package
- Sync and async clients with the same API shape
- Google News RSS search, topic feeds, date filters, and relative time filters
- Google News URL decoding to original publisher URLs
- Rate limiting, retries, and in-memory caching
- Optional SearchAPI provider modes for direct publisher URLs and richer snippets
- MCP server for agents and AI tools
Features
- Comprehensive news search and retrieval functionality
- Search by keywords with advanced filtering
- Get top news by topic (WORLD, NATION, BUSINESS, TECHNOLOGY, etc.)
- Batch search support for multiple queries
- URL decoding for original article sources
- Both synchronous and asynchronous APIs
GoogleNewsClientfor synchronous operationsAsyncGoogleNewsClientfor async/await support
- Advanced time-based search capabilities
- Date range filtering (after/before)
- Relative time filtering (e.g., "1h", "24h", "7d")
- Maximum 100 results for date-based searches
- High performance features
- In-memory caching with configurable TTL
- Built-in rate limiting with token bucket algorithm
- Automatic retries with exponential backoff
- Concurrent batch searches in async mode
- Multi-language and country support
- ISO 639-1 language codes (e.g., "en", "fr", "de")
- ISO 3166-1 country codes (e.g., "US", "GB", "DE")
- Language-country combinations (e.g., "en-US", "fr-FR")
- Robust error handling
- Specific exceptions for different error scenarios
- Detailed error messages with context
- Graceful fallbacks and retries
- Modern Python packaging with Poetry
Requirements
- Python 3.9 or higher
- Poetry (recommended for installation)
Installation
Using Poetry (recommended)
# Install using Poetry
poetry add google-news-api
# Or clone and install from source
git clone https://github.com/ma2za/google-news-api.git
cd google-news-api
poetry install
Using pip
pip install google-news-api
Usage Examples
Synchronous Client
from google_news_api import GoogleNewsClient
try:
# Get top news by topic
with GoogleNewsClient(
language="en",
country="US",
requests_per_minute=60,
cache_ttl=300
) as client:
world_news = client.top_news(topic="WORLD", max_results=5)
tech_news = client.top_news(topic="TECHNOLOGY", max_results=3)
# Search with date range
date_articles = client.search(
"Ukraine war",
after="2024-01-01",
before="2024-03-01",
max_results=5
)
# Search with relative time
recent_articles = client.search(
"climate change",
when="24h", # Last 24 hours
max_results=5
)
# Batch search multiple queries
batch_results = client.batch_search(
queries=["AI", "machine learning", "deep learning"],
when="7d", # Last 7 days
max_results=3
)
# Process results
for topic, articles in batch_results.items():
print(f"\nTop {topic} news:")
for article in articles:
print(f"- {article['title']} ({article['source']})")
print(f" Published: {article['published']}")
print(f" Summary: {article['summary'][:100]}...")
except Exception as e:
print(f"An error occurred: {e}")
Asynchronous Client
from google_news_api import AsyncGoogleNewsClient
import asyncio
async def main():
async with AsyncGoogleNewsClient(
language="en",
country="US",
requests_per_minute=60
) as client:
# Fetch multiple news categories concurrently
world_news = await client.top_news(topic="WORLD", max_results=3)
tech_news = await client.top_news(topic="TECHNOLOGY", max_results=3)
# Batch search with concurrent execution
batch_results = await client.batch_search(
queries=["AI", "machine learning", "deep learning"],
when="7d",
max_results=3
)
# Decode Google News URLs to original sources
for topic, articles in batch_results.items():
print(f"\nTop {topic} news:")
for article in articles:
original_url = await client.decode_url(article['link'])
print(f"- {article['title']} ({article['source']})")
print(f" Original URL: {original_url}")
if __name__ == "__main__":
asyncio.run(main())
Search Modes
The search(), batch_search(), and top_news() methods support three search
modes:
| Mode | Backend | Best For |
|---|---|---|
"default" |
Google News RSS | Fast baseline |
"searchapi_portal" |
SearchAPI Google News Portal | Direct publisher URLs |
"searchapi_light" |
SearchAPI Google News Light | Fresh results and snippets |
SearchAPI modes require an API key:
export SEARCHAPI_API_KEY="your-api-key"
Windows PowerShell:
$env:SEARCHAPI_API_KEY = "your-api-key"
Example:
from google_news_api import GoogleNewsClient
client = GoogleNewsClient(language="en", country="US")
default_articles = client.search(
"artificial intelligence regulation",
max_results=10,
)
portal_articles = client.search(
"artificial intelligence regulation",
max_results=10,
mode="searchapi_portal",
)
light_articles = client.search(
"artificial intelligence regulation",
max_results=10,
mode="searchapi_light",
)
In practice, use "default" first. Switch to "searchapi_portal" for direct
publisher URLs, or "searchapi_light" for snippet-heavy recent searches.
MCP Server
This repository also includes an MCP server for agent workflows. It exposes
Google News search and top-news tools, decodes Google News URLs, and can extract
article text from publisher pages.
python mcp_server/googlenews.py
See mcp_server/README.md for tool parameters and response
format.
Configuration
The library provides extensive configuration options through the client initialization:
| Parameter | Description | Default | Example Values |
|---|---|---|---|
language |
Two-letter language code (ISO 639-1) or language-country format | "en" |
"en", "fr", "de", "en-US", "fr-FR" |
country |
Two-letter country code (ISO 3166-1 alpha-2) | "US" |
"US", "GB", "DE", "JP" |
requests_per_minute |
Rate limit threshold for API requests | 60 |
30, 100, 120 |
cache_ttl |
Cache duration in seconds for responses | 300 |
600, 1800, 3600 |
Available Topics
The top_news() method supports the following topics:
"WORLD"- World news"NATION"- National news"BUSINESS"- Business news"TECHNOLOGY"- Technology news"ENTERTAINMENT"- Entertainment news"SPORTS"- Sports news"SCIENCE"- Science news"HEALTH"- Health news
Time-Based Search
The library supports two types of time-based search:
Date Range Search
- Use
afterandbeforeparameters - Format:
YYYY-MM-DD - Maximum 100 results
- Example:
after="2024-01-01", before="2024-03-01"
- Use
Relative Time Search
- Use the
whenparameter - Hours:
"1h"to"101h" - Days: Any number of days (e.g.,
"7d","30d") - Cannot be used with
after/before - Example:
when="24h"for last 24 hours
- Use the
Article Structure
Each article in the results contains the following fields:
title: Article titlelink: Google News article URLpublished: Publication date and timesummary: Article summary/descriptionsource: News source name
Error Handling
The library provides specific exceptions for different error scenarios:
from google_news_api.exceptions import (
ConfigurationError, # Invalid client configuration
ValidationError, # Invalid parameters
HTTPError, # Network or server issues
RateLimitError, # Rate limit exceeded
ParsingError # RSS feed parsing errors
)
try:
articles = client.search("technology")
except RateLimitError as e:
print(f"Rate limit exceeded. Retry after {e.retry_after} seconds")
except HTTPError as e:
print(f"HTTP error {e.status_code}: {str(e)}")
except ValidationError as e:
print(f"Invalid parameters: {str(e)}")
except Exception as e:
print(f"Unexpected error: {str(e)}")
Best Practices
Resource Management
- Use context managers (
async with) for async clients - Use context managers (
with) or callclose()for synchronous clients - Implement proper error handling and cleanup
Performance Optimization
- Utilize caching for frequently accessed queries
- Use the async client for concurrent operations
- Batch related requests to maximize cache efficiency
- Configure appropriate cache TTL based on your needs
Rate Limiting
- Set
requests_per_minutebased on your requirements - Implement exponential backoff for rate limit errors
- Monitor rate limit usage in production
Development
Setting up the Development Environment
# Clone the repository
git clone https://github.com/ma2za/google-news-api.git
cd google-news-api
# Install development dependencies
poetry install --with dev
# Set up pre-commit hooks
pre-commit install
Running Tests
# Run tests with Poetry
poetry run pytest
# Include live Google News integration tests
poetry run pytest --run-integration
# Run tests with coverage
poetry run pytest --cov=google_news_api
# Run pre-commit on all files
pre-commit run --all-files
The default test command skips tests that make live network calls. CI runs this
stable test set on supported Python versions.
Contributing
- Fork the repository
- Create a feature branch (
git checkout -b feature/amazing-feature) - Make your changes
- Run tests and linting (
poetry run pytestandpoetry run flake8) - Commit your changes (
git commit -m 'Add amazing feature') - Push to the branch (
git push origin feature/amazing-feature) - Open a Pull Request
License
This project is licensed under the MIT License - see the LICENSE file for details.
Author
Paolo Mazza ([email protected])
Acknowledgments
- The URL decoding functionality is based on the work of SSujitX/google-news-url-decoder
Support
For issues, feature requests, or questions:
- Open an issue on GitHub
- Contact the author via email
- Check the examples directory for more usage scenarios
Reviews (0)
Sign in to leave a review.
Leave a reviewNo results found