Universal News Scraper v4.1
A powerful, terminal-based news aggregator that supports RSS feeds, Web Scraping, and Topic Auto-Discovery via Bing News RSS.

Key Features
| Feature | Description |
|---|---|
| Auto-Discovery | Find news on any topic (Crypto, Sports, Politics, AI) without knowing the URL |
| Preset Categories | 6 built-in categories with 30+ international news sources |
| Anti-Blocking | Random User-Agent rotation to bypass restrictions |
| Multi-Format Export | Save results as CSV, JSON, HTML, or all formats |
| HTML Reports | Dark-themed HTML reports with article cards |
| Noise Filter | Automatically filters out generic Bing category entries |
| Real URL Extraction | Extracts actual article URLs from Bing redirects |
| Date Filtering | Only get articles from a specific date onwards |
| Keyword Filtering | Filter articles by multiple keywords |
| Settings Memory | Remembers your last configuration for quick re-runs |
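The keyword and date filters can be pictured with a short sketch. This is not the code from `scraper.py`; the function names are hypothetical, and it assumes articles are dicts with the `title`, `description`, and `date` (ISO `YYYY-MM-DD`) fields shown under Output Formats.

```python
from datetime import date


def matched_keywords(article, keywords):
    """Return the keywords found in the title or description (case-insensitive)."""
    text = f"{article.get('title', '')} {article.get('description', '')}".lower()
    return [kw for kw in keywords if kw.lower() in text]


def filter_articles(articles, keywords=(), since=None):
    """Keep articles dated on or after `since` that match at least one keyword.

    An empty keyword list keeps everything. When `since` is given, articles
    without a parseable ISO date are dropped (an assumption of this sketch).
    """
    kept = []
    for article in articles:
        if since is not None:
            try:
                published = date.fromisoformat(article.get("date") or "")
            except (TypeError, ValueError):
                continue
            if published < since:
                continue
        hits = matched_keywords(article, keywords)
        if keywords and not hits:
            continue
        kept.append({**article, "matched_keywords": ", ".join(hits)})
    return kept
```

Matching here is by plain substring, so a short keyword such as "AI" also matches inside longer words; a word-boundary regex would be stricter.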
What's New in v4.1
- HTML Export - Dark-themed HTML reports
- Enhanced Noise Filter - Filters generic Bing entries (Top stories, Entertainment, etc.)
- Real URL Extraction - Extracts actual article URLs from Bing redirects
- Real Source Detection - Shows the actual news source instead of "Bing"
- 4 Export Options - CSV, JSON, HTML, or All formats
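As a rough illustration of the three Bing-related items above, the sketch below uses only the standard library. It assumes that Bing redirect links carry the article address in a `url` query parameter; the helper names and the `GENERIC_TITLES` set are hypothetical and not taken from `scraper.py`.

```python
from urllib.parse import parse_qs, urlparse

# Hypothetical list: the README names only "Top stories" and "Entertainment".
GENERIC_TITLES = {"top stories", "entertainment"}


def real_url(link):
    """Return the target of a Bing redirect link, or the link unchanged."""
    parsed = urlparse(link)
    host = parsed.hostname or ""
    if host == "bing.com" or host.endswith(".bing.com"):
        # Assumption: the article address is stored in the `url` parameter.
        target = parse_qs(parsed.query).get("url")
        if target:
            return target[0]
    return link


def source_name(url):
    """Derive a display name from the hostname, e.g. 'Techcrunch'."""
    host = urlparse(url).hostname or ""
    if host.startswith("www."):
        host = host[4:]
    return host.split(".")[0].capitalize() if host else "Unknown"


def is_noise(title):
    """True for generic Bing category entries rather than real articles."""
    return title.strip().lower() in GENERIC_TITLES
```

The hostname-based name is deliberately naive: a subdomain such as `news.example.com` would yield "News", so a real implementation would likely need a lookup table or a public-suffix-aware parser.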
Preset Categories
| Category | Sources |
|---|---|
| International News | BBC, CNN, Reuters, Al Jazeera, The Guardian, NPR |
| Sports | ESPN, BBC Sport, Sky Sports, Bleacher Report |
| Tech & Science | TechCrunch, The Verge, Wired, Ars Technica, Space.com |
| Cybersecurity | The Hacker News, BleepingComputer, Krebs, Dark Reading |
| Business & Finance | Bloomberg, CNBC, Financial Times, CoinDesk, CoinTelegraph |
| Entertainment | Variety, Hollywood Reporter, IGN, Kotaku |
Quick Start
1. Clone the Repository
git clone https://github.com/Ilias1988/Universal-News-Scraper.git
cd Universal-News-Scraper
2. Create Virtual Environment (Recommended)
python -m venv venv
# Windows
venv\Scripts\activate
# Linux/Mac
source venv/bin/activate
3. Install Dependencies
pip install -r requirements.txt
4. Run the Scraper
python scraper.py
Usage Guide
Main Menu
╭─────────────────────────────────────────╮
│  UNIVERSAL NEWS SCRAPER v4.1            │
│  Powered by Python & Bing RSS           │
╰─────────────────────────────────────────╯
┌──────────────── Main Menu ─────────────────┐
│ [1] Use previous settings                  │
│ [2] Enter new settings manually            │
│ [3] Auto-Discover & Scrape by Topic        │ ← Recommended!
│ [4] Choose from preset sources             │
│ [5] Exit                                   │
└────────────────────────────────────────────┘
Export Format Options
Export Format:
[1] CSV only
[2] JSON only
[3] HTML only ← Dark-themed report
[4] All formats ← CSV + JSON + HTML
Output Formats
CSV Output (results.csv)
title,url,date,description,source,matched_keywords
"AI Revolution in 2026...",https://techcrunch.com/...,2026-01-20,"Description...",Techcrunch,"AI, technology"
JSON Output (results.json)
[
  {
    "title": "AI Revolution in 2026...",
    "url": "https://techcrunch.com/...",
    "date": "2026-01-20",
    "description": "Description...",
    "source": "Techcrunch",
    "matched_keywords": "AI, technology"
  }
]
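For readers who want to produce the same two files from their own scripts, here is a minimal sketch using the standard `csv` and `json` modules. The field order follows the CSV header above; the function names are hypothetical and the real exporter in `scraper.py` may differ.

```python
import csv
import json

# Column order taken from the CSV header shown above.
FIELDS = ["title", "url", "date", "description", "source", "matched_keywords"]


def save_csv(articles, path):
    """Write articles (a list of dicts) as CSV with a header row."""
    with open(path, "w", newline="", encoding="utf-8") as fh:
        # Unknown keys are ignored; missing keys are written as empty strings.
        writer = csv.DictWriter(fh, fieldnames=FIELDS, extrasaction="ignore")
        writer.writeheader()
        writer.writerows(articles)


def save_json(articles, path):
    """Write articles as an indented JSON array, keeping non-ASCII text readable."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(articles, fh, indent=2, ensure_ascii=False)
```

Opening the CSV file with `newline=""` lets the `csv` module control line endings, which avoids blank rows on Windows.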
HTML Output (results.html)
Dark-themed report with:
- Stats header showing article count
- Article cards with hover effects
- Keyword badges
- Clickable links to original articles
- Responsive design
Requirements
requests>=2.31.0
beautifulsoup4>=4.12.0
feedparser>=6.0.0
fake-useragent>=1.4.0
htmldate>=1.6.0
rich>=13.7.0
lxml>=4.9.0
Project Structure
Universal-News-Scraper/
├── scraper.py              # Main application
├── sources.json            # Preset RSS sources (editable)
├── requirements.txt        # Python dependencies
├── .scraper_config.json    # Auto-saved settings (ignored by git)
├── .gitignore              # Git ignore file
├── LICENSE                 # MIT License
└── README.md               # This file
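The auto-saved settings file can be understood as a small JSON round trip. The sketch below is an assumption about how such a file might be written and read; the helper names and the example keys are hypothetical, and only the file name `.scraper_config.json` comes from the tree above.

```python
import json
from pathlib import Path

CONFIG_PATH = Path(".scraper_config.json")


def save_settings(settings, path=CONFIG_PATH):
    """Persist the last-used settings as JSON."""
    path.write_text(json.dumps(settings, indent=2), encoding="utf-8")


def load_settings(path=CONFIG_PATH):
    """Return the saved settings, or None if the file is missing or corrupt."""
    try:
        return json.loads(path.read_text(encoding="utf-8"))
    except (FileNotFoundError, json.JSONDecodeError):
        return None
```

Returning `None` on a missing or unreadable file lets a menu option like "Use previous settings" fall back to manual entry instead of crashing.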
Examples
Example 1: Find Bitcoin News
Select option: 3
Enter topic: Bitcoin
Keywords: (empty for all)
Export format: 4 (All)
→ Saves bitcoin_news.csv, bitcoin_news.json, bitcoin_news.html
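A sketch of what topic discovery might do with the input above: build a Bing News RSS address for the topic and derive the output file names from it. The `format=rss` query parameter and both helper names are assumptions made for illustration, not confirmed details of `scraper.py`.

```python
import re
from urllib.parse import urlencode


def bing_news_rss_url(topic):
    """Build a Bing News search URL that requests RSS output (assumed endpoint)."""
    query = urlencode({"q": topic, "format": "rss"})
    return "https://www.bing.com/news/search?" + query


def output_basename(topic):
    """Turn a topic into a file-name stem such as 'bitcoin_news'."""
    slug = re.sub(r"[^a-z0-9]+", "_", topic.lower()).strip("_")
    return f"{slug or 'results'}_news"
```

The resulting URL could then be passed to `feedparser.parse`, which is listed in the requirements, to read the feed entries.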
Example 2: Scrape Cybersecurity Sources
Select option: 4
Select category: 4 (Cybersecurity)
Select sources: A (ALL)
Keywords: ransomware
Export format: 3 (HTML)
→ Generates a dark-themed HTML report
Disclaimer
This tool is intended for educational and research purposes only.
- Always respect websites' Terms of Service
- Don't overwhelm servers with excessive requests
- Use responsibly for legitimate research and news aggregation
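One way to keep request rates modest when scripting around the scraper is to pause between requests. The sketch below is only an illustration: the helper names, the delay range, and the static `USER_AGENTS` list (a stand-in for the `fake-useragent` package listed in the requirements) are all assumptions.

```python
import random
import time

# Illustrative stand-in for fake-useragent; any realistic browser strings work.
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
    "Mozilla/5.0 (X11; Linux x86_64; rv:121.0) Gecko/20100101 Firefox/121.0",
]


def random_headers(rng=random):
    """Return request headers with a randomly chosen User-Agent."""
    return {"User-Agent": rng.choice(USER_AGENTS)}


def throttled(urls, min_delay=1.0, max_delay=3.0, sleep=time.sleep, rng=random):
    """Yield (url, headers) pairs, pausing a random interval between them."""
    for index, url in enumerate(urls):
        if index:  # no pause before the first request
            sleep(rng.uniform(min_delay, max_delay))
        yield url, random_headers(rng)
```

Each yielded pair can be passed to `requests.get(url, headers=headers, timeout=10)`; the `sleep` parameter exists so the pause can be replaced in tests.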
License
MIT License - Feel free to use and modify!
Changelog
v4.1 (Current)
- Added HTML Export with dark theme
- Enhanced Noise Filter for Bing RSS
- Real URL Extraction from Bing redirects
- Real Source Detection (shows actual source, not "Bing")
- 4 export options (CSV, JSON, HTML, All)
v4.0
- Complete UI rebrand - "Universal News Scraper"
- Switched from Google Search to Bing News RSS
- 6 international preset categories with 30+ sources
v3.0
- Added Topic Discovery via Google Search
- Cybersecurity-focused preset sources
Happy Scraping!