LinuxReport - Multi-Platform News Aggregation

Simple, fast, and intelligent news aggregation platform built with Python/Flask. Designed as a modern drudgereport.com clone that automatically aggregates and curates news from multiple categories, updated 24/7 with AI-powered headline generation.

This project is free and open source software released under the GNU Lesser General Public License v3.0 (LGPL v3).

DeepWiki provides excellent analysis of the codebase, including visual dependency graphs.

🌐 Live Sites

Category	URL	Focus
Linux	linuxreport.net	Linux news, open source, tech
COVID	covidreport.org	Health, pandemic updates
AI	aireport.keithcu.com	Artificial intelligence, ML
Solar/PV	pvreport.org	Solar energy, renewable tech
Techno	news.thedetroitilove.com	Detroit techno music
Space	news.spaceelevatorwiki.com	Space exploration

✨ Key Features

🚀 High performance with thread pools and efficient caching
🤖 AI-powered headlines via OpenRouter.ai using a curated set of reliable models
🎯 Multi-site support: multiple news categories from one shared codebase
🌙 Dark mode, font controls, and mobile-friendly layout
⚡ Multi-layer caching and optional CDN for fast responses
🔒 Security best practices: rate limiting, admin auth, config-based secrets
🛠️ Easy configuration of feeds and report types

🧠 AI-Powered Headlines

LinuxReport uses LLMs via OpenRouter.ai to generate and refine headlines.

Uses multiple high-quality models; failures fall back to a reliable default.
Logic is implemented in auto_update.py (model selection and retries).

🚀 Quick Start

# Clone the repository
git clone https://github.com/KeithCu/LinuxReport
cd LinuxReport

# Option 1: Modern approach with uv (recommended - 10-100x faster)
curl -LsSf https://astral.sh/uv/install.sh | sh
uv sync

# Option 2: Traditional approach with pip
pip install -r requirements.txt

# For CPU-only PyTorch and ML dependencies (optional, for auto-update features)
# Run this script to install CPU versions of PyTorch/sentence-transformers to save space:
./install_cpu_ml_deps.sh

# Configure (see Configuration section below)
cp config.yaml.example config.yaml
# Edit config.yaml with your settings

# Run development server
uv run python -m flask run
# Or with pip: python -m flask run

🏗️ Architecture Overview

High-level design:

Backend:
- Python 3.x + Flask.
- Background workers for scraping and updating feeds.
Storage and caching:
- SQLite via Diskcache for persistent, high-performance caching.
- In-memory cache for hot data.
- File-based HTML snippets for AI-generated headline sections.
Frontend:
- Jinja2 templates with modular JS/CSS.
- Bundled/minified assets for production.
Scraping:
- feedparser + BeautifulSoup4 for most sites.
- Optional Selenium + Tor for complex/JS-heavy or privacy-sensitive sources.
Images:
- Automatic optimization and WebP support.

📋 Configuration

Copy and edit config.yaml:
- Set a strong admin password and secret_key.
- Configure allowed_domains and any deployment-specific settings.
Configure report types:
- Edit *_report_settings.py to define feeds, titles, and behavior for each site.
For production:
- Use httpd-vhosts-sample.conf or equivalent web server configuration as a starting point.

🔧 Development

Project Structure (essential only)

app.py: Flask application setup and configuration.
routes.py: Main routing and request handling.
shared.py: Shared utilities, feature flags, and caches.
workers.py: Background feed processing.
auto_update.py: AI headline generation and scheduling.
*_report_settings.py: Report-specific configuration.
templates/: Jinja2 templates and modular JS/CSS (edit here).
static/: Bundled assets and images (do not hand-edit generated bundles).
tests/: pytest suite.
config.yaml: Runtime configuration.

Developer Notes

JS/CSS:
- Edit source files in templates/; they are bundled into static/linuxreport.js and static/linuxreport.css.
Caching:
- Multi-layer caching is central to performance.
- See Caching.md or agents.md for deeper technical details.
Tests:
- Use pytest to validate changes.

📖 Documentation

agents.md: Technical guide for AI agents and contributors.
Caching.md: Detailed caching and performance internals.
ROADMAP.md: Planned features and improvements.
Scaling.md: Scaling and performance notes.

🔒 Security

Admin Mode Protection

Admin functionality is protected by authentication:

# config.yaml
admin:
  password: "CHANGE_THIS_DEFAULT_PASSWORD"

⚠️ IMPORTANT: Change the default password immediately after installation!

Security Features

Rate Limiting: Configurable per-endpoint throttling
Input Validation: Secure file uploads and form processing
CORS Protection: Configurable domain allowlists
Security Headers: XSS protection, content type validation
IP Blocking: Persistent banned IP storage

🚀 Production Deployment (quick overview)

Use a WSGI-capable web server (e.g., Apache with mod_wsgi, or gunicorn/uwsgi + nginx).
Use httpd-vhosts-sample.conf as a reference if deploying with Apache.
Run background tasks (e.g., headline updates) via systemd timers or cron:
- Example units/scripts are provided; adjust paths and commands for your environment.

🤝 Contributing

We welcome contributions! Please:

Fork the repository
Create a feature branch
Run tests: pytest tests/
Submit a pull request

Feel free to request new RSS feeds or suggest improvements.

📈 Performance (summary)

LinuxReport is designed to be fast in real-world deployments:

Multi-layer caching minimizes database reads and external calls.
Concurrent processing handles many feeds efficiently.
Works well with multi-process setups; each process uses its own in-memory cache on top of shared persistent cache.

📄 License

This project is free and open source software released under the GNU Lesser General Public License v3.0 (LGPL v3). See the LICENSE file for complete details.

CDN and Static Asset Delivery

Optional CDN/object storage integration via s3cmd.
Long cache headers for static assets.
Configuration driven from config.yaml.

Built with ❤️ for the free and open source community

Name		Name	Last commit message	Last commit date
Latest commit History 1,646 Commits
.vscode		.vscode
static		static
templates		templates
tests		tests
.gitignore		.gitignore
Caching.md		Caching.md
FeedHistory.py		FeedHistory.py
LICENSE		LICENSE
LLMModelManager.py		LLMModelManager.py
Logging.py		Logging.py
ObjectStorageLock.py		ObjectStorageLock.py
PLAYWRIGHT_MIGRATION.md		PLAYWRIGHT_MIGRATION.md
PWA.md		PWA.md
README.md		README.md
README_object_storage_sync.md		README_object_storage_sync.md
ROADMAP.md		ROADMAP.md
Reddit.py		Reddit.py
Scaling.md		Scaling.md
Scraping.md		Scraping.md
SqliteLock.py		SqliteLock.py
Tor.py		Tor.py
WEATHER_IMPROVEMENTS.md		WEATHER_IMPROVEMENTS.md
__init__.py		__init__.py
admin_stats.py		admin_stats.py
advanced_minimize_requirements.py		advanced_minimize_requirements.py
agents.md		agents.md
ai_report_settings.py		ai_report_settings.py
analyze_feed_activity.py		analyze_feed_activity.py
app.py		app.py
app_config.py		app_config.py
auto_update.py		auto_update.py
auto_update_visualization.py		auto_update_visualization.py
browser_fetch.py		browser_fetch.py
caching.py		caching.py
chat.py		chat.py
config.py		config.py
config.yaml		config.yaml
convert_png_to_webp.py		convert_png_to_webp.py
covid_report_settings.py		covid_report_settings.py
deploy.py		deploy.py
deploy.sh		deploy.sh
embeddings_dedup.py		embeddings_dedup.py
feedfilter.py		feedfilter.py
forms.py		forms.py
function_dependency_graph.svg		function_dependency_graph.svg
generate_dependency_graph.py		generate_dependency_graph.py
generate_docs.py		generate_docs.py
generate_themed_logo.py		generate_themed_logo.py
headers.md		headers.md
html_generation.py		html_generation.py
httpd-vhosts-sample.conf		httpd-vhosts-sample.conf
image_fetch.py		image_fetch.py
install_cpu_ml_deps.sh		install_cpu_ml_deps.sh
linux_report_settings.py		linux_report_settings.py
linuxreportabove.html		linuxreportabove.html
log_engine.py		log_engine.py
models.py		models.py
object_storage_config.py		object_storage_config.py
object_storage_sync.py		object_storage_sync.py
old_headlines.py		old_headlines.py
openvino_server.py		openvino_server.py
performance_analytics.py		performance_analytics.py
performance_comparison.py		performance_comparison.py
playwrightfetch.py		playwrightfetch.py
process_logo.py		process_logo.py
pulls.sh		pulls.sh
pv_report_settings.py		pv_report_settings.py
pyproject.toml		pyproject.toml
request_utils.py		request_utils.py
requirements.txt		requirements.txt
robot_report_settings.py		robot_report_settings.py
routes.py		routes.py
seleniumfetch.py		seleniumfetch.py
setup_gunicorn_multi_app.py		setup_gunicorn_multi_app.py
shared.py		shared.py
simple_proxy.py		simple_proxy.py
site_debugger.py		site_debugger.py
space_report_settings.py		space_report_settings.py
sync_static.py		sync_static.py
techno_report_settings.py		techno_report_settings.py
technoreportabove.html		technoreportabove.html
test_random_ua.py		test_random_ua.py
test_site_debug.py		test_site_debug.py
trump_report_settings.py		trump_report_settings.py
trumpreportabove.html		trumpreportabove.html
update-headlines.service		update-headlines.service
update-headlines.timer		update-headlines.timer
update_all_logos.sh		update_all_logos.sh
update_free_models.py		update_free_models.py
update_headlines.sh		update_headlines.sh
visitor_map.py		visitor_map.py
weather.py		weather.py
weather_widget_debugging_summary.md		weather_widget_debugging_summary.md
workers.py		workers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LinuxReport - Multi-Platform News Aggregation

🌐 Live Sites

✨ Key Features

🧠 AI-Powered Headlines

🚀 Quick Start

🏗️ Architecture Overview

📋 Configuration

🔧 Development

Project Structure (essential only)

Developer Notes

📖 Documentation

🔒 Security

Admin Mode Protection

Security Features

🚀 Production Deployment (quick overview)

🤝 Contributing

📈 Performance (summary)

📄 License

CDN and Static Asset Delivery

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LinuxReport - Multi-Platform News Aggregation

🌐 Live Sites

✨ Key Features

🧠 AI-Powered Headlines

🚀 Quick Start

🏗️ Architecture Overview

📋 Configuration

🔧 Development

Project Structure (essential only)

Developer Notes

📖 Documentation

🔒 Security

Admin Mode Protection

Security Features

🚀 Production Deployment (quick overview)

🤝 Contributing

📈 Performance (summary)

📄 License

CDN and Static Asset Delivery

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages