π° Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
β1,082Apr 11, 2026Updated this week
Alternatives and similar repositories for newspaper4k
Users that are interested in newspaper4k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository provides usage examples for the Python module Newspaper3k.β152Jan 2, 2024Updated 2 years ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:β15,024Mar 23, 2026Updated 3 weeks ago
- news-please - an integrated web crawler and information extractor for news that just worksβ2,408Sep 21, 2025Updated 6 months ago
- A Happy and lightweight Python Package that Provides an API to search for articles on Google News and returns a JSON response.β962Jan 16, 2026Updated 2 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XMβ¦β5,703Sep 12, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Converts all website content into a text file for uploading to a custom GPTβ38Jan 18, 2025Updated last year
- fast python port of arc90's readability tool, updated to match latest readability.js!β2,895Jan 26, 2026Updated 2 months ago
- Text Behind Video. Enjoy it is completely free.β30Feb 15, 2025Updated last year
- A CLI tool that bundles source code files into a single context for LLM promptsβ21Jan 9, 2025Updated last year
- Simulate human behavior with mass LLMsβ28Oct 23, 2024Updated last year
- Brofile is a utility app which grants you with a better link handling abilities (works on my machine)β45Jun 4, 2025Updated 10 months ago
- A fast and reliable Telegram channel scraper that fetches posts and exports them to JSON.β272Apr 15, 2025Updated 11 months ago
- A very simple news crawler with a funny nameβ446Updated this week
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.htmlβ905Apr 1, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python scraper based on AIβ23,249Apr 7, 2026Updated last week
- Developed a machine learning model to detect media bias in news articles. Employed natural language processing techniques to analyze textβ¦β10Sep 6, 2025Updated 7 months ago
- π‘ All-in-one AI framework for semantic search, LLM orchestration and language model workflowsβ12,395Updated this week
- IntelliJ Plugin that offers an infinite canvas to organize code bookmarksβ18May 31, 2025Updated 10 months ago
- Turn any document into ready-to-use AI image prompts.β54Sep 3, 2025Updated 7 months ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applicβ¦β60Feb 24, 2025Updated last year
- Article extraction benchmark: dataset and evaluation scriptsβ362Updated this week
- A browser-based tool for comparing and combining before/after images. No server needed, runs entirely in your browser.β17Jan 13, 2025Updated last year
- π Intelligent browser header & fingerprint generatorβ1,048Feb 26, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Automatically extract documents from images and perspectively correct them with classic computer-vision algorithms. In maintenance mode. β¦β87Aug 24, 2025Updated 7 months ago
- An automated discovery engine that monitors multiple platforms to capture high-value, time-sensitive opportunities in the digital gaming β¦β25Apr 9, 2025Updated last year
- Automates Telegram message digests using Claude AI for summaries and Replicate API for image generation, sending results to saved messageβ¦β55Mar 10, 2025Updated last year
- Web app for reading and analyzing exported WhatsApp chat files with a clean, intuitive interface and powerful search and analyticsβ37Dec 17, 2024Updated last year
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing aβ¦β42,652Updated this week
- structured outputs for llmsβ12,749Updated this week
- Repurpose your YouTube videos by converting them into blog posts.β175May 1, 2024Updated last year
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pacβ¦β297May 19, 2025Updated 10 months ago
- dynamic YAML-driven URL shortener and command mapper with real-time config updatesβ20Aug 28, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- partdec is a command-line utility for multipart downloading and file splitting. Download a file in parts simultaneously.β56Sep 26, 2025Updated 6 months ago
- Script for GoogleNewsβ378Mar 20, 2026Updated 3 weeks ago
- A standalone version of the readability libβ11,094Jan 21, 2026Updated 2 months ago
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ63,955Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/β10,518May 8, 2025Updated 11 months ago
- VirtualBox Web Control Panel is a lightweight HTTP server script providing a simple web interface to list, control, and interact with Virβ¦β25Apr 15, 2025Updated 11 months ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023β47Sep 1, 2024Updated last year