📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
☆1,094Apr 30, 2026Updated this week
Alternatives and similar repositories for newspaper4k
Users that are interested in newspaper4k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository provides usage examples for the Python module Newspaper3k.☆152Jan 2, 2024Updated 2 years ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆15,036Apr 16, 2026Updated 2 weeks ago
- news-please - an integrated web crawler and information extractor for news that just works☆2,443Apr 14, 2026Updated 2 weeks ago
- A Happy and lightweight Python Package that Provides an API to search for articles on Google News and returns a JSON response.☆966Jan 16, 2026Updated 3 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆5,866Sep 12, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Converts all website content into a text file for uploading to a custom GPT☆38Jan 18, 2025Updated last year
- A CLI tool that bundles source code files into a single context for LLM prompts☆21Jan 9, 2025Updated last year
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,895Jan 26, 2026Updated 3 months ago
- Text Behind Video. Enjoy it is completely free.☆31Feb 15, 2025Updated last year
- Brofile is a utility app which grants you with a better link handling abilities (works on my machine)☆46Jun 4, 2025Updated 11 months ago
- A fast and reliable Telegram channel scraper that fetches posts and exports them to JSON.☆275Apr 15, 2025Updated last year
- A very simple news crawler with a funny name☆452Apr 27, 2026Updated last week
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆909Apr 1, 2026Updated last month
- Python scraper based on AI☆23,405Apr 26, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,453Updated this week
- IntelliJ Plugin that offers an infinite canvas to organize code bookmarks☆18May 31, 2025Updated 11 months ago
- Turn any document into ready-to-use AI image prompts.☆53Sep 3, 2025Updated 8 months ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic…☆60Feb 24, 2025Updated last year
- A browser-based tool for comparing and combining before/after images. No server needed, runs entirely in your browser.☆17Jan 13, 2025Updated last year
- Article extraction benchmark: dataset and evaluation scripts☆365Apr 23, 2026Updated last week
- Automatically extract documents from images and perspectively correct them with classic computer-vision algorithms. In maintenance mode. …☆87Aug 24, 2025Updated 8 months ago
- gaming market monitor. discover time-sensitive opportunities across multiple platforms.☆25Apr 9, 2025Updated last year
- 🎭 Intelligent browser header & fingerprint generator☆1,072Feb 26, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Automates Telegram message digests using Claude AI for summaries and Replicate API for image generation, sending results to saved message…☆55Mar 10, 2025Updated last year
- Web app for reading and analyzing exported WhatsApp chat files with a clean, intuitive interface and powerful search and analytics☆37Dec 17, 2024Updated last year
- structured outputs for llms☆12,889Apr 22, 2026Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆45,153Updated this week
- Repurpose your YouTube videos by converting them into blog posts.☆174May 1, 2024Updated 2 years ago
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆297May 19, 2025Updated 11 months ago
- partdec is a command-line utility for multipart downloading and file splitting. Download a file in parts simultaneously.☆56Sep 26, 2025Updated 7 months ago
- dynamic YAML-driven URL shortener and command mapper with real-time config updates☆20Aug 28, 2025Updated 8 months ago
- Script for GoogleNews☆380Mar 20, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A standalone version of the readability lib☆11,150Jan 21, 2026Updated 3 months ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆64,964Updated this week
- VirtualBox Web Control Panel is a lightweight HTTP server script providing a simple web interface to list, control, and interact with Vir…☆25Apr 15, 2025Updated last year
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆10,709Apr 16, 2026Updated 2 weeks ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆48Sep 1, 2024Updated last year
- DSPy: The framework for programming—not prompting—language models☆34,180Updated this week
- A Python library for calculating a large variety of metrics from text☆363Mar 20, 2026Updated last month