📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
☆1,057Mar 21, 2026Updated this week
Alternatives and similar repositories for newspaper4k
Users that are interested in newspaper4k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository provides usage examples for the Python module Newspaper3k.☆152Jan 2, 2024Updated 2 years ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆15,010Dec 6, 2025Updated 3 months ago
- news-please - an integrated web crawler and information extractor for news that just works☆2,401Sep 21, 2025Updated 6 months ago
- A Happy and lightweight Python Package that Provides an API to search for articles on Google News and returns a JSON response.☆955Jan 16, 2026Updated 2 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆5,569Sep 12, 2025Updated 6 months ago
- Converts all website content into a text file for uploading to a custom GPT☆38Jan 18, 2025Updated last year
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,895Jan 26, 2026Updated last month
- Text Behind Video. Enjoy it is completely free.☆31Feb 15, 2025Updated last year
- A CLI tool that bundles source code files into a single context for LLM prompts☆21Jan 9, 2025Updated last year
- Brofile is a utility app which grants you with a better link handling abilities (works on my machine)☆45Jun 4, 2025Updated 9 months ago
- A fast and reliable Telegram channel scraper that fetches posts and exports them to JSON.☆272Apr 15, 2025Updated 11 months ago
- A very simple news crawler with a funny name☆443Mar 17, 2026Updated last week
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆904Feb 6, 2026Updated last month
- Python scraper based on AI☆23,032Mar 17, 2026Updated last week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,322Updated this week
- IntelliJ Plugin that offers an infinite canvas to organize code bookmarks☆18May 31, 2025Updated 9 months ago
- Turn any document into ready-to-use AI image prompts.☆54Sep 3, 2025Updated 6 months ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic…☆60Feb 24, 2025Updated last year
- Article extraction benchmark: dataset and evaluation scripts☆358Mar 1, 2026Updated 3 weeks ago
- 🎭 Intelligent browser header & fingerprint generator☆1,016Feb 26, 2026Updated 3 weeks ago
- A browser-based tool for comparing and combining before/after images. No server needed, runs entirely in your browser.☆17Jan 13, 2025Updated last year
- Automatically extract documents from images and perspectively correct them with classic computer-vision algorithms. In maintenance mode. …☆86Aug 24, 2025Updated 7 months ago
- An automated discovery engine that monitors multiple platforms to capture high-value, time-sensitive opportunities in the digital gaming …☆26Apr 9, 2025Updated 11 months ago
- Automates Telegram message digests using Claude AI for summaries and Replicate API for image generation, sending results to saved message…☆55Mar 10, 2025Updated last year
- Web app for reading and analyzing exported WhatsApp chat files with a clean, intuitive interface and powerful search and analytics☆36Dec 17, 2024Updated last year
- structured outputs for llms☆12,589Updated this week
- Repurpose your YouTube videos by converting them into blog posts.☆175May 1, 2024Updated last year
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆39,597Updated this week
- A fork of Dragnet that also extract author, headline, date, keywords from context, as well as built in metadata extraction all in one pac…☆300May 19, 2025Updated 10 months ago
- dynamic YAML-driven URL shortener and command mapper with real-time config updates☆20Aug 28, 2025Updated 6 months ago
- partdec is a command-line utility for multipart downloading and file splitting. Download a file in parts simultaneously.☆56Sep 26, 2025Updated 5 months ago
- Script for GoogleNews☆376Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆62,480Updated this week
- A standalone version of the readability lib☆11,036Jan 21, 2026Updated 2 months ago
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆10,284May 8, 2025Updated 10 months ago
- VirtualBox Web Control Panel is a lightweight HTTP server script providing a simple web interface to list, control, and interact with Vir…☆25Apr 15, 2025Updated 11 months ago
- Official code for the paper "GestSync: Determining who is speaking without a talking head" published at BMVC 2023☆47Sep 1, 2024Updated last year
- Swap your face in real-time☆75Mar 11, 2026Updated last week
- DSPy: The framework for programming—not prompting—language models☆33,038Updated this week