A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database.
☆20 · Jul 5, 2024 · Updated last year
Alternatives and similar repositories for newscorpus
Users who are interested in newscorpus are comparing it to the libraries listed below.
- R Shiny App created to predict the success rate of Freedom of Information Act requests. ☆16 · Dec 11, 2017 · Updated 8 years ago
- Searching in-memory corpus with Corpus Query Language (CQL) ☆19 · Dec 2, 2024 · Updated last year
- Basis of FragDenStaat.de's „Koalitionstracker“ ☆15 · Jul 14, 2025 · Updated 11 months ago
- Extract networks of entities from journalistic reporting ☆49 · Jul 17, 2023 · Updated 2 years ago
- ⚙️ The backend for OffeneGesetze.de ☆25 · Jan 11, 2024 · Updated 2 years ago
- VIVO Dashboard - a semantic application for visualizing publication data ☆21 · Apr 5, 2019 · Updated 7 years ago
- Next-generation Punkt sentence boundary detection with zero dependencies ☆31 · Nov 18, 2025 · Updated 7 months ago
- Alternative robots parser module for Python ☆22 · Apr 8, 2026 · Updated 2 months ago
- ☆20 · May 20, 2021 · Updated 5 years ago
- ETL pipeline, graphical explorer, and general toolbox for investigations with follow the money data ☆27 · Jul 15, 2025 · Updated 11 months ago
- DEPRECATED eXist code for Syriaca.org: The Syriac Reference Portal ☆10 · Jun 1, 2024 · Updated 2 years ago
- Translation of query languages to serialized KoralQuery protocol ☆15 · Jun 4, 2026 · Updated 2 weeks ago
- A Linked Data Platform (LDP) Server in Python ☆13 · Apr 24, 2015 · Updated 11 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists ☆22 · May 11, 2026 · Updated last month
- ChatGPT with access to the internet ☆25 · Jun 16, 2023 · Updated 3 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German ☆12 · Feb 27, 2023 · Updated 3 years ago
- Android App for the Laravel news website (Unofficial) ☆18 · Jun 27, 2018 · Updated 7 years ago
- A Geographical Information System, workbench and repository to retrieve, collect, create, enrich and preserve historical temporalized spa… ☆18 · Dec 4, 2024 · Updated last year
- Multi-Language Identification ☆28 · Jul 25, 2024 · Updated last year
- For retrieving data from the ORCID API and crosswalking to VIVO-ISF. ☆11 · Dec 11, 2020 · Updated 5 years ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages ☆17 · Apr 14, 2026 · Updated 2 months ago
- Name Authority App written for Django ☆13 · Feb 11, 2026 · Updated 4 months ago
- A lightweight conda interface library ☆15 · Sep 6, 2015 · Updated 10 years ago
- rightsstatements.org data model ☆13 · Apr 21, 2026 · Updated last month
- Tabular Data RDF Reader and JSON serializer ☆21 · Oct 9, 2024 · Updated last year
- SMOR (Stuttgart Morphology) with alternative lemmatization component ☆13 · Aug 10, 2023 · Updated 2 years ago
- Easy TeamSpeak automation ☆10 · Apr 19, 2023 · Updated 3 years ago
- Host your Linked Data for free, as static pages, using a variety of providers (GitHub Pages, Google Code, Google Drive, etc.), and run SP… ☆18 · Oct 10, 2014 · Updated 11 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki ☆28 · Jul 31, 2024 · Updated last year
- Monorepo containing all addwiki libraries, packages and applications ☆17 · Feb 17, 2026 · Updated 4 months ago
- Named entity recognition for the legal domain ☆43 · Jun 1, 2021 · Updated 5 years ago
- ☆16 · Feb 23, 2015 · Updated 11 years ago
- Create and populate a MySQL Database from Web of Science raw xml data ☆17 · Dec 12, 2016 · Updated 9 years ago
- A repository of sample code designed to help you Tweet random dog facts ☆15 · Sep 23, 2022 · Updated 3 years ago
- Evaluate language models using multiple choice items ☆13 · Mar 6, 2026 · Updated 3 months ago
- Matomo plugin for Docusaurus v2/v3 ☆14 · Dec 3, 2023 · Updated 2 years ago
- Async first supervisord HTTP API Client for PHP 7 ☆16 · Dec 15, 2023 · Updated 2 years ago
- Tool to bulk follow accounts related to Open Science on Mastodon. Runs at https://germanrepro.github.io/Mastodon-OpenScience/ Based on the D… ☆16 · Mar 26, 2026 · Updated 2 months ago
- Internet Research Agency Facebook ads as structured data ☆22 · Dec 10, 2019 · Updated 6 years ago