A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.
☆20Jul 5, 2024Updated last year
Alternatives and similar repositories for newscorpus
Users that are interested in newscorpus are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 10 months ago
- A reddit bot that finds original publish dates on linked articles.☆10Nov 30, 2024Updated last year
- ⚙️ Das Backend zu OffeneGesetze.de☆25Jan 11, 2024Updated 2 years ago
- VIVO Dashboard - a semantic application for visualizing publication data☆21Apr 5, 2019Updated 7 years ago
- Docker Image packaging for Pentaho BI Server☆10Jul 6, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Next-generation Punkt sentence boundary detection with zero dependencies☆30Nov 18, 2025Updated 6 months ago
- List of JavaScript modules for Berlin & Brandenburg public transport.☆70Oct 11, 2024Updated last year
- ☆20May 20, 2021Updated 5 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆26Jul 15, 2025Updated 10 months ago
- DEPRECATED eXist code for Syriaca.org: The Syriac Reference Portal☆10Jun 1, 2024Updated last year
- Translation of query languages to serialized KoralQuery protocol☆15May 14, 2026Updated 2 weeks ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆22May 11, 2026Updated 2 weeks ago
- A Flask-Based Web-App for Exploring Unicode☆11Jan 31, 2024Updated 2 years ago
- ChatGPT with access to the internet☆25Jun 16, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Android App for the Laravel news website (Unofficial)☆18Jun 27, 2018Updated 7 years ago
- code and data used to build a training dataset for dragnet models☆10Nov 29, 2020Updated 5 years ago
- A Directory of Online Newspaper Sources for 70+ Languages☆31Apr 15, 2021Updated 5 years ago
- A Geographical Information System, workbench and repository to retrieve, collect, create, enrich and preserve historical temporalized spa…☆18Dec 4, 2024Updated last year
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 7 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Name Authority App written for Django☆13Feb 11, 2026Updated 3 months ago
- A light weight conda interface library☆15Sep 6, 2015Updated 10 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- advent of code 2019 repo☆17Dec 16, 2019Updated 6 years ago
- Intro to spatial analysis in R☆26Apr 11, 2023Updated 3 years ago
- A Map for Refugees☆11Jan 28, 2016Updated 10 years ago
- Fast, permanent and flexible patterns for sharing and computing on texts with metadata using Apache Arrow.☆15Mar 1, 2022Updated 4 years ago
- Tabular Data RDF Reader and JSON serializer☆20Oct 9, 2024Updated last year
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- Basic linked data fragments endpoint.☆15Apr 20, 2017Updated 9 years ago
- Easy teamspeak automation☆10Apr 19, 2023Updated 3 years ago
- Host your Linked Data for free, as static pages, using a variety of providers (GitHub Pages, Google Code, Google Drive, etc.), and run SP…☆18Oct 10, 2014Updated 11 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆28Nov 30, 2020Updated 5 years ago
- Named entity recognition for the legal domain☆43Jun 1, 2021Updated 4 years ago
- ☆16Feb 23, 2015Updated 11 years ago
- a fun project to download a full list of all the public repos for a github user☆14Aug 28, 2022Updated 3 years ago
- A Python library that provides an ergonomic, DOM-like model for XML encoded text documents.☆18May 1, 2026Updated 3 weeks ago
- A Laravel package to ReCompose your installed packages, their dependencies, your app & server environment☆12Oct 18, 2024Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆58May 20, 2026Updated last week