Python wrapper for the CWB to extract concordances and score frequency lists
☆22Jan 12, 2026Updated 3 months ago
Alternatives and similar repositories for cwb-ccc
Users that are interested in cwb-ccc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Dec 2, 2024Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Mar 30, 2026Updated 2 weeks ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- Easy language identification of 380 languages☆17Dec 2, 2019Updated 6 years ago
- Course on Language Technologies and NLP☆15May 15, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Editor for aligned parallel texts (personal desktop application).☆20Jan 15, 2026Updated 3 months ago
- Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpu…☆19Jan 7, 2026Updated 3 months ago
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆20Jul 5, 2024Updated last year
- PaliNLP reworked. Version 2.☆17Jun 29, 2015Updated 10 years ago
- Corpus Annotation Graph builder (CAG) is an architectural framework that employs the build-and-annotate pattern for creating a graph.☆14Dec 7, 2023Updated 2 years ago
- A Python database interface for eXist-db☆15Mar 1, 2026Updated last month
- A lightweight web-based annotation tool for labelling entity recognition data.☆23Aug 19, 2024Updated last year
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.☆13Dec 7, 2023Updated 2 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆15Dec 6, 2025Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- code and data used to build a training dataset for dragnet models☆10Nov 29, 2020Updated 5 years ago
- Preview the current helm file selection.☆11Nov 21, 2025Updated 4 months ago
- Multi-Langauge Identification☆28Jul 25, 2024Updated last year
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 6 months ago
- The Mindee fork of Origami, a pure Ruby library to parse, modify and generate PDF documents.☆13May 20, 2025Updated 10 months ago
- Benson turns a list of URLs into mp3s of the contents of each web page - take control over your reading backlog!☆16Oct 30, 2024Updated last year
- Ready-to-use examples of dkpro-core components and pipelines.☆35Dec 16, 2023Updated 2 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- R-package for text mining with the Corpus Workbench (CWB) as backend☆49Mar 26, 2025Updated last year
- Pretraining summarization models using a corpus of nonsense☆13Sep 28, 2021Updated 4 years ago
- hydra-pl-wandb-sample-project is a NN experiment management code using hydra, pytorch-lightinig, and wandb.☆11Nov 22, 2021Updated 4 years ago
- 🛠 Live JavaScript RegExp tester☆12Mar 3, 2025Updated last year
- Unofficial Grammarly extension☆20Jan 10, 2024Updated 2 years ago
- ☆10Jul 23, 2021Updated 4 years ago
- A repository of sample code designed to help you Tweet random dog facts☆15Sep 23, 2022Updated 3 years ago
- Evaluate language models using multiple choice items☆13Mar 6, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Tool to bulk follow accounts related Open Science on Mastodon. Runs at https://germanrepro.github.io/Mastodon-OpenScience/ Based on the D…☆16Mar 26, 2026Updated 3 weeks ago
- Learning to Hash for Maximum Inner Product Search☆12Jan 21, 2022Updated 4 years ago
- Command-line (CLI) coffee journal designed for coffee enthusiasts. (https://codeberg.org/mrus/kopi)☆14Dec 15, 2025Updated 4 months ago
- Random Bingo Sheet for DB delays☆16Oct 3, 2024Updated last year
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆24Oct 27, 2023Updated 2 years ago
- Open-source models of NH and HI auditory processing☆11Jun 20, 2024Updated last year
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆32Jan 22, 2024Updated 2 years ago