Python wrapper for the CWB to extract concordances and score frequency lists
☆22Jan 12, 2026Updated 3 months ago
Alternatives and similar repositories for cwb-ccc
Users that are interested in cwb-ccc are comparing it to the libraries listed below.
- A containerized all-in-one solution for CQPWeb☆18Jan 22, 2023Updated 3 years ago
- Script that sets up and configures an entire CQPweb server installation☆11Dec 1, 2019Updated 6 years ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Dec 2, 2024Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Updated this week
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- Easy language identification of 380 languages☆17Dec 2, 2019Updated 6 years ago
- Docker image having a compiled nginx with rtmp module☆11Jun 27, 2020Updated 5 years ago
- Collaborative on-line editor for aligned parallel texts.☆13Nov 24, 2025Updated 5 months ago
- Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpu…☆19Jan 7, 2026Updated 4 months ago
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆20Jul 5, 2024Updated last year
- Dockerization of brat application☆13Jun 13, 2018Updated 7 years ago
- PaliNLP reworked. Version 2.☆17Jun 29, 2015Updated 10 years ago
- Official NetworkX mirror☆22Jul 26, 2011Updated 14 years ago
- ☆17Jan 5, 2023Updated 3 years ago
- NLP-helper for OCR-ed pages in PAGE XML format☆10Dec 6, 2024Updated last year
- A Python database interface for eXist-db☆15May 2, 2026Updated last week
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 6 months ago
- ☆30May 30, 2023Updated 2 years ago
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.☆13Dec 7, 2023Updated 2 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆15Dec 6, 2025Updated 5 months ago
- HuCit KB: a knowledge base of classical texts and citable text units.☆11Nov 17, 2021Updated 4 years ago
- Digital Research Toolkit for Linguists course materials☆12Jul 23, 2025Updated 9 months ago
- Repo of the Turing's Humanities & Data Science Discussion Group☆13Jul 21, 2022Updated 3 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- code and data used to build a training dataset for dragnet models☆10Nov 29, 2020Updated 5 years ago
- rim provides an interface to Maxima for R. Maxima is a powerful and fairly complete computer algebra system.☆12Nov 25, 2025Updated 5 months ago
- Basis of FragDenStaat.de's „Koalitionstracker“☆15Jul 14, 2025Updated 9 months ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- Multi-Language Identification☆28Jul 25, 2024Updated last year
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 6 months ago
- Benson turns a list of URLs into mp3s of the contents of each web page - take control over your reading backlog!☆16Oct 30, 2024Updated last year
- Terminal UI for monitoring SLURM jobs☆15Mar 29, 2026Updated last month
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆22Jan 8, 2024Updated 2 years ago
- Training files for Greek cursive script (in early print)☆15May 26, 2021Updated 4 years ago
- A reddit bot that finds original publish dates on linked articles.☆10Nov 30, 2024Updated last year
- Online Pāli Dictionary and Pāli Tipiṭaka implemented in Go programming language.☆31Oct 26, 2023Updated 2 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- 1st Place Solution to iWildcam 2021: Count the number of animals of each species present in a sequence of images☆12Jun 24, 2021Updated 4 years ago