Python wrapper for the CWB to extract concordances and score frequency lists
☆22May 11, 2026Updated last month
Alternatives and similar repositories for cwb-ccc
Users that are interested in cwb-ccc are comparing it to the libraries listed below.
- A containerized all-in-one solution for CQPWeb☆18Jan 22, 2023Updated 3 years ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Dec 2, 2024Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Updated this week
- A part-of-speech tagger with support for domain adaptation and external resources.☆24Oct 26, 2022Updated 3 years ago
- Collaborative on-line editor for aligned parallel texts.☆14Nov 24, 2025Updated 6 months ago
- Course on Language Technologies and NLP☆15May 15, 2017Updated 9 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆128Updated this week
- Editor for aligned parallel texts (personal desktop application).☆20Jan 15, 2026Updated 5 months ago
- Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpu…☆19Jun 10, 2026Updated last week
- A Python scraping module, that extracts text from articles found in RSS feeds. Uses SQLite as database.☆20Jul 5, 2024Updated last year
- PaliNLP reworked. Version 2.☆17Jun 29, 2015Updated 10 years ago
- Corpus Annotation Graph builder (CAG) is an architectural framework that employs the build-and-annotate pattern for creating a graph.☆14Dec 7, 2023Updated 2 years ago
- A Python database interface for eXist-db☆15May 2, 2026Updated last month
- [COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…☆15Oct 31, 2025Updated 7 months ago
- A lightweight web-based annotation tool for labelling entity recognition data.☆23Aug 19, 2024Updated last year
- Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.☆13Dec 7, 2023Updated 2 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆79Apr 21, 2026Updated last month
- The `hp2xx' program is a versatile tool to convert vector-oriented graphics data given in Hewlett-Packard's HP-GL plotter language into a…☆18Feb 1, 2020Updated 6 years ago
- This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…☆16Dec 6, 2025Updated 6 months ago
- HuCit KB: a knowledge base of classical texts and citable text units.☆11Nov 17, 2021Updated 4 years ago
- Repo of the Turing's Humanities & Data Science Discussion Group☆13Jul 21, 2022Updated 3 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Automated Semantic Analysis of Discourse Markers☆11May 30, 2022Updated 4 years ago
- A Streamlit application to visualize sentence embeddings☆18Dec 21, 2022Updated 3 years ago
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Oct 18, 2025Updated 8 months ago
- [WWW 2026] 🕸 GlotWeb: Web Indexing for Minority Languages☆17Apr 14, 2026Updated 2 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Wikibase extension that allows defining RDF mappings for Wikibase Entities☆16Jun 1, 2026Updated 2 weeks ago
- Benson turns a list of URLs into mp3s of the contents of each web page - take control over your reading backlog!☆16Oct 30, 2024Updated last year
- Terminal UI for monitoring SLURM jobs☆15Mar 29, 2026Updated 2 months ago
- This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph☆21Jan 8, 2024Updated 2 years ago
- Training files for Greek cursive script (in early print)☆15May 26, 2021Updated 5 years ago
- Ready-to-use examples of dkpro-core components and pipelines.☆34Dec 16, 2023Updated 2 years ago
- A reddit bot that finds original publish dates on linked articles.☆10Nov 30, 2024Updated last year
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆13Aug 10, 2023Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆152Dec 9, 2024Updated last year
- Twitter bot that tweets translated arXiv paper summaries☆10Dec 11, 2021Updated 4 years ago