ausgerechnet/cwb-ccc

Python wrapper for the CWB to extract concordances and score frequency lists

☆22

Alternatives and similar repositories for cwb-ccc

Users that are interested in cwb-ccc are comparing it to the libraries listed below.

mocdaniel / docker-cqpweb
A containerized all-in-one solution for CQPWeb
☆18 · Jan 22, 2023 · Updated 3 years ago
liao961120 / concordancer
Searching in-memory corpus with Corpus Query Language (CQL)
☆19 · Dec 2, 2024 · Updated last year
KorAP / Krill
A Corpus Data Retrieval Index using Lucene for Look-Ups
☆20 · Jul 22, 2026 · Updated last week
tsproisl / SoMeWeTa
A part-of-speech tagger with support for domain adaptation and external resources.
☆24 · Oct 26, 2022 · Updated 3 years ago
gambolputty / newscorpus
A Python scraping module that extracts text from articles found in RSS feeds. Uses SQLite as its database.
☆20 · Jul 5, 2024 · Updated 2 years ago
czcorpus / InterText_editor
Editor for aligned parallel texts (personal desktop application).
☆20 · Jan 15, 2026 · Updated 6 months ago
ddevaraj / docker-brat
Dockerization of brat application
☆13 · Jun 13, 2018 · Updated 8 years ago
wjbmattingly / ww2-spacy
☆17 · Jan 5, 2023 · Updated 3 years ago
UB-Mannheim / blatt
NLP-helper for OCR-ed pages in PAGE XML format
☆10 · Dec 6, 2024 · Updated last year
wjbmattingly / keyword-spacy
Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.
☆14 · Dec 7, 2023 · Updated 2 years ago
interrogator / buzz
linguistics backend
☆43 · Mar 25, 2023 · Updated 3 years ago
mromanello / hucitlib
HuCit KB: a knowledge base of classical texts and citable text units.
☆11 · Nov 17, 2021 · Updated 4 years ago
UB-Mannheim / eScriptorium_Dokumentation
This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation…
☆16 · Dec 6, 2025 · Updated 7 months ago
KorAP / Koral
Translation of query languages to serialized KoralQuery protocol
☆15 · Updated this week
a-nap / Digital-Research-Toolkit
Digital Research Toolkit for Linguists course materials
☆12 · Jul 23, 2025 · Updated last year
KorAP / Tokenizer-Evaluation
Benchmark scripts for comparing different tokenizers and sentence segmenters of German
☆12 · Feb 27, 2023 · Updated 3 years ago
pharos-alexandria / ocr-greek_cursive
Training files for Greek cursive script (in early print)
☆15 · May 26, 2021 · Updated 5 years ago
sileod / DiscSense
Automated Semantic Analysis of Discourse Markers
☆11 · May 30, 2022 · Updated 4 years ago
imohitmayank / sentenceviz
A Streamlit application to visualize sentence embeddings
☆18 · Dec 21, 2022 · Updated 3 years ago
dragnet-org / dragnet_data
code and data used to build a training dataset for dragnet models
☆10 · Nov 29, 2020 · Updated 5 years ago
originell / smaz-py3
Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+
☆13 · Oct 18, 2025 · Updated 9 months ago
okfde / froide-govplan
Basis of FragDenStaat.de's „Koalitionstracker“
☆15 · Jul 14, 2025 · Updated last year
pangaea-data-publisher / qualianon
QualiAnon is a tool to support the anonymization of text data. It is developed by the Qualiservice research data center for the anonymiza…
☆39 · Apr 23, 2026 · Updated 3 months ago
hiredscorelabs / seqtolang
Multi-Language Identification
☆28 · Jul 25, 2024 · Updated 2 years ago
alix-tz / escriptorium-documentation
Source code to eScriptorium Documentation's website (powered with Mkdocs)
☆16 · Jun 1, 2026 · Updated last month
ELTE-DH / NoSketch-Engine-Docker
A NoSketch Engine Docker image which is easy to use
☆22 · Apr 15, 2026 · Updated 3 months ago
WissamAntoun / SlurmTUI
Terminal UI for monitoring SLURM jobs
☆16 · Mar 29, 2026 · Updated 3 months ago
lenakmeth / Wikinflection-Corpus
The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…
☆12 · Dec 15, 2023 · Updated 2 years ago
timoteostewart / benson
Benson turns a list of URLs into mp3s of the contents of each web page - take control over your reading backlog!
☆16 · Oct 30, 2024 · Updated last year
mindee / origamindee
The Mindee fork of Origami, a pure Ruby library to parse, modify and generate PDF documents.
☆13 · May 20, 2025 · Updated last year
tsproisl / SoMaJo
A tokenizer and sentence splitter for German and English web and social media texts.
☆153 · Dec 9, 2024 · Updated last year
jd-coderepos / contributions-ner-cs
This repository hosts the dataset for the paper Computer Science Named Entity Recognition in the Open Research Knowledge Graph
☆21 · Jan 8, 2024 · Updated 2 years ago
dkpro / dkpro-core-examples
Ready-to-use examples of dkpro-core components and pipelines.
☆34 · Dec 16, 2023 · Updated 2 years ago
mitmul / ofChainer
☆11 · Apr 11, 2019 · Updated 7 years ago
internetarchive / sandcrawler
Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki
☆28 · Jul 31, 2024 · Updated last year
sebischair / Exploring-NLP-Research
Repository of the RANLP 2023 paper "Exploring the Landscape of Natural Language Processing Research".
☆13 · Oct 20, 2024 · Updated last year
acmi-lab / pretraining-with-nonsense
Pretraining summarization models using a corpus of nonsense
☆13 · Sep 28, 2021 · Updated 4 years ago
alcunha / iwildcam2021ufam
1st Place Solution to iWildcam 2021: Count the number of animals of each species present in a sequence of images
☆12 · Jun 24, 2021 · Updated 5 years ago
nika2312 / qa_explaination
☆13 · Jul 8, 2020 · Updated 6 years ago