kristopherkyle/corpus_toolkit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kristopherkyle/corpus_toolkit)

kristopherkyle / corpus_toolkit

A simple toolkit for conducting analyses using corpus methods

☆28

Alternatives and similar repositories for corpus_toolkit

Users that are interested in corpus_toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kristopherkyle / corpus-analysis-python
View on GitHub
☆14Oct 30, 2025Updated 8 months ago
kristopherkyle / lexical_diversity
View on GitHub
This is a simple Python package for calculating a variety of lexical diversity indices
☆84Sep 15, 2023Updated 2 years ago
tanloong / neosca
View on GitHub
L2SCA & LCA fork: cross-platform, GUI, without Java dependency
☆45Apr 10, 2026Updated 3 months ago
kristopherkyle / TAASSC
View on GitHub
Tool for the Automatic Analysis of Syntactic Sophistication and Complexity
☆31Nov 4, 2023Updated 2 years ago
ssharoff / biberpy
View on GitHub
Python version for Doug Biber's Multidimensional Analysis (MDA)
☆41May 24, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
mshakirDr / MFTE
View on GitHub
MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…
☆31Jun 1, 2026Updated last month
liao961120 / concordancer
View on GitHub
Searching in-memory corpus with Corpus Query Language (CQL)
☆19Dec 2, 2024Updated last year
kristopherkyle / LxGrTgr
View on GitHub
Lexicogrammatical Tagger
☆15May 12, 2026Updated 2 months ago
iris2hu / Chinese-collocation-complexity
View on GitHub
☆24Aug 24, 2023Updated 2 years ago
LCR-ADS-Lab / TAALED
View on GitHub
Tool for the automatic assessment of lexical diversity
☆14Sep 6, 2025Updated 10 months ago
shyyhs / CourseraParallelCorpusMining
View on GitHub
Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation
☆15Aug 27, 2024Updated last year
palipced / palipced.github.io
View on GitHub
pced pali tipitaka canon electronic dictionary
☆12Jun 15, 2024Updated 2 years ago
dylnbk / chatty-v2
View on GitHub
Streamlit Multi AI Platform Chat App
☆10Nov 5, 2024Updated last year
NYUCCL / duolingoSLAM
View on GitHub
2018 Duolingo Shared Task on Second Language Acquisition Modeling (SLAM) (http://sharedtask.duolingo.com/)
☆12May 31, 2018Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Cronos87 / love2d-docset-generator
View on GitHub
A Love2d docset generator
☆10Dec 8, 2019Updated 6 years ago
nlp-tlp / maintie
View on GitHub
Maintenance Information Extraction (MaintIE)
☆21Jun 29, 2024Updated 2 years ago
giellalt / lang-crk
View on GitHub
Finite state and Constraint Grammar based analysers and proofing tools, and language resources for the Plains Cree language
☆16Jun 3, 2026Updated last month
chozelinek / europarl
View on GitHub
Toolkit to compile a comparable/parallel corpus from European Parliament proceedings
☆17Jan 26, 2020Updated 6 years ago
interrogator / buzz
View on GitHub
linguistics backend
☆42Mar 25, 2023Updated 3 years ago
CocoTan1020 / MLF-BERT
View on GitHub
基于多层级语言特征融合的中文文本可读性分级模型
☆12Feb 27, 2024Updated 2 years ago
emorynlp / ddr
View on GitHub
Deep Dependency Representation
☆16May 9, 2018Updated 8 years ago
youngmihuang / word2vec
View on GitHub
☆13Dec 3, 2017Updated 8 years ago
aaren / thesis
View on GitHub
My PhD thesis (in progress!)
☆15Oct 23, 2016Updated 9 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
UniversalDependencies / UD_English-GUM
View on GitHub
☆41Jul 10, 2026Updated last week
NLPAssignment / metaphor-detection
View on GitHub
Metaphor detection using NLP techniques, made in Python using NLTK
☆17Nov 30, 2013Updated 12 years ago
zesch / lang-tech-teaching-public
View on GitHub
☆16Jan 24, 2022Updated 4 years ago
jasonbahl / gatsby-conf-2021
View on GitHub
Gatsby + WordPress + WPGraphQL: Example repo used for GatsbyConf 2021 workshop
☆11Feb 25, 2021Updated 5 years ago
ranaroussi / trading_calendars
View on GitHub
Calendars for various securities exchanges.
☆14Dec 18, 2020Updated 5 years ago
mishajw / repeng
View on GitHub
Experiments with representation engineering
☆14Feb 28, 2024Updated 2 years ago
thiippal / AI2D-RST
View on GitHub
A repository for the AI2D-RST corpus.
☆18Jul 2, 2024Updated 2 years ago
elsevierlabs / OA-STM-Corpus
View on GitHub
Corpus of Open Access articles from multiple fields in Science, Technology, and Medicine.
☆75Mar 28, 2017Updated 9 years ago
altryne / llm-evals-workshop
View on GitHub
Materials for the LLM Evals Workshop from Weights & BIases
☆15Feb 24, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
SapienzaNLP / gsrl
View on GitHub
GSRL is a seq2seq model for end-to-end dependency- and span-based SRL (IJCAI2021).
☆18Sep 14, 2021Updated 4 years ago
dongjunKANG / VIM
View on GitHub
☆11Oct 16, 2023Updated 2 years ago
ispras / atr4s
View on GitHub
Toolkit with state-of-the-art Automatic Terms Recognition methods in Scala
☆36Jul 23, 2018Updated 7 years ago
LivingSkyTechnologies / Document_Layout_Segmentation
View on GitHub
Repository to use/train segmentation models for document layout analysis
☆19Jan 13, 2022Updated 4 years ago
bigartm / visartm
View on GitHub
☆18Apr 25, 2018Updated 8 years ago
MPIDR / Global-flows-and-rates-of-international-migration-of-scholars
View on GitHub
Scripts and data to reproduce the paper Global flows and rates of international migration of scholars
☆18Jun 7, 2026Updated last month
IBM / graph4nlp
View on GitHub
Graph4NLP is the library for the easy use of Graph Neural Networks for Natural Language Processing
☆15Feb 12, 2021Updated 5 years ago