LinguisticData/linguistic-data

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LinguisticData/linguistic-data)

LinguisticData / linguistic-data

Basic dataset for the linguistic data collection.

☆15

Alternatives and similar repositories for linguistic-data

Users that are interested in linguistic-data are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

GitbookIO / tokenize-english
View on GitHub
Javascript tokenizer for english sentences
☆14Oct 15, 2015Updated 10 years ago
tastyminerals / ccrawl
View on GitHub
Simple CORPORA list crawler
☆11Dec 2, 2016Updated 9 years ago
clarinsi / tweetcat
View on GitHub
TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions
☆12May 18, 2017Updated 9 years ago
andersjo / word2vec_sampler
View on GitHub
A Python implementation of word2vec that allows custom sampling strategies
☆10Jan 30, 2014Updated 12 years ago
aleksandergurin / simple-object-notation
View on GitHub
SON (Simple Object Notation) data interchange format
☆14Jun 22, 2015Updated 11 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
lmorgadodacosta / CantoneseWN
View on GitHub
The Cantonese Wordnet
☆15Dec 4, 2023Updated 2 years ago
stevedonovan / llib
View on GitHub
A compact library for C99 (and MSVC in C++ mode) providing refcounted arrays, maps, lists and a cool lexical scanner.
☆43Updated this week
tkutschbach / RST-Tace
View on GitHub
A tool for automatic comparison and evaluation of RST trees
☆12Apr 10, 2025Updated last year
parryc / interlinear
View on GitHub
Interlinear glossing with JS & CSS
☆20Aug 23, 2015Updated 10 years ago
interrogator / risk
View on GitHub
NYT Risk Semantics Project
☆12Mar 5, 2016Updated 10 years ago
dowobeha / Gale_and_Church_1993
View on GitHub
Bilingual sentence aligner (Gale & Church, 1993)
☆14Jan 8, 2026Updated 6 months ago
brendano / parseviz
View on GitHub
Visualize constituent and dependency parses as PDF or image formats, through GraphViz.
☆32Feb 11, 2021Updated 5 years ago
WladimirSidorenko / PotTS
View on GitHub
The Potsdam Twitter Sentiment Corpus
☆18Jan 15, 2020Updated 6 years ago
CoryMcCartan / adjustr
View on GitHub
An R package to help assess the sensitivity of a Bayesian model (fitted with Stan) to the specification of its likelihood and priors
☆11May 29, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
deepminder / SLM4J
View on GitHub
Smart Local Moving (SLM) algorithm is an algorithm for community detection (or clustering) in large networks
☆12Aug 12, 2016Updated 9 years ago
lmarti / ipynb-cheat-sheet
View on GitHub
a latex cheat sheet with ipython commands and shortcuts
☆10Mar 10, 2014Updated 12 years ago
amir-zeldes / DepEdit
View on GitHub
A simple configurable tool for manipulating dependency trees.
☆14Dec 25, 2024Updated last year
WladimirSidorenko / SentiLex
View on GitHub
Sentiment Lexicon Generation Suite
☆15Dec 4, 2017Updated 8 years ago
aredridel / lotsawa
View on GitHub
The Marpa parsing alrgorithm in Javascript
☆22Jan 15, 2017Updated 9 years ago
cidles / pyannotation
View on GitHub
PyAnnotation is a Python Library to access and manipulate linguistically annotated corpus files.
☆17Sep 4, 2012Updated 13 years ago
lex4all / lex4all
View on GitHub
pronunciation LEXicons for Any Low-resource Language
☆21Jul 14, 2020Updated 6 years ago
karlstratos / minitagger
View on GitHub
☆21Apr 4, 2015Updated 11 years ago
ropensci / pangoling
View on GitHub
An R package for estimating the log-probabilities of words in a given context using transformer models.
☆12Jun 30, 2026Updated 3 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
nikolasburk / graphqlday-workshop
View on GitHub
☆15Apr 14, 2018Updated 8 years ago
equinn1 / MTH225_Spring2016
View on GitHub
MTH225 Statistics for Science Spring 2016
☆12May 13, 2016Updated 10 years ago
NNBlocks / NNBlocks
View on GitHub
A framework to build and train linguistics neural models
☆19Apr 8, 2016Updated 10 years ago
magrant / Toronto-Psycholinguistics-Experiments
View on GitHub
Templates etc. for creating experiments using Ibex Farm.
☆11Jul 21, 2018Updated 8 years ago
johnymontana / pp-viz
View on GitHub
Visualizing the ICIJ Paradise Papers / Panama Papers / Offshore Leaks data
☆21Nov 29, 2017Updated 8 years ago
nathanvan / rstanmulticore
View on GitHub
A cross-platform R package to run RStan in parallel
☆10Jun 3, 2015Updated 11 years ago
aherbay / erpscope
View on GitHub
A little package to visualize ERPs in R
☆14May 23, 2025Updated last year
taichino / prettyprint
View on GitHub
prettyprint is a python module to output list/dict/tuple object prettily.
☆29Nov 22, 2021Updated 4 years ago
paigecm / 2016-campaign
View on GitHub
2016 Presidential Campaign Speeches
☆15Oct 25, 2016Updated 9 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vsbuffalo / stanhl
View on GitHub
Stan syntax highlighting for knitr
☆14Oct 26, 2020Updated 5 years ago
tdeenes / eegR
View on GitHub
A preliminary version of eegR: an R package to analyze EEG signals
☆15Apr 18, 2021Updated 5 years ago
JoshuaGrams / pep
View on GitHub
A Pint-sized Earley Parser
☆34Sep 12, 2024Updated last year
mrmlnc / svg2sprite
View on GitHub
A very simple module to generate SVG sprites.
☆11May 4, 2017Updated 9 years ago
lighttransport / sss-model
View on GitHub
Test model for subsurface scattering
☆10Aug 26, 2015Updated 10 years ago
turicas / templater
View on GitHub
Extract, parse and populate templates from strings
☆28Apr 4, 2019Updated 7 years ago
lighttransport / francine
View on GitHub
Highly scalable renderer backend
☆10Mar 4, 2019Updated 7 years ago