Basic dataset for the linguistic data collection.
☆15Feb 13, 2017Updated 9 years ago
Alternatives and similar repositories for linguistic-data
Users that are interested in linguistic-data are comparing it to the libraries listed below
Sorting:
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- PyAnnotation is a Python Library to access and manipulate linguistically annotated corpus files.☆17Sep 4, 2012Updated 13 years ago
- A Python implementation of word2vec that allows custom sampling strategies☆10Jan 30, 2014Updated 12 years ago
- A tool for automatic comparison and evaluation of RST trees☆12Apr 10, 2025Updated 10 months ago
- NYT Risk Semantics Project☆12Mar 5, 2016Updated 9 years ago
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12May 18, 2017Updated 8 years ago
- A simple configurable tool for manipulating dependency trees.☆14Dec 25, 2024Updated last year
- 2016 Presidential Campaign Speeches☆15Oct 25, 2016Updated 9 years ago
- Javascript tokenizer for english sentences☆14Oct 15, 2015Updated 10 years ago
- Bilingual sentence aligner (Gale & Church, 1993)☆14Jan 8, 2026Updated last month
- a latex cheat sheet with ipython commands and shortcuts☆10Mar 10, 2014Updated 11 years ago
- The Potsdam Twitter Sentiment Corpus☆18Jan 15, 2020Updated 6 years ago
- Interlinear glossing with JS & CSS☆20Aug 23, 2015Updated 10 years ago
- Sentiment Lexicon Generation Suite☆15Dec 4, 2017Updated 8 years ago
- Extract, parse and populate templates from strings☆27Apr 4, 2019Updated 6 years ago
- pronunciation LEXicons for Any Low-resource Language☆21Jul 14, 2020Updated 5 years ago
- A framework to build and train linguistics neural models☆19Apr 8, 2016Updated 9 years ago
- Parsing Time: Learning to Interpret Time Expressions☆31Apr 14, 2023Updated 2 years ago
- a lua template engine like a famous python template engine jinja2☆35May 10, 2013Updated 12 years ago
- ☆21Apr 4, 2015Updated 10 years ago
- prettyprint is a python module to output list/dict/tuple object prettily.☆29Nov 22, 2021Updated 4 years ago
- ☆46Oct 15, 2013Updated 12 years ago
- Fast and robust NLP components implemented in Java.☆53Oct 13, 2020Updated 5 years ago
- Spectral Word Embedding Learning for Language (SWELL) toolkit☆27Jul 24, 2014Updated 11 years ago
- A compact library for C99 (and MSVC in C++ mode) providing refcounted arrays, maps, lists and a cool lexical scanner.☆43Apr 5, 2017Updated 8 years ago
- Easy-first dependency parser based on Hierarchical Tree LSTMs☆32Nov 30, 2016Updated 9 years ago
- Discontinuous Data-Oriented Parsing☆46Jan 5, 2024Updated 2 years ago
- One-file utility module filled with helper functions for day to day Python programming☆43Jan 24, 2020Updated 6 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Dec 16, 2023Updated 2 years ago
- Tagger for questions posted on StackExchange Network☆38Sep 23, 2017Updated 8 years ago
- Templates etc. for creating experiments using Ibex Farm.☆11Jul 21, 2018Updated 7 years ago
- Translation of query languages to serialized KoralQuery protocol☆13Feb 23, 2026Updated last week
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- A very simple module to generate SVG sprites.☆10May 4, 2017Updated 8 years ago
- A collection of various discourse segmenters☆10Jun 30, 2017Updated 8 years ago
- Package to Train LANs (Likelihood approximation networks)☆13Feb 10, 2026Updated 2 weeks ago
- Code for reproducing the results in "Mining Semantic Affordances of Visual Object Categories"☆12Jun 10, 2024Updated last year
- FFT Explorations (basic implementation)☆10Aug 8, 2014Updated 11 years ago
- Tools and scripts for working with ELAN☆10Aug 4, 2022Updated 3 years ago