Democratizing NLP!
☆106Dec 6, 2023Updated 2 years ago
Alternatives and similar repositories for NLP-OSS
Users who are interested in NLP-OSS are comparing it to the libraries listed below.
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- A library for data streaming and augmentation☆21May 5, 2025Updated last year
- Code base for NAACL 2016 paper☆15Apr 9, 2018Updated 8 years ago
- Lightweight piece tokenization library☆12Apr 15, 2024Updated 2 years ago
- Scripts to explore and visualize distributional semantic models using graphs.☆24Sep 19, 2017Updated 8 years ago
- Cynical data selection☆20Jan 16, 2021Updated 5 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- ☆17May 6, 2022Updated 4 years ago
- structured attention encoder☆13Jun 6, 2018Updated 7 years ago
- Fast Word Clustering Software☆79Feb 8, 2025Updated last year
- Helpers for constructing scikit-learn grid search☆40Feb 16, 2020Updated 6 years ago
- Tools for the 3rd edition of the Constraint Grammar formalism.☆26Updated this week
- A simple replacement for gksu/ktsuss etc that allows you to run a program with different privileges ( root etc ).☆14Aug 14, 2024Updated last year
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Apr 8, 2016Updated 10 years ago
- Analyzing and visualizing rental listings data☆12Feb 28, 2019Updated 7 years ago
- Java library to tokenize Thai text into a list of TCCs☆20May 30, 2017Updated 8 years ago
- Transition-based UCCA Parser☆74Dec 14, 2020Updated 5 years ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems.☆27Aug 12, 2022Updated 3 years ago
- A fast, simple, multilingual tokenizer☆29May 24, 2017Updated 8 years ago
- VoxAngeles Corpus☆14Aug 23, 2025Updated 8 months ago
- Finnish data☆11Apr 30, 2026Updated last week
- Code for the paper "Latent Relation Language Models" at AAAI-20.☆41Sep 22, 2025Updated 7 months ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp…☆18Mar 30, 2022Updated 4 years ago
- Matrix tools for building and inspecting latent spaces☆26Aug 19, 2018Updated 7 years ago
- A conda-smithy repository for spacy.☆14Apr 23, 2026Updated 2 weeks ago
- allennlp tutorial for O'Reilly AI Conference, September 2019☆22Sep 10, 2019Updated 6 years ago
- Simple extension of WikiExtractor (https://github.com/attardi/wikiextractor)☆16Dec 23, 2016Updated 9 years ago
- a fork of Ronan Collobert's senna deep learning based NLP tools☆43Feb 5, 2013Updated 13 years ago
- KenLM extension for spaCy 2.0.☆16Dec 6, 2017Updated 8 years ago
- SWAN: Saar Web-based ANotation system☆14May 16, 2019Updated 6 years ago
- C++ code of "Learning to Parse and Translate Improves Neural Machine Translation"☆21May 8, 2017Updated 9 years ago
- Julia interface for SpaCy NLP library☆14Apr 22, 2018Updated 8 years ago
- Repository for the ACL 2020 virtual conference website (work in progress)☆39Mar 26, 2022Updated 4 years ago
- Natural Dialogue System☆22Mar 23, 2016Updated 10 years ago
- Prodigy thing(z)☆12Mar 22, 2018Updated 8 years ago
- NanigoNet — Language detector for code-mixed input supporting 150+19 human+programming languages using deep neural networks☆71May 22, 2023Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Aug 21, 2025Updated 8 months ago
- GOPHI: an AMR-to-English Verbalizer☆11Feb 5, 2020Updated 6 years ago