Democratizing NLP!
☆106 · Dec 6, 2023 · Updated 2 years ago
Alternatives and similar repositories for NLP-OSS
Users that are interested in NLP-OSS are comparing it to the libraries listed below.
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot ☆13 · Dec 18, 2020 · Updated 5 years ago
- Simple CORPORA list crawler ☆10 · Dec 2, 2016 · Updated 9 years ago
- A library for data streaming and augmentation ☆21 · May 5, 2025 · Updated 10 months ago
- Code for the AAAI 2023 Paper "Real or Fake Text?: Investigating Human Ability to Detect Boundaries Between Human-Written and Machine-Gene… ☆17 · Oct 29, 2024 · Updated last year
- Code base for NAACL 2016 paper ☆15 · Apr 9, 2018 · Updated 7 years ago
- Repository that hosts the work done in the framework of the Computational Literary Studies Project (2020-2025). ☆17 · May 21, 2025 · Updated 10 months ago
- Lightweight piece tokenization library ☆12 · Apr 15, 2024 · Updated last year
- Scripts to explore and visualize distributional semantic models using graphs. ☆24 · Sep 19, 2017 · Updated 8 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni… ☆12 · Dec 15, 2023 · Updated 2 years ago
- ☆17 · May 6, 2022 · Updated 3 years ago
- Structured attention encoder ☆13 · Jun 6, 2018 · Updated 7 years ago
- Fast Word Clustering Software ☆79 · Feb 8, 2025 · Updated last year
- Helpers for constructing scikit-learn grid search ☆39 · Feb 16, 2020 · Updated 6 years ago
- Code for the ACL 2022 Paper "A Feasibility Study of Answer-Agnostic Question Generation for Education" ☆16 · Jul 5, 2022 · Updated 3 years ago
- Tools for the 3rd edition of the Constraint Grammar formalism. ☆25 · Feb 25, 2026 · Updated last month
- C++ implementation of Generalised Brown clustering and Python scripts for feature generation ☆41 · Apr 8, 2016 · Updated 9 years ago
- Analyzing and visualizing rental listings data ☆12 · Feb 28, 2019 · Updated 7 years ago
- Isan NLP ☆17 · Mar 27, 2024 · Updated 2 years ago
- Java library to tokenize Thai text into a list of TCCs ☆19 · May 30, 2017 · Updated 8 years ago
- An open-source framework for modeling real-time conversations in spoken dialogue systems. ☆27 · Aug 12, 2022 · Updated 3 years ago
- Transition-based UCCA Parser ☆74 · Dec 14, 2020 · Updated 5 years ago
- VoxAngeles Corpus ☆14 · Aug 23, 2025 · Updated 7 months ago
- Code for the paper "Latent Relation Language Models" at AAAI-20. ☆41 · Sep 22, 2025 · Updated 6 months ago
- Leaderboards are widely used in NLP and push the field forward. While leaderboards are a straightforward ranking of NLP models, this simp… ☆18 · Mar 30, 2022 · Updated 4 years ago
- Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents ☆30 · Jul 22, 2020 · Updated 5 years ago
- A framework for evaluating Machine Translation models. ☆12 · May 26, 2025 · Updated 10 months ago
- Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024) ☆59 · Updated this week
- Matrix tools for building and inspecting latent spaces ☆27 · Aug 19, 2018 · Updated 7 years ago
- Instantiate objects and call functions using dictionary configs in Python using Genos. ☆10 · Jun 19, 2023 · Updated 2 years ago
- A conda-smithy repository for spacy. ☆14 · Updated this week
- WordWanderer – take your text for a walk ☆12 · May 14, 2019 · Updated 6 years ago
- Simple extension of WikiExtractor (https://github.com/attardi/wikiextractor) ☆16 · Dec 23, 2016 · Updated 9 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge. ☆70 · Sep 14, 2021 · Updated 4 years ago
- KenLM extension for spaCy 2.0. ☆16 · Dec 6, 2017 · Updated 8 years ago
- SWAN: Saar Web-based ANotation system ☆14 · May 16, 2019 · Updated 6 years ago
- NLP course at Chulalongkorn University 2019 ☆21 · Mar 28, 2019 · Updated 7 years ago
- C++ code of "Learning to Parse and Translate Improves Neural Machine Translation" ☆21 · May 8, 2017 · Updated 8 years ago
- Notes and code for the workshop "Rule-Based Models for Regression and Classification" ☆13 · May 21, 2016 · Updated 9 years ago
- Repository for the ACL 2020 virtual conference website (work in progress) ☆39 · Mar 26, 2022 · Updated 4 years ago