Democratizing NLP!
☆106Dec 6, 2023Updated 2 years ago
Alternatives and similar repositories for NLP-OSS
Users that are interested in NLP-OSS are comparing it to the libraries listed below
Sorting:
- Simple CORPORA list crawler☆10Dec 2, 2016Updated 9 years ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- Code base for NAACL 2016 paper☆15Apr 9, 2018Updated 7 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- A library for data streaming and augmentation☆21May 5, 2025Updated 10 months ago
- ☆17May 6, 2022Updated 3 years ago
- A fast, simple, multilingual tokenizer☆29May 24, 2017Updated 8 years ago
- The Community-enRiched Open WordNet (CROWN)☆18Dec 3, 2015Updated 10 years ago
- Java library to tokenize Thai text into a list of TCCs☆19May 30, 2017Updated 8 years ago
- Fast Word Clustering Software☆79Feb 8, 2025Updated last year
- Simple extension of WikiExtractor(https://github.com/attardi/wikiextractor)☆16Dec 23, 2016Updated 9 years ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 9 months ago
- Code for the paper "Latent Relation Language Models" at AAAI-20.☆41Sep 22, 2025Updated 5 months ago
- structured attention encoder☆13Jun 6, 2018Updated 7 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆70Sep 14, 2021Updated 4 years ago
- Scripts to explore and visualize distributional semantic models using graphs.☆24Sep 19, 2017Updated 8 years ago
- Joint multi-task emotion deep neural model for emotion classification in multigenre.☆14May 10, 2024Updated last year
- Basic dataset for the linguistic data collection.☆15Feb 13, 2017Updated 9 years ago
- Encode / decode varints.☆14May 24, 2021Updated 4 years ago
- SWAN: Saar Web-based ANotation system☆14May 16, 2019Updated 6 years ago
- Repository that hosts the work done in the framework of the Computational Literary Studies Project (2020-2025).☆17May 21, 2025Updated 9 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Analyzing and visualizing rental listings data☆12Feb 28, 2019Updated 7 years ago
- Transition-based UCCA Parser☆74Dec 14, 2020Updated 5 years ago
- Code for Unsupervised Discovery of Multimodal Links in Multi-Image/Multi-Sentence Documents☆30Jul 22, 2020Updated 5 years ago
- Embeddings for n-grams☆11Jun 22, 2018Updated 7 years ago
- Python package to augment multilingual data☆15Feb 15, 2023Updated 3 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- Conference notes for AAAI 2019☆15Feb 1, 2019Updated 7 years ago
- 세종 구문 분석 말뭉치의 의존 구문 구조로의 변환 도구☆10Sep 7, 2018Updated 7 years ago
- ☆14Jun 22, 2020Updated 5 years ago
- Improving Sentiment Analysis with Multi-task Learning of Negation☆14May 6, 2021Updated 4 years ago
- ☆11Nov 20, 2020Updated 5 years ago
- A conda-smithy repository for spacy.☆14Nov 18, 2025Updated 3 months ago
- A visualisation tool for Spacy using Hierplane.☆65Jan 25, 2023Updated 3 years ago
- Neural machine translation implementation using dynet's python bindings☆17Jan 24, 2018Updated 8 years ago
- 2016 Presidential Campaign Speeches☆15Oct 25, 2016Updated 9 years ago
- KenLM extension for spaCy 2.0.☆16Dec 6, 2017Updated 8 years ago
- NLP course at Chulalongkorn University 2019☆21Mar 28, 2019Updated 6 years ago