nlp-compromise / nlp-corpus
varied english texts for modern NLP testing
☆75Updated 2 years ago
Alternatives and similar repositories for nlp-corpus:
Users that are interested in nlp-corpus are comparing it to the libraries listed below
- English lexicon useful in NLP/NLU☆15Updated last year
- The Community-enRiched Open WordNet (CROWN)☆19Updated 9 years ago
- Extract Data from Wikipedia Lists☆30Updated 7 years ago
- Python SDK for the TextRazor Text Analytics API☆20Updated last year
- A modular annotation system that supports complex, interactive annotation graphs embedded on top of sequences of text.☆91Updated 3 years ago
- WordNet in JSON format.☆91Updated 4 years ago
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Updated 5 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.☆56Updated 5 years ago
- A tool for visualizing trees, tailored specifically to the analysis of parse trees.☆81Updated 4 years ago
- 🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec☆60Updated 3 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago
- Text summarization using Lexrank☆54Updated 6 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆111Updated this week
- ☆97Updated 3 years ago
- NEWS: JATE2.0 Beta.11 Released, see details below.☆81Updated last year
- Python library for Natural Language Generation (including SimpleNLG wrapper)☆44Updated 2 years ago
- Concept dictionary☆37Updated 9 months ago
- ADEL is a robust and efficient entity linking framework that is adaptive to text genres and language, entity types for the classification…☆17Updated 5 years ago
- A thin GraphQL wrapper around spacy☆21Updated 4 years ago
- command-line tool to extract taxonomies from Wikidata☆125Updated 5 years ago
- Maps clauses from a text corpus onto the metrical structure of a poem☆17Updated 9 years ago
- Regex like pattern tree matching but on sentence's tree instead of Strings☆42Updated 6 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated last year
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL☆86Updated 6 years ago
- Raw Wikipedia counts for entity linking☆19Updated 7 years ago
- An implementation of latent Dirichlet allocation in javascript☆182Updated 2 years ago
- Markov Chain combined with word vector embedding (word2vec) and part-of-speech tagging, for context-aware text generation. License: MIT☆98Updated 7 years ago