clarinsi / tweetcat
TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions
☆12Updated 8 years ago
Alternatives and similar repositories for tweetcat
Users who are interested in tweetcat are comparing it to the repositories listed below
- A simple configurable tool for manipulating dependency trees.☆14Updated 7 months ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 6 months ago
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- spaCy-to-naf converter☆21Updated 2 months ago
- A Named-Entity Recogniser based on Grobid.☆54Updated 2 months ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- The Potsdam Twitter Sentiment Corpus☆18Updated 5 years ago
- Coreference resolution for German☆16Updated 8 years ago
- ☆32Updated 6 years ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- Multi Tier Annotation Search☆26Updated 4 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆44Updated 9 months ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated last week
- A tool for text normalisation via character-level machine translation☆13Updated 5 years ago
- Regex-like pattern matching on a sentence's tree instead of on strings☆42Updated 7 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated 2 years ago
- German Morphological Analyzer☆46Updated 3 years ago
- Named-Entity Recognition for Norwegian Bokmål and Nynorsk☆12Updated 6 years ago
- A collection of notebooks for Natural Language Processing☆25Updated 6 months ago
- Python package for stylometry☆63Updated 4 years ago
- spaCy + UDPipe☆162Updated 3 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆27Updated 3 years ago
- ☆65Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the…☆32Updated 9 years ago
- LaMachine - A software distribution of our in-house as well as some 3rd party NLP software - Virtual Machine, Docker, or local compilatio…☆68Updated last year
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year