clarinsi / tweetcatLinks
TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions
☆12Updated 8 years ago
Alternatives and similar repositories for tweetcat
Users that are interested in tweetcat are comparing it to the libraries listed below
Sorting:
- Multi Tier Annotation Search☆26Updated 4 years ago
- A simple configurable tool for manipulating dependency trees.☆14Updated 6 months ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆113Updated 5 months ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated 2 years ago
- spaCy-to-naf converter☆21Updated last month
- A Named-Entity Recogniser based on Grobid.☆55Updated 2 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated 2 years ago
- German Morphological Analyzer☆47Updated 3 years ago
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- Coreference resolution for German☆16Updated 8 years ago
- Python bindings to the dutch NLP tool Frog (pos tagger, lemmatiser, NER tagger, morphological analysis, shallow parser, dependency parser…☆49Updated 3 months ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Updated 5 years ago
- The Broad Twitter Corpus, an NER dataset in English stratified for time, location, social media genre, socioeconomic factors (COLING 2016…☆68Updated 3 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆98Updated last month
- A tool for text normalisation via character-level machine translation☆13Updated 5 years ago
- Wikidata embedding☆50Updated 8 months ago
- A natural language processing tool for automatically detecting quotations in text.☆15Updated 3 years ago
- Toolkit to compile a comparable/parallel corpus from European Parliament proceedings☆16Updated 5 years ago
- Linguistic search for large annotated text corpora, based on Apache Lucene☆112Updated last week
- Implementation of a simple frame identification approach (SimpleFrameId) described in the paper "Out-of-domain FrameNet Semantic Role Lab…☆15Updated 8 years ago
- ANNIS is an open source, versatile web browser-based search and visualization architecture for complex multilevel linguistic corpora with…☆75Updated last month
- A structured list of text corpora, created for use with a corpus downloader.☆13Updated 8 years ago
- Command-line tool to extract a ranked list of relevant keywords from a corpus with the option of using either topic modeling or tf-idf sc…☆40Updated 8 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- A tool for automatic spelling normalization☆20Updated 4 years ago
- Homebase of the IPTC EXTRA project about rule-based text categorization☆13Updated 8 years ago
- linguistic converter / merging tool for multi-level annotated corpora. graph-based (using Python and NetworkX).☆50Updated 2 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated last week