clarinsi / tweetcat
TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions
☆12Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for tweetcat
- Graph-based tool for disambiguation and linking of named entities to Linked Data sets for Digital Humanities and heritage texts☆27Updated 3 years ago
- spaCy-to-naf converter☆21Updated 5 months ago
- A Named-Entity Recogniser based on Grobid.☆49Updated 2 months ago
- Multi Tier Annotation Search☆26Updated 3 years ago
- GermaNER: Free Open German Named Entity Recognition Tool☆36Updated 11 months ago
- A simple configurable tool for manipulating dependency trees.☆13Updated 6 months ago
- Netherlands eScience Center - Shifting Concepts Through Time project☆26Updated 2 years ago
- Topic Modeling Workflow in Python☆16Updated last year
- Wrapper for DKPro Core to extract lingustic information from books.☆16Updated 2 years ago
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions☆19Updated last year
- A tool for text normalisation via character-level machine translation☆13Updated 4 years ago
- IXA pipes Part of Speech tagger and Lemmatizer (http://ixa2.si.ehu.es/ixa-pipes)☆17Updated 2 years ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Updated 5 months ago
- Identifying Historical People, Places and other Entities: Shared Task on Named Entity Recognition and Linking on Historical Newspapers at…☆22Updated 3 months ago
- A collection of notebooks for Natural Language Processing☆24Updated 3 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German"☆18Updated 3 years ago
- Coreference resolution for German☆16Updated 7 years ago
- Collection de romans français du dix-huitième siècle (1751-1800) / Collection of Eighteenth-Century French Novels (1751-1800)☆22Updated 6 months ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the…☆32Updated 8 years ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.g…☆110Updated 4 months ago
- Wikidata embedding☆50Updated 2 weeks ago
- Basic dataset for the linguistic data collection.☆15Updated 7 years ago
- OCRopus model for Gothic print (Fraktur)☆18Updated 4 years ago
- A Python library for topic modeling and visualization☆64Updated 4 years ago
- A tool for automatic spelling normalization☆20Updated 3 years ago
- ☆17Updated 9 years ago
- Citation Classification using hybrid neural network model for Wikipedia References☆28Updated last year
- Within-book topic modeling on HTRC feature extraction files☆23Updated 8 years ago
- CrowdTruth framework for crowdsourcing ground truth for training & evaluation of AI systems☆57Updated 7 months ago
- 🚀GUI for training spaCy models☆53Updated 3 years ago