clarinsi / tweetcat
TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions
☆12 · May 18, 2017 · Updated 8 years ago
Alternatives and similar repositories for tweetcat
Users that are interested in tweetcat are comparing it to the libraries listed below
- Basic dataset for the linguistic data collection. ☆15 · Feb 13, 2017 · Updated 9 years ago
- This is a new backend implementation of the ANNIS linguistic search and visualization system. ☆18 · Jan 13, 2026 · Updated last month
- A tool for text normalisation via character-level machine translation ☆13 · Jun 12, 2020 · Updated 5 years ago
- generate rules from lists of words ☆16 · Jul 9, 2021 · Updated 4 years ago
- Part of eMOP: the Recursive Text Alignment Tool compares OCR text results to groundtruth by character and computes a score. ☆22 · Sep 24, 2015 · Updated 10 years ago
- This repository ☆30 · Nov 13, 2022 · Updated 3 years ago
- Gazetteer of the Ancient Near East Data ☆10 · Aug 1, 2013 · Updated 12 years ago
- A PHP library for comparing two or more Sanskrit TEI XML files and generating an apparatus with variants ☆14 · Aug 18, 2025 · Updated 5 months ago
- What happens when you connect all the ZIP/postal codes in a country in ascending order? ☆13 · Sep 25, 2024 · Updated last year
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t… ☆35 · Apr 24, 2017 · Updated 8 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests ☆41 · Oct 14, 2022 · Updated 3 years ago
- A repository for the SRN documents database API ☆14 · Feb 24, 2025 · Updated 11 months ago
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles ☆11 · Dec 13, 2018 · Updated 7 years ago
- A Simple Sudoku Solver ☆23 · Nov 26, 2012 · Updated 13 years ago
- Oracc GUI ☆12 · Jun 27, 2025 · Updated 7 months ago
- Project to digitize avant-garde periodicals ☆12 · May 13, 2022 · Updated 3 years ago
- Public Comment Analysis Project for the Federal Chief Data Officer Council. The Comment Analysis pilot has shown that a toolset leveragin… ☆13 · Sep 17, 2021 · Updated 4 years ago
- TEI-encoded contents of the Egyptian Gazette ☆15 · Jun 11, 2024 · Updated last year
- Simple CORPORA list crawler ☆10 · Dec 2, 2016 · Updated 9 years ago
- Statistical discontinuous constituent parsing ☆11 · Feb 15, 2018 · Updated 8 years ago
- ☆10 · Oct 2, 2017 · Updated 8 years ago
- Detect changepoints in time series data ☆11 · Oct 29, 2014 · Updated 11 years ago
- Realtime MIDI IO with Ruby for Windows/Cygwin ☆15 · May 12, 2022 · Updated 3 years ago
- Drizzle ORM adapter for AdminJS ☆13 · Aug 12, 2025 · Updated 6 months ago
- ☆11 · Mar 11, 2016 · Updated 9 years ago
- Sources of Collatinus software - Latin lemmatizer and morphological analyzer ☆11 · Apr 25, 2016 · Updated 9 years ago
- EACL 2021 ☆11 · May 4, 2021 · Updated 4 years ago
- Automatically constructed lexical database for Bangla inspired from Wordnet ☆11 · Jul 12, 2012 · Updated 13 years ago
- Python Commodore BBS multi-client ☆12 · Aug 11, 2022 · Updated 3 years ago
- Write Like Hemingway ☆12 · Nov 28, 2014 · Updated 11 years ago
- ☆11 · Oct 13, 2019 · Updated 6 years ago
- Building the epistemic web ☆13 · Jul 16, 2024 · Updated last year
- Bangla OCR using CNN architecture ☆11 · Nov 27, 2017 · Updated 8 years ago
- A map server for creating, serving and rendering vector tiles ☆10 · Aug 20, 2017 · Updated 8 years ago
- Open Source bits of the core utilities used in PyVmMonitor (http://www.pyvmmonitor.com/) ☆12 · Aug 26, 2024 · Updated last year
- Implement the article 'Towards a Better Way to Teach Dynamic Programming' (Forišek, 2015) as a series of Jupyter notebooks ☆11 · Jul 30, 2020 · Updated 5 years ago
- Website for the osm2pgsql project ☆11 · Updated this week
- JavaScript Sequence Alignment Viewer ☆11 · Mar 25, 2022 · Updated 3 years ago
- An immediate mode GUI that works on top of Canvas2D (it can also work in WebGL) ☆13 · Apr 21, 2021 · Updated 4 years ago