Multi-way parallel text corpus of 5 key Ugandan languages.
☆17May 6, 2024Updated last year
Alternatives and similar repositories for salt-data-archive
Users that are interested in salt-data-archive are comparing it to the libraries listed below
Sorting:
- Minangkabau NLP corpus. PACLIC 2020☆10Jun 7, 2021Updated 4 years ago
- Precise type-checker for JavaScript☆11Oct 23, 2025Updated 4 months ago
- Python asynchronous library for web scrapping☆10Aug 24, 2021Updated 4 years ago
- Myanmar and Thai Language Resources☆10Jul 18, 2022Updated 3 years ago
- Rababa, the diacritization library for Arabic and Hebrew (Abjad scripts in general)☆13May 1, 2025Updated 10 months ago
- A Font with extensive coverage of Unicode13 as of March 2020 (part of Unicode Fonts for Ancient Scripts)☆15Mar 26, 2020Updated 5 years ago
- ☆14Mar 30, 2023Updated 2 years ago
- universal tokenizer☆17Nov 29, 2021Updated 4 years ago
- This little python script downloads the content from solidfiles. The reason I came up with this is 'SolidFiles using too much Pop Ups'. J…☆14Dec 20, 2018Updated 7 years ago
- A cross platform (Android/iOS/MacOS) Bahasa Indonesia speech recognizer library, written in Flutter.☆12Nov 18, 2025Updated 3 months ago
- automatically apply for Indeed jobs.☆13Nov 10, 2021Updated 4 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- Scrape Facebook page events(recurring and upcoming), and individual event on new Facebook design☆14Aug 12, 2022Updated 3 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Jun 14, 2024Updated last year
- A stream processor language.☆14Dec 14, 2023Updated 2 years ago
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- Awesome Lao Natural Language Processing☆16Mar 7, 2025Updated 11 months ago
- A collection of utilities used in exploring data augmentation of low-resource parallel corpuses. …☆11Sep 6, 2017Updated 8 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Mar 5, 2021Updated 4 years ago
- Toki Pona corpus for NLTK☆15Dec 29, 2018Updated 7 years ago
- Local experiment manager☆14Jan 16, 2026Updated last month
- Implementation of the multi-objective genetic optimization algorithm NSGA-II☆12Jun 22, 2025Updated 8 months ago
- Search anything on the different Search Engine's it will collect all the links.☆14Jun 25, 2023Updated 2 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Jul 27, 2020Updated 5 years ago
- A generic implementation of Negamax in Rust.☆14Jan 28, 2026Updated last month
- pialign - A Phrasal ITG Aligner☆24Apr 29, 2019Updated 6 years ago
- A game-playing engine (written in Rust) that uses the Minimax Algorithm with alpha-beta pruning for arbitrary two-player Minimax games li…☆14Aug 2, 2022Updated 3 years ago
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Jun 23, 2014Updated 11 years ago
- A python script and a linux service to run your Google Colab everyday automatically in the background using Selenium, systemd and python.☆14Jun 5, 2021Updated 4 years ago
- An open movie recommendation API. Use this to provide movie suggestions to your users in an App or a Website.☆17May 1, 2023Updated 2 years ago
- [DEPRECIATED] Symbolic MIDI Music AI implementation☆20Jun 11, 2022Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- Poor man's algebraic effects for TypeScript (PoC); next -> https://github.com/susisu/effectful☆23Jan 9, 2023Updated 3 years ago
- A TUI studying program☆18Jun 6, 2022Updated 3 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- Extracts plain text, language identification and more metadata from WARC records☆23Oct 1, 2025Updated 5 months ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 2 years ago
- Chinese Dialect Database☆18Jun 18, 2017Updated 8 years ago
- ☆16Apr 19, 2022Updated 3 years ago