Multi-way parallel text corpus of 5 key Ugandan languages.
☆17May 6, 2024Updated last year
Alternatives and similar repositories for salt-data-archive
Users who are interested in salt-data-archive are comparing it to the libraries listed below.
- POS for African languages☆19Jun 25, 2025Updated 9 months ago
- ☆12Nov 9, 2018Updated 7 years ago
- The official implementation of "Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks" (NAACL 2022).☆44Dec 25, 2022Updated 3 years ago
- Goldfish: Monolingual language models for 350 languages.☆24Mar 4, 2026Updated 3 weeks ago
- MIDict (Multi-Index Dict) can be indexed by any "keys" or "values", suitable as a bidirectional/inverse dict or a multi-key/multi-value d…☆14May 19, 2016Updated 9 years ago
- Precise type-checker for JavaScript☆11Oct 23, 2025Updated 5 months ago
- The definitive collection of is* functions for runtime type checking. Lodash-compatible, tree-shakable, with types.☆17Jan 25, 2025Updated last year
- An abstract, safe, and concise color conversion library for Rust nightly. This requires the feature adt_const_params.☆12Nov 18, 2022Updated 3 years ago
- Simplification of street network geometry☆33Jan 5, 2026Updated 2 months ago
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated 2 years ago
- Poor man's algebraic effects for TypeScript (PoC); next -> https://github.com/susisu/effectful☆23Jan 9, 2023Updated 3 years ago
- A parallel corpus of Sorani, Kurmanji and English☆15Oct 6, 2020Updated 5 years ago
- pialign - A Phrasal ITG Aligner☆24Apr 29, 2019Updated 6 years ago
- A collection of utilities used in exploring data augmentation of low-resource parallel corpuses. …☆11Sep 6, 2017Updated 8 years ago
- Minangkabau NLP corpus. PACLIC 2020☆10Jun 7, 2021Updated 4 years ago
- Python scripts and datasets of the "Extremely Low-Resource Neural Machine Translation: A Case Study of Cantonese" project☆16Oct 28, 2022Updated 3 years ago
- This is the experimental description of MnTTS2.☆11Apr 11, 2024Updated last year
- Symmetrized word alignment models, based on mgizapp and GIZA++☆14Jun 23, 2014Updated 11 years ago
- Python asynchronous library for web scraping☆10Aug 24, 2021Updated 4 years ago
- Code associated with paper: Orthogonal Machine Learning for Demand Estimation: High-Dimensional Causal Inference in Dynamic Panels, Seme…☆27May 10, 2023Updated 2 years ago
- [DEPRECATED] Symbolic MIDI Music AI implementation☆20Jun 11, 2022Updated 3 years ago
- A stream processor language.☆14Dec 14, 2023Updated 2 years ago
- A collection of textual datasets in Hausa language and the corresponding translation in English language.☆16Mar 5, 2021Updated 5 years ago
- Trigram files for 500+ languages☆25Mar 21, 2025Updated last year
- Toki Pona corpus for NLTK☆15Dec 29, 2018Updated 7 years ago
- Code and Data for Paper "Controlling Styles in Neural Machine Translation with Activation Prompt" (ACL 2023 Findings)☆16Dec 20, 2022Updated 3 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Jan 24, 2017Updated 9 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆18May 31, 2023Updated 2 years ago
- ndarray/tensor data processing for modern browsers☆16May 4, 2023Updated 2 years ago
- Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…☆21Mar 16, 2023Updated 3 years ago
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Jun 14, 2024Updated last year
- Searches for a query across different search engines and collects all the resulting links.☆14Jun 25, 2023Updated 2 years ago
- Type-Level Programming in Rust☆27Dec 29, 2021Updated 4 years ago
- bilingual dictionary extractor from parallel corpora☆23Jul 3, 2014Updated 11 years ago
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆16Jul 27, 2020Updated 5 years ago
- Code to reproduce the experiments presented in the article "Data-Efficient Playlist Captioning With Musical and Linguistic Knowledge" (EM…☆18Dec 21, 2022Updated 3 years ago
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆36Feb 5, 2026Updated last month
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Nov 30, 2022Updated 3 years ago
- A generic implementation of Negamax in Rust.☆14Mar 4, 2026Updated 3 weeks ago