MENYO-20k corpus from "The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation" (MT Summit 2021)
☆13Jan 16, 2023Updated 3 years ago
Alternatives and similar repositories for menyo-20k_MT
Users that are interested in menyo-20k_MT are comparing it to the libraries listed below.
- Repo & Project for the Imminent Research Grant code & tasks☆12May 20, 2024Updated last year
- Building an effective preprocessing tool for African languages☆13Jan 24, 2024Updated 2 years ago
- ☆13Oct 3, 2024Updated last year
- ☆13Jan 14, 2026Updated 2 months ago
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆23Mar 22, 2026Updated last week
- Automatic Diacritic Restoration of Yorùbá language Text☆25Jul 25, 2024Updated last year
- ☆10Apr 17, 2024Updated last year
- MAFAND-MT☆61Jul 9, 2024Updated last year
- ☆119Oct 15, 2025Updated 5 months ago
- Semantic Priming Across Many Languages (PSA Proposal)☆18Jan 3, 2026Updated 2 months ago
- [ACL 2023] S3HQA: A Three-Stage Approach for Multi-hop Text-Table Hybrid Question Answering☆20Jun 8, 2025Updated 9 months ago
- COMET for African languages☆11Jan 24, 2025Updated last year
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Chinese-English Neural machine translation with Encoder-Decoder seq2seq model : Bidirection-GRU + Fasttext word embedding + Attention + …☆20Dec 30, 2021Updated 4 years ago
- Python and JavaScript code used in the post "Bezier Curves and Picasso"☆21Jun 9, 2020Updated 5 years ago
- Named Entity Recognition in Nepali Language☆10Jan 12, 2023Updated 3 years ago
- Code and data for "Heterogeneous Supervised Topic Models"☆10Jun 27, 2022Updated 3 years ago
- This contains my solution for the Gender-Based Violence Tweet Classification Challenge hosted on Zindi☆15Nov 15, 2021Updated 4 years ago
- ☆10Mar 11, 2026Updated 2 weeks ago
- AIR retriever for Multi-Hop QA (ACL 2020 paper)☆30Jul 18, 2020Updated 5 years ago
- A Python code repository for downloading files from YouTube videos and URLs, with support for progress tracking and Google Drive integrat…☆12Jun 8, 2023Updated 2 years ago
- PyTorch code for the EMNLP 2020 paper "Embedding Words in Non-Vector Space with Unsupervised Graph Learning"☆41Feb 18, 2021Updated 5 years ago
- Experiments for XLM-V Transformers Integration☆13Feb 8, 2023Updated 3 years ago
- A lightweight, user-friendly data-plane for LLM training.☆38Sep 10, 2025Updated 6 months ago
- Hanja Understanding Evaluation Dataset☆15May 2, 2022Updated 3 years ago
- A family of efficient speech models for multilingual phone recognition☆53Feb 12, 2026Updated last month
- Scripts to create speech corpora from open.bible☆13Jan 3, 2022Updated 4 years ago
- An analytical project in Python that processes a WhatsApp text file and extracts valuable insights from it, deployed as a webapp us…☆19Dec 8, 2023Updated 2 years ago
- ☆11Apr 2, 2024Updated last year
- Pretraining scripts for BART transformer model☆12May 15, 2023Updated 2 years ago
- Code for Auditing Data Provenance in Text-Generation Models (in KDD 2019)☆10Jun 18, 2019Updated 6 years ago
- The repo for the ACL 2021 Findings paper "Don't Miss the Labels: Label-semantic Augmented Meta-Learner for Few-Shot Text Classification"☆15Mar 24, 2022Updated 4 years ago
- Semantic similarity of sentences using wordnet nltk and basic numpy☆19Apr 8, 2021Updated 4 years ago
- Build website in minutes using our UI components, sections and pages with easy to use customization options.☆16Jul 4, 2024Updated last year
- Placeholder repository☆15Mar 16, 2022Updated 4 years ago
- COMS20012 Computer Systems B☆14Mar 12, 2026Updated 2 weeks ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆13Aug 15, 2022Updated 3 years ago
- CVPR 2021 Official PyTorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Personal information identification standard☆21Jan 24, 2024Updated 2 years ago