Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to EMNLP 2022.
☆25Nov 4, 2022Updated 3 years ago
Alternatives and similar repositories for small100
Users that are interested in small100 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "SMaLL-100: Introducing Shallow Multilingual Machine Translation Model for Low-Resource Languages" paper, accepted to E…☆30Feb 8, 2023Updated 3 years ago
- Discovery of Rhyme Schemes in Poetry☆17Nov 22, 2011Updated 14 years ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics".☆16May 3, 2022Updated 4 years ago
- A library of translation-based text similarity measures☆25Dec 11, 2023Updated 2 years ago
- Implementation of paper "Parallelizable Stack Long Short-Term Memory"☆12Apr 8, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- pgvector examples for R☆13May 19, 2026Updated last month
- Next platform mono repo☆11Updated this week
- NTREX -- News Test References for MT Evaluation☆87Jun 5, 2024Updated 2 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (EMNLP 2022)☆19Mar 28, 2023Updated 3 years ago
- X-SRL Dataset. Including the code for the SRL annotation projection tool and an out-of-the-box word alignment tool based on Multilingual …☆15Apr 22, 2021Updated 5 years ago
- R package to interact with the Pushift.io API☆10Aug 4, 2025Updated 10 months ago
- This dataset contains all the 2020 COVID-19 related data from the paper "An Augmented Multilingual Twitter Dataset for Studying the COVID…☆11Jan 20, 2022Updated 4 years ago
- Utility to compute number of mandates based on election results, uting D'Hondt method☆11Sep 6, 2013Updated 12 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Plugin for django CMS – Add comments to the structure board and comment out plugins, visible to staff only☆13Sep 15, 2020Updated 5 years ago
- Code and models for the COLING2020 paper "Bridging the Gap in Multilingual Semantic Role Labeling: a Language-Agnostic Approach".☆13Dec 2, 2022Updated 3 years ago
- LSE Hackathon Challenge: Detecting Online Trolling Behaviour☆10Apr 19, 2018Updated 8 years ago
- Access to the stringi API from within an Rcpp-based Project☆11Feb 3, 2025Updated last year
- Code for paper "AnswerQuest: A System for Generating Question-Answer Items from Multi-Paragraph Documents"☆19Jun 12, 2023Updated 3 years ago
- Aldebaran is a cross-platform (Discord and Revolt) multi-purposes bot which offers useful features to DiscordRPG players along with many …☆11Jan 9, 2024Updated 2 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 5 years ago
- AQuaMuSe is a novel scalable approach to automatically mine dual query based multi-document summarization datasets for extractive and abs…☆17May 13, 2021Updated 5 years ago
- Collection of domains that spread misinformation from various sources☆15May 10, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆36Jun 29, 2025Updated last year
- machine translation data process tools☆10Apr 29, 2024Updated 2 years ago
- R package for working with the CCS Annotator☆13Mar 14, 2024Updated 2 years ago
- ☆10Nov 15, 2020Updated 5 years ago
- A collection of cross-platform social media posts about the 2022 U.S. midterm elections☆13Sep 20, 2024Updated last year
- A repository of social media posts related to the Italian 2022 general election.☆10Oct 23, 2023Updated 2 years ago
- AMR-to-text Generation with Graph Transformer☆18Nov 16, 2020Updated 5 years ago
- Automatically harvested multilingual contrastive word sense disambiguation test sets for machine translation☆18Jan 18, 2021Updated 5 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"☆29Apr 28, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 3 years ago
- ☆20Aug 21, 2020Updated 5 years ago
- This repositary hosts my experiments for the project, I did with OffNote Labs.☆10Apr 12, 2021Updated 5 years ago
- ☆15Nov 5, 2020Updated 5 years ago
- OpenAI R client☆15Apr 18, 2024Updated 2 years ago
- Efficient-Sentence-Embedding-using-Discrete-Cosine-Transform☆17Jul 2, 2020Updated 5 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Apr 2, 2022Updated 4 years ago