The most extensive open massively multilingual corpus of datasets for training sentiment models. The corpus consists of 79 datasets, manually selected from over 350 datasets reported in the scientific literature based on strict quality criteria, and covers 27 languages.
☆16Nov 14, 2023Updated 2 years ago
Alternatives and similar repositories for mms_benchmark
Users that are interested in mms_benchmark are comparing it to the libraries listed below.
- An open-source tool for reading OvertureMaps data with multiprocessing and additional Quality-of-Life features☆32Mar 2, 2026Updated 2 weeks ago
- Create GTFS data for PKP Intercity.☆10Mar 7, 2026Updated 2 weeks ago
- Storing geometry data in Apache Arrow format☆14Jun 1, 2022Updated 3 years ago
- hex2vec - Context-Aware Embedding H3 Hexagons with OpenStreetMap Tags☆25May 2, 2023Updated 2 years ago
- Python implementation of the Iterative Classification Algorithm☆35Jan 12, 2017Updated 9 years ago
- Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…☆37Dec 3, 2023Updated 2 years ago
- Pola helps you find Polish products. By taking Pola shopping, you find products “with soul” and support the Polish economy.☆13Mar 7, 2026Updated 2 weeks ago
- Course paper for "2021 Medical and Health Data Analysis and Mining": a BERT-based news classification experiment on the 20NewsGroups dataset☆10Jun 22, 2021Updated 4 years ago
- Survey of available speech datasets for Polish ASR development☆17Jan 1, 2025Updated last year
- ☆10Jul 6, 2023Updated 2 years ago
- 🎉 TrustJudge is accepted to ICLR 2026!☆38Sep 27, 2025Updated 5 months ago
- ☆11Feb 27, 2026Updated 3 weeks ago
- Writing your first card for Home Assistant☆16May 24, 2023Updated 2 years ago
- ☆11Jul 7, 2023Updated 2 years ago
- ChatTube: A Retrieval QA System to Youtube Videos☆10Jun 6, 2023Updated 2 years ago
- A Medical / Clinical Note Taking Demo Application using Deepgram Voice Agent API☆14Jul 9, 2025Updated 8 months ago
- BenchBench is a Python package to evaluate multi-task benchmarks.☆18Oct 12, 2025Updated 5 months ago
- MediaWiki Categories Model☆13Feb 14, 2024Updated 2 years ago
- Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…☆13Mar 18, 2021Updated 5 years ago
- Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.☆18Dec 7, 2022Updated 3 years ago
- 100 days of Go learning☆28Sep 22, 2021Updated 4 years ago
- This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish☆14Nov 24, 2023Updated 2 years ago
- Example for Logging LLM Evaluator Prompt Responses☆18Aug 14, 2023Updated 2 years ago
- Real Time Chat Application☆14Dec 20, 2022Updated 3 years ago
- Statistical power analyses in Julia☆18Feb 24, 2026Updated 3 weeks ago
- Fine-tunes the GPT-Neo model for the Thai language.☆12Jun 30, 2021Updated 4 years ago
- Julia code to accompany https://probability4datascience.com/index.html☆25Jul 25, 2023Updated 2 years ago
- ☆51Aug 22, 2022Updated 3 years ago
- Thai-English transliteration dictionary☆15Jun 24, 2022Updated 3 years ago
- ☆15Jun 7, 2022Updated 3 years ago
- ☆14Mar 28, 2025Updated 11 months ago
- Pola helps you find Polish products. By taking Pola shopping, you find products “with soul” and support the Polish economy.☆23Nov 21, 2025Updated 4 months ago
- ☆14Mar 23, 2023Updated 3 years ago
- Implementation and Deployment of Multilingual Custom Keyword Spotting Running in Real-time on an Edge Device.☆11Apr 27, 2023Updated 2 years ago
- ☆10May 25, 2022Updated 3 years ago
- Fetches security vulnerabilities and creates pip-constraints based on them.☆12Jan 27, 2025Updated last year
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Apr 11, 2020Updated 5 years ago
- Many ASRs under one roof, with benchmarking... answering the question: what is the best ASR for my dataset?☆19Oct 5, 2022Updated 3 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago