Brand24-AI/mms_benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Brand24-AI/mms_benchmark)

Brand24-AI / mms_benchmark

The most extensive open massively multilingual corpus of datasets for training sentiment models. The corpus consists of 79 manually selected from over 350 datasets reported in the scientific literature based on strict quality criteria and covers 27 languages.

☆16

Alternatives and similar repositories for mms_benchmark

Users that are interested in mms_benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MKuranowski / PKPIntercityGTFS
View on GitHub
Create GTFS data for PKP Intercity.
☆10May 5, 2026Updated 2 months ago
kraina-ai / overturemaestro
View on GitHub
An open-source tool for reading OvertureMaps data with multiprocessing and additional Quality-of-Life features
☆36Jul 6, 2026Updated 2 weeks ago
jorisvandenbossche / python-geoarrow
View on GitHub
Storing geometry data in Apache Arrow format
☆14Jun 1, 2022Updated 4 years ago
kraina-ai / hex2vec
View on GitHub
hex2vec - Context-Aware Embedding H3 Hexagons withOpenStreetMap Tags
☆26May 2, 2023Updated 3 years ago
tkipf / ica
View on GitHub
Python implementation of the Iterative Classification Algorithm
☆35Jan 12, 2017Updated 9 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
CLARIN-PL / embeddings
View on GitHub
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polis…
☆37May 20, 2026Updated 2 months ago
KlubJagiellonski / pola-web
View on GitHub
Pola pomoże Ci odnaleźć polskie wyroby. Zabierając Polę na zakupy odnajdujesz produkty “z duszą” i wspierasz polską gospodarkę.
☆14Jul 2, 2026Updated 2 weeks ago
YaoXinZhi / BERT-for-20NewsGroups
View on GitHub
《2021医学健康数据分析与挖掘》课程论文 -- 基于BERT的20NewsGroups数据集新闻分类实验
☆10Jun 22, 2021Updated 5 years ago
home-assistant-tutorials / 02.hello-world-card
View on GitHub
Writing your first card for Home Assistant
☆16May 24, 2023Updated 3 years ago
MSR-LIT / MultilingualBias
View on GitHub
☆10Jul 6, 2023Updated 3 years ago
matrixorigin / matrixorigin.io.cn
View on GitHub
☆12Jul 6, 2026Updated 2 weeks ago
DCSaunders / gender-debias
View on GitHub
Adaptation datasets and scripts for the paper "Reducing gender bias in Neural Machine Translation as a domain adaptation problem" (ACL 20…
☆13Mar 18, 2021Updated 5 years ago
vilsonrodrigues / youtube-retrieval-qa
View on GitHub
ChatTube: A Retrieval QA System to Youtube Videos
☆10Jun 6, 2023Updated 3 years ago
kanekomasahiro / bias_eval_in_multiple_mlm
View on GitHub
☆11Jul 7, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
OmkarAcharekar / ChatValve
View on GitHub
Real Time Chat Application
☆14Dec 20, 2022Updated 3 years ago
tarekziade / mwcat
View on GitHub
MediaWiki Categories Model
☆13Feb 14, 2024Updated 2 years ago
Cobra16319 / 100_Days_Of_Go
View on GitHub
100 days of Go learning
☆28Sep 22, 2021Updated 4 years ago
CLARIN-PL / LEPISZCZE
View on GitHub
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
☆15May 20, 2026Updated 2 months ago
jinlanfu / Polyglot_Prompt
View on GitHub
Code and dataset for Polyglot Prompting: Multilingual Multitask Prompt Training.
☆18Dec 7, 2022Updated 3 years ago
bytedance / AncientDoc
View on GitHub
☆15Oct 10, 2025Updated 9 months ago
dair-ai / llm-evaluator
View on GitHub
Example for Logging LLM Evaluator Prompt Responses
☆18Aug 14, 2023Updated 2 years ago
wannaphong / thaigpt-next
View on GitHub
It is fine-tune the GPT-Neo model for Thai language.
☆12Jun 30, 2021Updated 5 years ago
KlubJagiellonski / pola-ios
View on GitHub
Pola pomoże Ci odnaleźć polskie wyroby. Zabierając Polę na zakupy odnajdujesz produkty “z duszą” i wspierasz polską gospodarkę.
☆23Nov 21, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ipipan / spacy-pl
View on GitHub
☆51Aug 22, 2022Updated 3 years ago
bastien-roucaries / latex-pax
View on GitHub
☆15Jun 7, 2022Updated 4 years ago
learnable-game-engines / lge
View on GitHub
☆14Mar 23, 2023Updated 3 years ago
allegro / AlleNoise
View on GitHub
☆14Mar 28, 2025Updated last year
goodmike31 / pl-asr-speech-data-survey
View on GitHub
Survey of available speech datasets for Polish ASR development
☆17Jan 1, 2025Updated last year
mam-dev / security-constraints
View on GitHub
Fetches security vulnerabilities and creates pip-constraints based on them.
☆12Jan 27, 2025Updated last year
nasim-alamdari / RealTime-Custom-Keyword-Spotting
View on GitHub
Implementation and Deployment of Multilingual Custom Keyword Spotting Running in Real-time on an Edge Device.
☆11Apr 27, 2023Updated 3 years ago
danijel3 / ClarinStudioKaldi
View on GitHub
A baseline Automatic Speech Recognition system for Polish based on Kaldi.
☆18Dec 21, 2021Updated 4 years ago
MetricsDI / DIMetrics
View on GitHub
☆10May 25, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
robmsmt / SpeechLoop
View on GitHub
Many ASRs under one roof. With Benchmarking... answering the question. What is the best ASR for my dataset?
☆19Oct 5, 2022Updated 3 years ago
home-assistant-tutorials / 01.development-environment
View on GitHub
Setting up the development environment for the tutorials
☆29May 24, 2023Updated 3 years ago
senghe / TheGame
View on GitHub
☆36Dec 11, 2023Updated 2 years ago
JunjieHu / xtreme-dev
View on GitHub
Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)
☆22Apr 11, 2020Updated 6 years ago
marionbartl / gender-bias-BERT
View on GitHub
This repository holds the code for my master thesis entitles "The Association of Gender Bias with BERT - Measuring, Mitigating and Cross-…
☆18Sep 19, 2022Updated 3 years ago
gaussalgo / adaptor
View on GitHub
ACL 2022: Adaptor: a library to easily adapt a language model to your own task, domain, or custom objective(s).
☆28Mar 28, 2025Updated last year
Madhuvod / VoxLingua
View on GitHub
A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…
☆17May 19, 2025Updated last year