Code for Multilingual Eval of Generative AI paper published at EMNLP 2023
☆72Mar 6, 2024Updated 2 years ago
Alternatives and similar repositories for Multilingual-Evaluation-of-Generative-AI-MEGA
Users that are interested in Multilingual-Evaluation-of-Generative-AI-MEGA are comparing it to the libraries listed below
Sorting:
- eShopLite - Semantic Search is a reference .NET application implementing an eCommerce site with Search features using Keyword Search and …☆13Apr 24, 2025Updated 10 months ago
- Activate GenAI with Azure☆23Jan 26, 2026Updated last month
- This lab is a 1-day/2-day end-to-end SLM workshop led and developed by AI GBB. Attendees will learn how to quickly and easily perform the…☆45Jan 22, 2026Updated last month
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆96Aug 18, 2023Updated 2 years ago
- Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"☆26Jun 3, 2025Updated 9 months ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks☆12Nov 9, 2021Updated 4 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago
- ☆10Mar 11, 2025Updated 11 months ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- SK Multi agentic advanced orchestration example☆15Feb 20, 2026Updated 2 weeks ago
- 🕸 GlotWeb: Web Indexing for Minority Languages (WWW 2026)☆17Feb 27, 2026Updated last week
- NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning☆26Mar 3, 2025Updated last year
- Research code for the paper "How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models"☆28Oct 3, 2021Updated 4 years ago
- Production-ready Infrastructure as Code, applications, pluggable components, and PlatformOps toolchains that empower organizations to ach…☆51Updated this week
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 4 months ago
- This repository contains additional reference translations for the WMT'14 En-De (newstest2014) and WMT'19 En-Ru (newstest2019) test sets …☆15Aug 31, 2021Updated 4 years ago
- ☆28Feb 24, 2025Updated last year
- This solution converts speech to text and then processes and summarizes the text based on the prompt scenario.☆19Aug 8, 2024Updated last year
- ☆22Nov 8, 2024Updated last year
- Official code for the NeurIPS25 paper "RAT: Bridging RNN Efficiencyand Attention Accuracy in Language Modeling" (https://arxiv.org/abs/25…☆23Dec 10, 2025Updated 2 months ago
- ☆18Feb 25, 2025Updated last year
- ☆18Sep 13, 2024Updated last year
- Repository to help in understanding the Microsoft "GraphRAG" library usage☆16Nov 7, 2024Updated last year
- AI-driven solutions for Health & Life Sciences with Azure !☆20Jun 25, 2025Updated 8 months ago
- ☆51Updated this week
- Implementation for "EpiCoder: Encompassing Diversity and Complexity in Code Generation" (ICML 2025)☆24May 16, 2025Updated 9 months ago
- 🕸 GlotCC Dataset and Pipline -- NeurIPS 2024☆20Apr 6, 2025Updated 11 months ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19Apr 1, 2024Updated last year
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆19May 17, 2022Updated 3 years ago
- Video search using Azure Computer Vision 4 (Florence)☆20Nov 21, 2025Updated 3 months ago
- ☆25May 30, 2025Updated 9 months ago
- Visual search with Azure Computer Vision and Azure Cognitive Search☆21Oct 18, 2023Updated 2 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆26Feb 16, 2026Updated 3 weeks ago
- ☆18Nov 25, 2022Updated 3 years ago
- Develop pro-code personal agents integrated with memory service on Teams☆27May 18, 2025Updated 9 months ago
- Cross-lingual TRansfer Evaluation of Multilingual Encoders (XTREME)☆22Apr 11, 2020Updated 5 years ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆22Nov 10, 2024Updated last year
- German Alpaca Dataset (Cleaned + Translated)☆26Apr 6, 2023Updated 2 years ago
- The geometry of multilingual language model representations (EMNLP 2022).☆22Oct 21, 2022Updated 3 years ago