Geralt-Targaryen/MC-Evaluation

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Geralt-Targaryen/MC-Evaluation)

Geralt-Targaryen / MC-Evaluation

☆14

Alternatives and similar repositories for MC-Evaluation

Users that are interested in MC-Evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zwhong714 / adaptive_decoding
View on GitHub
[ICML2024]Adaptive decoding balances the diversity and coherence of open-ended text generation.
☆19Jun 2, 2024Updated 2 years ago
QwenLM / PolyMath
View on GitHub
[NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"
☆43May 22, 2025Updated last year
huacheng1985 / TheRaschBook
View on GitHub
Companion to BER 670: Rasch Techniques for Constructing and Evaluating Measurement Instruments
☆10Oct 27, 2021Updated 4 years ago
KbsdJames / omni-math-rule
View on GitHub
The rule-based evaluation subset and code implementation of Omni-MATH
☆28Dec 23, 2024Updated last year
MahaThafar / DTi2Vec
View on GitHub
This repository provides an implementation of the DTi2Vec tool, to identify Drug-Target interaction using network embedding and ensemble …
☆12Sep 28, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zwhe99 / WMT22-En-Liv
View on GitHub
[WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian
☆23May 4, 2023Updated 3 years ago
Alsace08 / OOD-Math-Reasoning
View on GitHub
[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"
☆28May 28, 2024Updated 2 years ago
codefuse-ai / Awesome-Omnimodal-Embeddings
View on GitHub
☆18Dec 9, 2025Updated 7 months ago
SWE-bench / reading-list
View on GitHub
Academic papers and works related to SWE-bench and SWE-agents
☆15Dec 8, 2025Updated 7 months ago
THUNLP-MT / Template-NMT
View on GitHub
☆23Nov 15, 2022Updated 3 years ago
WHB139426 / GCG
View on GitHub
Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]
☆10Jul 22, 2024Updated 2 years ago
tangg555 / acl-anthology-helper
View on GitHub
To help search, filter, and download papers from 'acl anthology' (https://aclanthology.org/).
☆18Sep 12, 2024Updated last year
tobna / TaylorShift
View on GitHub
This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…
☆15Feb 25, 2026Updated 5 months ago
zhongwanjun / CARP
View on GitHub
code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…
☆12Sep 16, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
K-Kuyama / yet-another-UI-for-AW
View on GitHub
UI for ActivityWatch. Include category editor and viewer for multiple categorizations.
☆10Jan 31, 2024Updated 2 years ago
mizuno-group / ChiralityMisunderstanding
View on GitHub
Transformer in Chemical Language Model sometimes misunderstands chirality
☆13Apr 19, 2024Updated 2 years ago
keiji / region_cropper
View on GitHub
Help creating image dataset for machine learning.
☆10Nov 4, 2020Updated 5 years ago
chenllliang / MLS
View on GitHub
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022
☆18May 19, 2022Updated 4 years ago
amzn / faithful-data2text-cycle-training
View on GitHub
☆11Jul 11, 2023Updated 3 years ago
teddy-gustiaux / github-notifications-rss-feed
View on GitHub
Application to generate an RSS feed from your GitHub notifications.
☆13Dec 8, 2022Updated 3 years ago
chujiezheng / LLM-MCQ-Bias
View on GitHub
Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"
☆43May 20, 2025Updated last year
cosmoquester / transformers-bart-pretrain
View on GitHub
Script to pre-train hugginface transformers BART with Tensorflow 2
☆35Apr 13, 2023Updated 3 years ago
PrimeIntellect-ai / diloco_simple
View on GitHub
torch implementation of diloco
☆25Jul 17, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
AI4Bharat / FBI
View on GitHub
FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists
☆31Aug 14, 2025Updated 11 months ago
kenkenpa2126 / vanilla_transformer_from_scratch_with_JAX
View on GitHub
☆10Dec 18, 2023Updated 2 years ago
tomsherborne / zx-parse
View on GitHub
Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022)
☆17May 16, 2022Updated 4 years ago
Shivanshu-Gupta / icl-coverage
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
a01sa01to / TitleAndURL_Picker
View on GitHub
Chrome Extension. As the name suggests.
☆10Jan 30, 2022Updated 4 years ago
giovannicoppola / alfred-yaanki
View on GitHub
yet another anki app
☆14Sep 9, 2024Updated last year
Geralt-Targaryen / Awesome-Education-LLM
View on GitHub
A curated list of LLM researches and applications in education.
☆80Sep 13, 2024Updated last year
lwaekfjlk / awesome-gpt-games
View on GitHub
Create awesome games with GPT
☆33Mar 21, 2023Updated 3 years ago
AppraiseDev / Appraise
View on GitHub
Appraise code used as part of WMT21 human evaluation campaign
☆30Jul 15, 2026Updated 2 weeks ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
lyveng / pandas-hbase
View on GitHub
Pandas Helper Library for reading and writing DataFrames from and to HBase.
☆10Mar 8, 2018Updated 8 years ago
roedoejet / FastSpeech2_ACL2022_reproducibility
View on GitHub
☆21Feb 27, 2024Updated 2 years ago
McGill-NLP / CHASE
View on GitHub
Synthetic Data Generation for Evaluation
☆16Feb 21, 2025Updated last year
WPR001 / Ego-ST
View on GitHub
☆16Sep 25, 2025Updated 10 months ago
starsuzi / DAR
View on GitHub
☆19Sep 19, 2022Updated 3 years ago
yuishihara / A3C-tensorflow
View on GitHub
A3C tensorflow implementation
☆11Jul 22, 2018Updated 8 years ago
DaisukeBekki / JSeM
View on GitHub
Japanese semantic test suite (FraCaS counterpart and extensions)
☆13Apr 21, 2026Updated 3 months ago