☆13May 21, 2024Updated last year
Alternatives and similar repositories for MC-Evaluation
Users that are interested in MC-Evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML2024]Adaptive decoding balances the diversity and coherence of open-ended text generation.☆19Jun 2, 2024Updated last year
- Companion to BER 670: Rasch Techniques for Constructing and Evaluating Measurement Instruments☆10Oct 27, 2021Updated 4 years ago
- This repository provides an implementation of the DTi2Vec tool, to identify Drug-Target interaction using network embedding and ensemble …☆12Sep 28, 2021Updated 4 years ago
- Reading notes on Speculative Decoding papers☆30Updated this week
- The rule-based evaluation subset and code implementation of Omni-MATH☆27Dec 23, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS 2025 D&B Track] Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"☆43May 22, 2025Updated 10 months ago
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- [WMT 2022] Implementation of TAL-SJTU's system for WMT22 English-Livonian☆23May 4, 2023Updated 2 years ago
- ☆23Nov 15, 2022Updated 3 years ago
- Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering [ACM MM'24]☆10Jul 22, 2024Updated last year
- Official Repository for paper "Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language Mod…☆15Nov 25, 2024Updated last year
- This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…☆14Feb 25, 2026Updated last month
- ☆19May 3, 2025Updated 11 months ago
- code for the table-based open domain question answering project, with paper title: "Reasoning over Hybrid Chain for Table-and-Text Open D…☆12Sep 16, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- To help search, filter, and download papers from 'acl anthology' (https://aclanthology.org/).☆18Sep 12, 2024Updated last year
- Transformer in Chemical Language Model sometimes misunderstands chirality☆13Apr 19, 2024Updated 2 years ago
- Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ACL-2022☆18May 19, 2022Updated 3 years ago
- Help creating image dataset for machine learning.☆10Nov 4, 2020Updated 5 years ago
- Official repository for ICLR 2024 Spotlight paper "Large Language Models Are Not Robust Multiple Choice Selectors"☆43May 20, 2025Updated 11 months ago
- torch implementation of diloco☆22May 31, 2024Updated last year
- MFTE (Multi Feature Tagger of English) Python is the Python version based on Le Foll's MFTE written in Perl. It is extended to include se…☆29Feb 21, 2026Updated last month
- ☆11Jul 11, 2023Updated 2 years ago
- Application to generate an RSS feed from your GitHub notifications.☆13Dec 8, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Script to pre-train hugginface transformers BART with Tensorflow 2☆35Apr 13, 2023Updated 3 years ago
- ☆13Mar 5, 2024Updated 2 years ago
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆20Mar 9, 2025Updated last year
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆31Aug 14, 2025Updated 8 months ago
- ☆10Dec 18, 2023Updated 2 years ago
- ☆78Jun 28, 2025Updated 9 months ago
- ☆16Sep 25, 2025Updated 6 months ago
- ☆21Feb 27, 2024Updated 2 years ago
- Chrome Extension. As the name suggests.☆10Jan 30, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Create awesome games with GPT☆32Mar 21, 2023Updated 3 years ago
- yet another anki app☆14Sep 9, 2024Updated last year
- Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning☆31Sep 12, 2025Updated 7 months ago
- UI for ActivityWatch. Include category editor and viewer for multiple categorizations.☆10Jan 31, 2024Updated 2 years ago
- A curated list of LLM researches and applications in education.☆74Sep 13, 2024Updated last year
- BUAA Compiler Course Project 2023 by Toby Shi.☆13Aug 20, 2024Updated last year
- Appraise code used as part of WMT21 human evaluation campaign☆30Apr 12, 2026Updated last week