LLM evaluation on 2024 Chinese Gaokao Mathematics — zero-contamination benchmark with dual prompt formats
☆19Apr 15, 2026Updated 3 weeks ago
Alternatives and similar repositories for Llmeval-Gaokao2024-Math
Users that are interested in Llmeval-Gaokao2024-Math are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ACL 2026] A large-scale longitudinal study on robust and fair evaluation of LLMs — 200K+ generative questions across 13 disciplines☆37Apr 13, 2026Updated 3 weeks ago
- GAOGAO-Bench-Updates is a supplement to the GAOKAO-Bench, a dataset to evaluate large language models.☆42Jan 7, 2025Updated last year
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- Math-aware QA system☆18Dec 17, 2022Updated 3 years ago
- AMI Meeting Parallel Corpus☆11Dec 11, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for the ICLR'24 paper: MT-RANKER : Reference-free machine translation evaluation by inter-system ranking☆10Feb 29, 2024Updated 2 years ago
- XWikisCorpus, cross-lingual summarisation, multi-lingual summarisation, pre-trained language models, zero-shot and few-shot summarisation…☆10Nov 4, 2022Updated 3 years ago
- The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Pos…☆71Sep 29, 2025Updated 7 months ago
- Source code for paper Are Human-generated Demonstrations Necessary for In-context Learning☆12Jan 21, 2024Updated 2 years ago
- 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- The PyTorch code for paper: An Affect-Rich Neural Conversational Model with Biased Attention and Weighted Cross-Entropy Loss☆12Oct 7, 2019Updated 6 years ago
- A wiki platform for the students and teachers of Tsinghua University☆15Apr 14, 2026Updated 3 weeks ago
- Assist Non-native Viewers: Multimodal Crosslingual Summarization for How2 Videos☆10Sep 2, 2024Updated last year
- Context-aware-Interactive-Attention-for-Multi-modal-Sentiment-and Emotion-Analysis☆11Feb 24, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Nov 29, 2021Updated 4 years ago
- Deep learning for tissue parameter estimation in magnetic resonance fingerprinting☆13Oct 5, 2019Updated 6 years ago
- Code for "A Dependency Syntactic Knowledge Augmented Interactive Architecture for End-to-End Aspect-based Sentiment Analysis" on Neurocom…☆17May 19, 2021Updated 4 years ago
- Code for the ACL2022 main conference paper "A Variational Hierarchical Model for Neural Cross-Lingual Summarization"☆18Sep 5, 2022Updated 3 years ago
- Code for "A Novel Aspect-Guided Deep Transition Model for Aspect Based Sentiment Analysis." on EMNLP 2019.☆21Dec 22, 2019Updated 6 years ago
- ☆11Jul 21, 2024Updated last year
- PyTorch implementation of the Marginalizable Density Model Approximator☆18Oct 11, 2021Updated 4 years ago
- Sign language translation dataset using SignWriting☆21Feb 22, 2024Updated 2 years ago
- ☆50Sep 6, 2025Updated 8 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This is the official leaderboard of the six practice for the new commers of BJTUNLPers.☆15Dec 17, 2019Updated 6 years ago
- Learning to Generate STRUCTURED Output with Schema Reinforcement Learning☆23Mar 2, 2025Updated last year
- Accepted by ACL 2025☆30Aug 13, 2025Updated 8 months ago
- a benckmark for evaluating logical reasoning of LLMs☆23Jan 25, 2024Updated 2 years ago
- 语雀 + Elog + Hexo + GitHub Actions + Vercel 博客解决方案☆12Jul 9, 2024Updated last year
- This is the second version of the practices for the rookies of BJTUNLPers.☆18Jan 13, 2021Updated 5 years ago
- An application that captures a video, splits into many segments and uploads to a server☆20Nov 12, 2011Updated 14 years ago
- SuperCLUE-Math6:新一代中文原生多轮多步数学推理数据集的探索之旅☆58Feb 5, 2024Updated 2 years ago
- remark plugin for custom container☆10Dec 8, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🗂️A file list/WebDAV program that supports multiple storages, powered by Gin and Solidjs. / 一个支持多存储的文件列表/WebDAV程序,使用 Gin 和 Solidjs。☆18Jun 11, 2025Updated 10 months ago
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- ☆26Oct 21, 2019Updated 6 years ago
- ☆10Mar 18, 2024Updated 2 years ago
- ☆31Apr 21, 2023Updated 3 years ago
- Fortran IntelliSense for Visual Studio 2017☆12Mar 18, 2022Updated 4 years ago
- Convert your videos to densepose and use it on MagicAnimate☆23Dec 7, 2023Updated 2 years ago