🩺 A collection of ChatGPT evaluation reports on various bechmarks.
☆50Mar 28, 2023Updated 3 years ago
Alternatives and similar repositories for awesome-lm-evaluation
Users that are interested in awesome-lm-evaluation are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ✒️ ChatGPT as a writing partner.☆14Mar 6, 2023Updated 3 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- 基于树形条件随机场的高阶句法分析☆16Apr 28, 2022Updated 4 years ago
- 🍼 Official implementation of Dynamic Data Mixing Maximizes Instruction Tuning for Mixture-of-Experts☆41Sep 29, 2024Updated last year
- 🎮 A toolkit for Relation Extraction and more...☆24May 8, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation of "Probing Language Models for Pre-training Data Detection"☆20Dec 4, 2024Updated last year
- The information of NLP PhD application in the world.☆37Aug 27, 2024Updated last year
- 儿童故事常识推理与寓意理解评测(Commonsense Reasoning and Moral Understanding Evaluation in Children's Stories,CRMU)☆18Oct 22, 2024Updated last year
- [EMNLP'23] Code for "Non-autoregressive Text Editing with Copy-aware Latent Alignments".☆20Oct 17, 2023Updated 2 years ago
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Jan 23, 2024Updated 2 years ago
- 🪞A powerful toolkit for almost all the Information Extraction tasks.☆124Apr 21, 2025Updated last year
- ☆19Jun 3, 2024Updated last year
- ☆14Aug 18, 2022Updated 3 years ago
- [COLING'22] Code for "Semantic Role Labeling as Dependency Parsing: Exploring Latent Tree Structures Inside Arguments".☆61Oct 8, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span …☆14Aug 25, 2023Updated 2 years ago
- Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study☆43Mar 8, 2023Updated 3 years ago
- The repo of "Improving Seq2Seq Grammatical Error Correction via Decoding Interventions"☆32Jan 22, 2024Updated 2 years ago
- The Code & Paper for ACL 2023 paper "Enhancing Language Representation with Constructional Information for Natural Language Understanding…☆20Jan 18, 2025Updated last year
- ☆12May 6, 2024Updated 2 years ago
- This is the code for neural-Jacana aligner, and the data for MultiMWA dataset.☆20Feb 12, 2023Updated 3 years ago
- ☆13Feb 7, 2023Updated 3 years ago
- ☆15Jul 16, 2021Updated 4 years ago
- Code for "Small Models are Valuable Plug-ins for Large Language Models"☆132May 16, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ACL 2023] Are Pre-trained Language Models Useful for Model Ensemble in Chinese Grammatical Error Correction?☆10Dec 15, 2025Updated 4 months ago
- Code for embedding and retrieval research.☆16Oct 24, 2023Updated 2 years ago
- 苏州大学研究生学位论文模板 - Soochow University Thesis TeX Template☆20Feb 27, 2026Updated 2 months ago
- Calculate the probability of a paper being accepted by EMNLP2023 based on score distribution of ACL2023.☆14Sep 7, 2023Updated 2 years ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.☆14Oct 12, 2024Updated last year
- Awesome papers on Language-Model-as-a-Service (LMaaS)☆545May 14, 2024Updated last year
- ☆16Nov 5, 2018Updated 7 years ago
- [ICLR'25 Spotlight] Min-K%++: Improved baseline for detecting pre-training data of LLMs☆55May 26, 2025Updated 11 months ago
- ⛷️ LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-training (EMNLP 2024)☆1,001Dec 6, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆84Apr 10, 2023Updated 3 years ago
- ☆14Jul 27, 2022Updated 3 years ago
- ☆22Apr 14, 2020Updated 6 years ago
- Use the tokenizer in parallel to achieve superior acceleration☆20Mar 21, 2024Updated 2 years ago
- Source code of ACL 2023 Main Conference Paper "PAD-Net: An Efficient Framework for Dynamic Networks".☆12Feb 28, 2026Updated 2 months ago
- Code and data for the paper: On the Reliability of Psychological Scales on Large Language Models☆30Dec 15, 2025Updated 4 months ago
- ☆17Jul 9, 2021Updated 4 years ago