Latest Evaluation Toolkit (LatestEval). Assessing the language models with latest, uncontaminated materials.
☆29Feb 17, 2025Updated last year
Alternatives and similar repositories for LatestEval
Users that are interested in LatestEval are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations☆15Jul 27, 2024Updated last year
- Source code of paper “A Novel Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation”☆16Nov 25, 2021Updated 4 years ago
- Benchmarking Commonsense Reasoning in Real-World Tasks☆12Dec 14, 2023Updated 2 years ago
- ☆22Dec 1, 2021Updated 4 years ago
- code and dataset of EMNLP 2020 paper "PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge"☆12Nov 6, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆46Jun 24, 2025Updated 9 months ago
- this repository contains the source code for the ACL 2019 paper "Generating Long and Informative Reviews with Aspect-Aware Coarse-to-Fine…☆37Nov 29, 2019Updated 6 years ago
- Website for release of TellMeWhy dataset for why question answering☆14Nov 11, 2022Updated 3 years ago
- Mini Model Daemon☆13Nov 9, 2024Updated last year
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Knowledge Infused Decoding☆70Dec 31, 2023Updated 2 years ago
- Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts☆25Feb 23, 2024Updated 2 years ago
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf☆27May 25, 2024Updated last year
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"☆17Aug 14, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The Paper List on Data Contamination for Large Language Models Evaluation.☆115Jan 29, 2026Updated 2 months ago
- one client for all of your favorite clouds, multiple clouds unlimited accounts (Multicloud Manager)☆12May 2, 2020Updated 5 years ago
- ☆12Dec 14, 2024Updated last year
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 8 years ago
- ☆34Jan 7, 2026Updated 3 months ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated 2 years ago
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Oct 7, 2024Updated last year
- RWKV-7 mini☆12Mar 29, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.☆11Mar 1, 2024Updated 2 years ago
- Published version of composing programs textbook☆15Mar 8, 2014Updated 12 years ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach☆32Nov 6, 2023Updated 2 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 10 months ago
- GoldFinch and other hybrid transformer components☆13Dec 9, 2025Updated 4 months ago
- ☆26Dec 8, 2025Updated 4 months ago
- Do Large Language Models Know What They Don’t Know?☆103Nov 8, 2024Updated last year
- Mathematical Analysis (et analyse fonctionnelle)☆14Feb 1, 2022Updated 4 years ago
- Code for MERMAID : Metaphor Generation with Symbolism and Discriminative Decoding☆11May 2, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Synthetic data generation for TODs☆23Jul 17, 2024Updated last year
- LLM4HWDesign Starting Toolkit☆19Oct 4, 2024Updated last year
- Xlore2.0 Code[BaiduExtractor, HudongExtractor, WikiExtractor, XloreData, XloreWeb]☆12Apr 5, 2017Updated 9 years ago
- In-Context Learning User Simulators for Task-Oriented Dialog Systems☆30Jun 2, 2023Updated 2 years ago
- The Shmoop Corpus☆17Oct 27, 2020Updated 5 years ago
- lofiatc.com but with mpv☆11Mar 18, 2025Updated last year
- RWKV Wiki website (archived, please visit official wiki)☆11Mar 26, 2023Updated 3 years ago