Latest Evaluation Toolkit (LatestEval). Assessing language models with the latest, uncontaminated materials.
☆29Feb 17, 2025Updated last year
Alternatives and similar repositories for LatestEval
Users that are interested in LatestEval are comparing it to the libraries listed below.
- Longitudinal Evaluation of LLMs via Data Compression☆33May 29, 2024Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 5 months ago
- The official repository for the paper entitled "Time Travel in LLMs: Tracing Data Contamination in Large Language Models."☆12Jun 11, 2024Updated last year
- Source code of paper “A Novel Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation”☆16Nov 25, 2021Updated 4 years ago
- ☆22Dec 1, 2021Updated 4 years ago
- ☆11Jul 11, 2023Updated 2 years ago
- code and dataset of EMNLP 2020 paper "PARADE: A New Dataset for Paraphrase Identification Requiring Computer Science Domain Knowledge"☆12Nov 6, 2020Updated 5 years ago
- (EACL 2021) Discourse-Aware Unsupervised Summarization of Long Scientific Documents☆25Jun 12, 2023Updated 2 years ago
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- this repository contains the source code for the ACL 2019 paper "Generating Long and Informative Reviews with Aspect-Aware Coarse-to-Fine…☆37Nov 29, 2019Updated 6 years ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Code and data setup for the paper "Are Diffusion Models Vision-and-language Reasoners?"☆33Mar 15, 2024Updated 2 years ago
- Relative Preference Optimization: Enhancing LLM Alignment through Contrasting Responses across Identical and Diverse Prompts☆25Feb 23, 2024Updated 2 years ago
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf☆27May 25, 2024Updated last year
- Official implementation of the ACL 2023 paper: "Zero-shot Faithful Factual Error Correction"☆17Aug 14, 2023Updated 2 years ago
- Submissions, baselines and evaluations scripts for the 2nd version of the WebNLG+ Challenge 2020☆13Feb 1, 2022Updated 4 years ago
- The Paper List on Data Contamination for Large Language Models Evaluation.☆112Jan 29, 2026Updated 2 months ago
- ☆15Mar 20, 2025Updated last year
- one client for all of your favorite clouds, multiple clouds unlimited accounts (Multicloud Manager)☆12May 2, 2020Updated 5 years ago
- ☆12May 18, 2023Updated 2 years ago
- ☆12Dec 14, 2024Updated last year
- GoldFinch and other hybrid transformer components☆12Dec 9, 2025Updated 3 months ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- Chicago Social Interaction Model (chiSIM) framework repository☆12Aug 9, 2023Updated 2 years ago
- Explore what LLMs are really learning over SFT☆28Mar 30, 2024Updated 2 years ago
- Chinese tokens in tiktoken tokenizers.☆32May 15, 2024Updated last year
- 🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…☆16Oct 7, 2024Updated last year
- ViViDex implementation under the SAPIEN simulator, ICRA 2025☆18Apr 9, 2025Updated 11 months ago
- RWKV-7 mini☆12Mar 29, 2025Updated last year
- Published version of composing programs textbook☆14Mar 8, 2014Updated 12 years ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approach☆32Nov 6, 2023Updated 2 years ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆75May 20, 2025Updated 10 months ago
- Do Large Language Models Know What They Don’t Know?☆102Nov 8, 2024Updated last year
- ☆25Dec 8, 2025Updated 3 months ago
- DeFacto - Demonstrations and Feedback for improving factual consistency of text summarization☆30Dec 19, 2022Updated 3 years ago
- Mathematical Analysis (et analyse fonctionnelle)☆14Feb 1, 2022Updated 4 years ago
- ☆13Feb 26, 2023Updated 3 years ago
- Code for MERMAID : Metaphor Generation with Symbolism and Discriminative Decoding☆11May 2, 2022Updated 3 years ago
- ☆13Dec 5, 2022Updated 3 years ago