McGill-NLP / CHASE
Synthetic Data Generation for Evaluation
☆11Updated last month
Alternatives and similar repositories for CHASE:
Users that are interested in CHASE are comparing it to the libraries listed below
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated 11 months ago
- Codebase for Instruction Following without Instruction Tuning☆33Updated 6 months ago
- ☆41Updated 3 weeks ago
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆27Updated last month
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆20Updated 7 months ago
- ☆17Updated 5 months ago
- ☆59Updated last week
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 4 months ago
- ☆15Updated last month
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation☆47Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆23Updated last week
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆42Updated last year
- ☆119Updated 5 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; COLM 2024)☆44Updated 2 months ago
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆44Updated last month
- ☆38Updated 11 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆44Updated this week
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated 11 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆44Updated 3 months ago
- InstructCoder: Instruction Tuning Large Language Models for Code Editing | Oral ACL-2024 srw☆58Updated 5 months ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆67Updated 9 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆39Updated last month
- Evaluate the Quality of Critique☆34Updated 9 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 6 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆70Updated 3 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆36Updated 3 months ago
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆52Updated last year