McGill-NLP / CHASELinks
Synthetic Data Generation for Evaluation
☆15Updated 5 months ago
Alternatives and similar repositories for CHASE
Users that are interested in CHASE are comparing it to the libraries listed below
Sorting:
- ☆124Updated 9 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆131Updated last year
- ☆35Updated 3 months ago
- Code for Zero-Shot Tokenizer Transfer☆133Updated 6 months ago
- ☆38Updated last year
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆57Updated 2 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆139Updated 8 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆58Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 4 months ago
- ☆151Updated last year
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training☆22Updated 11 months ago
- This is the official repository for Inheritune.☆112Updated 5 months ago
- Functional Benchmarks and the Reasoning Gap☆88Updated 9 months ago
- LOFT: A 1 Million+ Token Long-Context Benchmark☆205Updated last month
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated 10 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆184Updated last week
- Code repository for the c-BTM paper☆106Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated 11 months ago
- ☆57Updated 9 months ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆206Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 10 months ago
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search☆90Updated 7 months ago
- [ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners☆116Updated 3 weeks ago
- ☆44Updated 8 months ago
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated 11 months ago
- ☆66Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆33Updated 9 months ago
- Code, datasets, models for the paper "Automatic Evaluation of Attribution by Large Language Models"☆56Updated 2 years ago