abertsch72 / long-context-icl
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
☆25Updated 3 weeks ago
Related projects: ⓘ
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆37Updated 2 months ago
- ☆44Updated 2 weeks ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆81Updated 2 weeks ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆39Updated 7 months ago
- An open-source library for contamination detection in NLP datasets and Large Language Models (LLMs).☆40Updated last month
- ☆38Updated 5 months ago
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆48Updated 6 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆28Updated 6 months ago
- Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators (Liu et al.; arXiv preprint arXiv:2403.…☆34Updated 2 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆45Updated 6 months ago
- AI Logging for Interpretability and Explainability🔬☆74Updated 3 months ago
- Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models☆38Updated 10 months ago
- ☆64Updated last month
- ☆69Updated 10 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆51Updated last year
- ☆22Updated 2 months ago
- ☆25Updated 3 months ago
- ☆30Updated last month
- BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆41Updated last month
- Repository for paper Tools Are Instrumental for Language Agents in Complex Environments☆32Updated 8 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆52Updated last month
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold"☆22Updated 3 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆82Updated 2 months ago
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"☆71Updated last year
- PASTA: Post-hoc Attention Steering for LLMs☆96Updated last week
- This repository contains data, code and models for contextual noncompliance.☆17Updated 2 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆33Updated 3 months ago
- LoFiT: Localized Fine-tuning on LLM Representations☆15Updated 2 months ago
- Evaluate the Quality of Critique☆35Updated 3 months ago
- [ICML'24] TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks☆20Updated 7 months ago