msclar / formatspread
Code accompanying "How I learned to start worrying about prompt formatting".
☆95 · Updated last month
Related projects
Alternatives and complementary repositories for formatspread
- Evaluating LLMs with fewer examples ☆134 · Updated 7 months ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆195 · Updated 2 weeks ago
- ☆112 · Updated last month
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆124 · Updated 3 weeks ago
- Code for In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering ☆144 · Updated last month
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ☆103 · Updated last month
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers ☆122 · Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆72 · Updated 2 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 · Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆115 · Updated last week
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" ☆91 · Updated 4 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆74 · Updated 10 months ago
- ☆102 · Updated last month
- Scripts for generating synthetic finetuning data for reducing sycophancy. ☆107 · Updated last year
- ☆116 · Updated 5 months ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore". ☆129 · Updated this week
- Reformatted Alignment ☆112 · Updated last month
- Code and results accompanying the paper "Refusal in Language Models Is Mediated by a Single Direction". ☆123 · Updated last month
- ☆295 · Updated 5 months ago
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆141 · Updated last year
- [EMNLP 2024] A Retrieval Benchmark for Scientific Literature Search ☆61 · Updated 4 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators ☆41 · Updated 9 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆62 · Updated last year
- Self-Alignment with Principle-Following Reward Models ☆148 · Updated 8 months ago
- PASTA: Post-hoc Attention Steering for LLMs ☆108 · Updated 2 months ago
- 🌍 Repository for "AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agent", ACL'24 Best Resource Pap… ☆110 · Updated 3 weeks ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization" ☆78 · Updated 8 months ago
- A Survey on Data Selection for Language Models ☆182 · Updated last month
- LOFT: A 1 Million+ Token Long-Context Benchmark ☆146 · Updated 3 weeks ago
- evol augment any dataset online ☆55 · Updated last year