allenai / CommonGen-Eval
Evaluating LLMs with CommonGen-Lite
☆85Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for CommonGen-Eval
- Codebase accompanying the Summary of a Haystack paper.☆72Updated 2 months ago
- ☆112Updated last month
- Just a bunch of benchmark logs for different LLMs☆114Updated 3 months ago
- Data preparation code for Amber 7B LLM☆82Updated 6 months ago
- Mixing Language Models with Self-Verification and Meta-Verification☆97Updated last year
- Functional Benchmarks and the Reasoning Gap☆78Updated last month
- Code repository for the c-BTM paper☆105Updated last year
- The first dense retrieval model that can be prompted like an LM☆63Updated 2 months ago
- ☆40Updated 2 weeks ago
- Small and Efficient Mathematical Reasoning LLMs☆71Updated 9 months ago
- ☆74Updated 3 weeks ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated 10 months ago
- ☆102Updated last month
- A pipeline for LLM knowledge distillation☆78Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆77Updated 8 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆216Updated 7 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆80Updated 2 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆74Updated 10 months ago
- ☆93Updated last month
- Data preparation code for CrystalCoder 7B LLM☆42Updated 6 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆91Updated 3 months ago
- This is the official repository for Inheritune.☆105Updated last month
- Track the progress of LLM context utilisation☆53Updated 4 months ago
- ☆87Updated 9 months ago
- ☆35Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆113Updated 3 weeks ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆87Updated last year
- Code accompanying "How I learned to start worrying about prompt formatting".☆95Updated last month