dsdanielpark / open-llm-leaderboard-report
Weekly visualization report of Open LLM model performance based on 4 metrics.
☆86 Updated last year
Alternatives and similar repositories for open-llm-leaderboard-report:
Users interested in open-llm-leaderboard-report are comparing it to the repositories listed below.
- Mixing Language Models with Self-Verification and Meta-Verification ☆100 Updated last month
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆98 Updated 4 months ago
- ☆172 Updated last year
- ☆84 Updated last year
- Google Deepmind's PromptBreeder for automated prompt engineering implemented in langchain expression language. ☆91 Updated 5 months ago
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models ☆90 Updated last year
- This repository implements the chain of verification paper by Meta AI ☆160 Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning ☆224 Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models ☆116 Updated last year
- Implementation of Google's SELF-DISCOVER ☆288 Updated 5 months ago
- ☆266 Updated last year
- ☆74 Updated last year
- Just a bunch of benchmark logs for different LLMs ☆117 Updated 6 months ago
- ☆350 Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆115 Updated last year
- Patch for MPT-7B which allows using and training a LoRA ☆58 Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆218 Updated last year
- ☆128 Updated last year
- Langchain implementation of HuggingGPT ☆127 Updated last year
- Official repo of Rephrase-and-Respond: data, code, and evaluation ☆101 Updated 5 months ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆540 Updated last year
- ☆138 Updated 9 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆146 Updated last year
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs" ☆460 Updated 10 months ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ☆225 Updated last year
- Official Implementation of InstructZero; the first framework to optimize bad prompts of ChatGPT(API LLMs) and finally obtain good prompts… ☆186 Updated 6 months ago
- A joint community effort to create one central leaderboard for LLMs. ☆288 Updated 5 months ago
- Evaluating LLMs with CommonGen-Lite ☆88 Updated 10 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆42 Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`. ☆149 Updated last year