ntunlp / OpenSource-LLMs-better-than-OpenAILinks
Listing all reported open-source LLMs achieving a higher score than proprietary, paying OpenAI models (ChatGPT, GPT-4).
☆68Updated 2 years ago
Alternatives and similar repositories for OpenSource-LLMs-better-than-OpenAI
Users that are interested in OpenSource-LLMs-better-than-OpenAI are comparing it to the libraries listed below
Sorting:
- Open Implementations of LLM Analyses☆107Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆85Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆90Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Updated last year
- FuseAI Project☆87Updated last year
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆60Updated last year
- List of papers on Self-Correction of LLMs.☆80Updated last year
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆110Updated 4 months ago
- ☆75Updated last year
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆61Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆193Updated last year
- a curated list of the role of small models in the LLM era☆111Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆81Updated 2 years ago
- ☆104Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- augmented LLM with self reflection☆137Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆59Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆101Updated 2 years ago
- Contrastive Chain-of-Thought Prompting☆68Updated 2 years ago
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collborative Search Agent with Large Language Models☆30Updated last year
- My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"☆100Updated 2 years ago
- Reformatted Alignment☆111Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆49Updated 2 years ago
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆119Updated 2 years ago
- ☆123Updated last year
- A framework for human-readable prompt-based method with large language models. Specially designed for researchers. (Deprecated, check out…☆131Updated 2 years ago
- Benchmark baseline for retrieval qa applications☆119Updated last year
- ☆80Updated 10 months ago