ntunlp / OpenSource-LLMs-better-than-OpenAI
Listing all reported open-source LLMs achieving a higher score than proprietary, paid OpenAI models (ChatGPT, GPT-4).
☆68Updated last year
Alternatives and similar repositories for OpenSource-LLMs-better-than-OpenAI
Users that are interested in OpenSource-LLMs-better-than-OpenAI are comparing it to the libraries listed below
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆60Updated last year
- Reformatted Alignment☆113Updated last year
- Open Implementations of LLM Analyses☆108Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆89Updated last year
- FuseAI Project☆87Updated 10 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆59Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆43Updated last year
- Official repo of Respond-and-Respond: data, code, and evaluation☆104Updated last year
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆47Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆100Updated 2 years ago
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Updated last year
- ☆122Updated last year
- A curated list on the role of small models in the LLM era☆111Updated last year
- List of papers on Self-Correction of LLMs.☆81Updated 11 months ago
- Flacuna was developed by fine-tuning Vicuna on Flan-mini, a comprehensive instruction collection encompassing various tasks. Vicuna is al…☆111Updated 2 years ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆60Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆81Updated last year
- Codebase for LLM story generation; updated version of https://github.com/yangkevin2/doc-story-generation☆87Updated 2 years ago
- ⏳ ChatLog: Recording and Analysing ChatGPT Across Time☆103Updated last year
- [SIGIR 2024 (Demo)] CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models☆28Updated last year
- Scripts for generating synthetic finetuning data for reducing sycophancy.☆117Updated 2 years ago
- A set of utilities for running few-shot prompting experiments on large-language models☆126Updated 2 years ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Code and data for CoachLM, an automatic instruction revision approach for LLM instruction tuning.☆60Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆192Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location.☆84Updated last year
- [ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following☆134Updated last year
- [NAACL 2024] Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models☆86Updated last year
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 9 months ago
- ☆69Updated 2 years ago