dsdanielpark / open-llm-datasetsLinks
Repository for organizing datasets and papers used in Open LLM.
☆101Updated 2 years ago
Alternatives and similar repositories for open-llm-datasets
Users that are interested in open-llm-datasets are comparing it to the libraries listed below
Sorting:
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆318Updated 2 years ago
- A joint community effort to create one central leaderboard for LLMs.☆308Updated last year
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation☆184Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated last year
- ☆278Updated 2 years ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆146Updated 2 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆245Updated last year
- Official repository for LongChat and LongEval☆533Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆555Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆143Updated 2 years ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆253Updated 2 years ago
- Complex question answering in LLMs with enhanced reasoning and information-seeking capabilities.☆204Updated 2 years ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆120Updated 3 months ago
- ☆78Updated 2 years ago
- Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.☆409Updated last year
- An open collection of methodologies to help with successful training of large language models.☆550Updated last year
- DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection and Instruction-Aware Models for Conversational AI☆519Updated last year
- Weekly visualization report of Open LLM model performance based on 4 metrics.☆86Updated 2 years ago
- Merge Transformers language models by use of gradient parameters.☆213Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆169Updated 2 years ago
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆103Updated 5 months ago
- Langchain implementation of HuggingGPT☆134Updated 2 years ago
- The data processing pipeline for the Koala chatbot language model☆118Updated 2 years ago
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆221Updated 2 years ago
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆213Updated 2 months ago
- Pre-training code for Amber 7B LLM☆170Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆137Updated 2 years ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆228Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data☆509Updated 2 years ago
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.☆324Updated 2 years ago