dsdanielpark / open-llm-datasets
Repository for organizing datasets and papers used in Open LLM.
☆94Updated last year
Alternatives and similar repositories for open-llm-datasets:
Users that are interested in open-llm-datasets are comparing it to the libraries listed below
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆236Updated last year
- Data preparation code for Amber 7B LLM☆86Updated 10 months ago
- ☆268Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆104Updated 6 months ago
- This repository implements the chain of verification paper by Meta AI☆166Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…☆147Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆77Updated 11 months ago
- A joint community effort to create one central leaderboard for LLMs.☆294Updated 7 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆73Updated 5 months ago
- Open Implementations of LLM Analyses☆103Updated 5 months ago
- Pre-training code for CrystalCoder 7B LLM☆54Updated 10 months ago
- Weekly visualization report of Open LLM model performance based on 4 metrics.☆87Updated last year
- Official repository for LongChat and LongEval☆516Updated 10 months ago
- Implementation of paper Data Engineering for Scaling Language Models to 128K Context☆454Updated last year
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆313Updated 6 months ago
- ☆74Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆218Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆302Updated 5 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆299Updated last year
- [ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark☆374Updated 8 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)☆205Updated 10 months ago
- All available datasets for Instruction Tuning of Large Language Models☆247Updated last year
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆313Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆128Updated last year
- Meta-CoT: Generalizable Chain-of-Thought Prompting in Mixed-task Scenarios with Large Language Models☆94Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆226Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆117Updated last year
- Official repository for ORPO☆446Updated 10 months ago
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆217Updated 11 months ago
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆177Updated last year