young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117 Updated last year
Alternatives and similar repositories for koala_data_pipeline:
Users who are interested in koala_data_pipeline are comparing it to the repositories listed below
- The test set for Koala ☆45 Updated last year
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation" ☆226 Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation ☆218 Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆208 Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks ☆178 Updated last year
- Simple next-token-prediction for RLHF ☆222 Updated last year
- Code and models for BERT on STILTs ☆53 Updated 2 years ago
- Repository for analysis and experiments in the BigCode project. ☆117 Updated last year
- Fine-tune SantaCoder for Code/Text Generation. ☆190 Updated last year
- Unofficial implementation of AlpaGasus ☆90 Updated last year
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c… ☆223 Updated last year
- A repository for transformer critique learning and generation ☆89 Updated last year
- Open Source WizardCoder Dataset ☆156 Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆77 Updated 11 months ago
- Self-Alignment with Principle-Following Reward Models ☆156 Updated last year
- All available datasets for Instruction Tuning of Large Language Models ☆247 Updated last year
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated last year
- Official repository for LongChat and LongEval ☆516 Updated 10 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆147 Updated 10 months ago
- Experiments on speculative sampling with Llama models ☆125 Updated last year
- CodeGen2 models for program synthesis ☆274 Updated last year
- Pre-training code for CrystalCoder 7B LLM ☆54 Updated 10 months ago
- Code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning" (ICLR 2024) ☆364 Updated 7 months ago
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca. ☆50 Updated last year
- Implementation of Toolformer: Language Models Can Teach Themselves to Use Tools ☆137 Updated last year