The data processing pipeline for the Koala chatbot language model
☆118Apr 6, 2023Updated 3 years ago
Alternatives and similar repositories for koala_data_pipeline
Users that are interested in koala_data_pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The test set for Koala☆45Mar 31, 2023Updated 3 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,516Aug 13, 2024Updated last year
- A collection of reproducible inference engine benchmarks☆39Apr 22, 2025Updated last year
- The git repository of Modular Prompted Chatbot paper☆35May 24, 2023Updated 3 years ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆181Feb 23, 2023Updated 3 years ago
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- ☆11Feb 25, 2024Updated 2 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,664Sep 15, 2023Updated 2 years ago
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,153Mar 17, 2024Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆97Jun 11, 2023Updated 3 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 3 years ago
- ☆24Mar 1, 2025Updated last year
- ☆12Aug 13, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- evolve llm training instruction, from english instruction to any language.☆120Sep 15, 2023Updated 2 years ago
- Instruction Tuning with GPT-4☆4,334Jun 11, 2023Updated 3 years ago
- Tutorial to get started with SkyPilot!☆59May 15, 2024Updated 2 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- An llm wrapper for OpenAI☆12Dec 14, 2024Updated last year
- Data and code for EACL'24 paper: Over-Reasoning and Redundant Calculation of Large Language Models☆11Jan 23, 2024Updated 2 years ago
- ☆1,566Jun 10, 2026Updated 2 weeks ago
- ⚡LLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.⚡☆2,944Nov 26, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Named entity recognition system using multi-stage CRF and statistical rules☆11Oct 3, 2016Updated 9 years ago
- ☆95Dec 19, 2024Updated last year
- A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, ect...) on a GPU. It makes use of auto-di…☆10Aug 28, 2018Updated 7 years ago
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- ☆47Nov 1, 2025Updated 7 months ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆2,145Jun 1, 2023Updated 3 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆139Apr 30, 2024Updated 2 years ago
- Transfer Learning in Dialogue Benchmarking Toolkit☆14Mar 31, 2023Updated 3 years ago
- ☆11Apr 21, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- ☆12May 18, 2022Updated 4 years ago
- ☆402Mar 22, 2023Updated 3 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 3 years ago
- ☆173Apr 20, 2023Updated 3 years ago
- ☆14Jul 17, 2025Updated 11 months ago
- ☆11Nov 13, 2020Updated 5 years ago