The data processing pipeline for the Koala chatbot language model
☆118Apr 6, 2023Updated 3 years ago
Alternatives and similar repositories for koala_data_pipeline
Users that are interested in koala_data_pipeline are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The test set for Koala☆45Mar 31, 2023Updated 3 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,519Aug 13, 2024Updated last year
- A collection of reproducible inference engine benchmarks☆38Apr 22, 2025Updated 11 months ago
- The git repository of Modular Prompted Chatbot paper☆35May 24, 2023Updated 2 years ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- ☆11Feb 25, 2024Updated 2 years ago
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,160Mar 17, 2024Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model☆97Jun 11, 2023Updated 2 years ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆50Dec 15, 2023Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- ☆12Aug 13, 2022Updated 3 years ago
- evolve llm training instruction, from english instruction to any language.☆120Sep 15, 2023Updated 2 years ago
- "Unsupervised Paraphrase Generation using Pre-trained Language Model."☆22Aug 28, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Instruction Tuning with GPT-4☆4,337Jun 11, 2023Updated 2 years ago
- Tutorial to get started with SkyPilot!☆58May 15, 2024Updated last year
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- My personal solutions to some textbook problems☆11Feb 12, 2020Updated 6 years ago
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Pytorch implementation of NPAttack☆12Jul 7, 2020Updated 5 years ago
- ☆1,561Updated this week
- ☆95Dec 19, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official pytorch implementation of I2I translation with low resolution conditioning☆23Sep 2, 2021Updated 4 years ago
- ☆45Nov 1, 2025Updated 5 months ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆2,117Jun 1, 2023Updated 2 years ago
- Transfer Learning in Dialogue Benchmarking Toolkit☆14Mar 31, 2023Updated 3 years ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆139Apr 30, 2024Updated last year
- ☆43Sep 3, 2024Updated last year
- ☆11May 18, 2022Updated 3 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- ☆173Apr 20, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆14Jul 17, 2025Updated 9 months ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆13May 5, 2022Updated 3 years ago
- ☆11Nov 13, 2020Updated 5 years ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,837Jun 17, 2025Updated 10 months ago
- ☆12Feb 16, 2024Updated 2 years ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,935Dec 7, 2024Updated last year
- Crosslingual Generalization through Multitask Finetuning☆536Sep 22, 2024Updated last year