The data processing pipeline for the Koala chatbot language model
☆118Apr 6, 2023Updated 2 years ago
Alternatives and similar repositories for koala_data_pipeline
Users that are interested in koala_data_pipeline are comparing it to the libraries listed below
Sorting:
- The test set for Koala☆45Mar 31, 2023Updated 2 years ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,517Aug 13, 2024Updated last year
- Easy to use and open-source unknown stealer☆22Jul 24, 2023Updated 2 years ago
- ☆13May 8, 2023Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆38Apr 22, 2025Updated 10 months ago
- A Multilingual Replicable Instruction-Following Model☆96Jun 11, 2023Updated 2 years ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14May 5, 2022Updated 3 years ago
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023)☆17Jun 2, 2023Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆209Jan 13, 2024Updated 2 years ago
- A deep reinforcement learning model for portfolio management. For more info, check☆14Jun 2, 2020Updated 5 years ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,629Sep 15, 2023Updated 2 years ago
- ☆180Feb 23, 2023Updated 3 years ago
- ☆24Mar 1, 2025Updated last year
- orderbooks contain so much more organic informations than moving averages...☆19Updated this week
- A repository for transformer critique learning and generation☆89Dec 7, 2023Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,168Mar 17, 2024Updated last year
- LEIA: Facilitating Cross-Lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation☆23Apr 24, 2024Updated last year
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆138Apr 30, 2024Updated last year
- ☆1,559Feb 20, 2026Updated 2 weeks ago
- Template-DQN and DRRN agent implementations☆22Jun 12, 2023Updated 2 years ago
- Tutorial to get started with SkyPilot!☆58May 15, 2024Updated last year
- NusaWrites is an in-depth analysis of corpora collection strategy and a comprehensive language modeling benchmark for underrepresented an…☆28Sep 27, 2024Updated last year
- ☆56Nov 6, 2024Updated last year
- The official repository for the Anything But Wrappers: Llama Edition Hackameetup☆22Sep 1, 2023Updated 2 years ago
- Instruction Tuning with GPT-4☆4,341Jun 11, 2023Updated 2 years ago
- Reimplementation of the task generation part from the Alpaca paper☆119Apr 4, 2023Updated 2 years ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆2,096Jun 1, 2023Updated 2 years ago
- ☆57Feb 10, 2025Updated last year
- LVCS@Tesla.com☆12Jan 16, 2026Updated last month
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- ☆70Jun 7, 2023Updated 2 years ago
- ☆29Oct 15, 2023Updated 2 years ago
- Mindful is a mental wellness app designed to support users in managing stress and anxiety. Powered by advanced AI, it offers personalized…☆11Apr 11, 2025Updated 10 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,738Jan 8, 2024Updated 2 years ago
- An open-source implementation of Google's PaLM models☆819Jun 21, 2024Updated last year
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,923Dec 7, 2024Updated last year
- ☆457Oct 15, 2023Updated 2 years ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆254Oct 31, 2023Updated 2 years ago