runwayIA / alpaca-loraLinks
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11Updated 2 years ago
Alternatives and similar repositories for alpaca-lora
Users that are interested in alpaca-lora are comparing it to the libraries listed below
Sorting:
- ☆24Updated 9 months ago
- My personal web page☆11Updated 2 months ago
- A framework for few-shot evaluation of autoregressive language models.☆12Updated 5 months ago
- ☆25Updated 3 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Updated 3 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆15Updated 6 months ago
- Code base for internal reward models and PPO training☆24Updated 2 years ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆25Updated 3 weeks ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆25Updated 4 years ago
- ☆44Updated last year
- We study toy models of skill learning.☆31Updated 11 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Updated 3 weeks ago
- ☆14Updated 2 years ago
- ☆30Updated last year
- Code for ICML 2025 paper | Joint Localization and Activation Editing for Low-Resource Fine-Tuning☆25Updated 6 months ago
- Official Code Repository for the paper "Key-value memory in the brain"☆31Updated 10 months ago
- [ICML 24 NGSM workshop] Associative Recurrent Memory Transformer implementation and scripts for training and evaluation☆60Updated this week
- ☆29Updated 2 weeks ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆19Updated 2 years ago
- ☆15Updated 2 years ago
- ☆39Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- A repository for research on medium sized language models.☆77Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated 2 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆41Updated 3 weeks ago
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated 2 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last month