runwayIA / alpaca-lora
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11 · Updated 2 years ago
Alternatives and similar repositories for alpaca-lora
Users who are interested in alpaca-lora are comparing it to the repositories listed below.
- ☆25 · Updated 3 years ago
- ChatGPT Participates in a Computer Science Exam (2023) ☆31 · Updated 2 years ago
- ☆44 · Updated last year
- ☆14 · Updated 2 years ago
- My personal web page ☆11 · Updated this week
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban ☆15 · Updated 2 months ago
- ☆14 · Updated 5 months ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format. ☆24 · Updated 4 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code. ☆21 · Updated 2 years ago
- A platform for Interactive AI-assisted Hypothesis Generation [ACL 2025] ☆21 · Updated 2 weeks ago
- Transformers at any scale ☆41 · Updated last year
- Code base for internal reward models and PPO training ☆25 · Updated last year
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802 ☆95 · Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training" ☆24 · Updated this week
- Google Research ☆45 · Updated 2 years ago
- ☆17 · Updated 6 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 · Updated last year
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models" ☆30 · Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆12 · Updated last month
- ☆25 · Updated last month
- Entailment self-training ☆25 · Updated 2 years ago
- We study toy models of skill learning. ☆31 · Updated 7 months ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile. ☆18 · Updated 2 years ago
- ☆26 · Updated last year
- An unofficial pytorch implementation of 'Efficient Infinite Context Transformers with Infini-attention' ☆52 · Updated last year
- ☆37 · Updated 4 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 · Updated 2 years ago
- RL algorithm: Advantage induced policy alignment ☆65 · Updated 2 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions. ☆71 · Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · Updated last year