runwayIA / alpaca-loraLinks
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11Updated 2 years ago
Alternatives and similar repositories for alpaca-lora
Users that are interested in alpaca-lora are comparing it to the libraries listed below
Sorting:
- My personal web page☆11Updated 3 months ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆16Updated 7 months ago
- ☆26Updated 3 years ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated 2 years ago
- ☆26Updated 10 months ago
- ☆30Updated last year
- A framework bridging cognitive science and LLM reasoning research to diagnose and improve how large language models reason, based on anal…☆31Updated 2 months ago
- ☆14Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Updated 3 years ago
- Code base for internal reward models and PPO training☆24Updated 2 years ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆19Updated 2 years ago
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆97Updated 2 years ago
- Learning to Model Editing Processes☆26Updated 5 months ago
- We study toy models of skill learning.☆31Updated last year
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆25Updated 4 years ago
- ☆29Updated last month
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆28Updated last week
- Official implementation for paper "How Far Are We from Genuinely Useful Deep Research Agents?"☆63Updated last month
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- Official Project Page for HLA: Higher-order Linear Attention (https://arxiv.org/abs/2510.27258)☆44Updated 3 weeks ago
- Transformers at any scale☆42Updated 2 years ago
- ☆53Updated 2 years ago
- Minimum Description Length probing for neural network representations☆20Updated last year
- ☆44Updated last year
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆31Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models.☆12Updated 6 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆35Updated 2 years ago
- Code for our ACL '23 paper titled "Grokking of Hierarchical Structure in Vanilla Transformers"☆24Updated 2 years ago
- Triton Implementation of HyperAttention Algorithm☆48Updated 2 years ago
- A platform for Interactive AI-assisted Hypothesis Generation [ACL 2025]☆27Updated 5 months ago