runwayIA / alpaca-loraLinks
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11Updated 2 years ago
Alternatives and similar repositories for alpaca-lora
Users that are interested in alpaca-lora are comparing it to the libraries listed below
Sorting:
- ☆25Updated 3 years ago
- My personal web page☆11Updated last month
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆15Updated 5 months ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆25Updated 4 years ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆21Updated 3 years ago
- RL algorithm: Advantage induced policy alignment☆66Updated 2 years ago
- ☆14Updated 2 years ago
- ☆44Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Updated 2 years ago
- Code base for internal reward models and PPO training☆24Updated 2 years ago
- Download, parse, and filter data from Phil Papers. Data-ready for The-Pile.☆18Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 3 years ago
- Based on the tree of thoughts paper☆48Updated 2 years ago
- ☆22Updated 8 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated this week
- ☆29Updated 3 months ago
- ☆30Updated last year
- Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning☆29Updated 4 years ago
- ☆17Updated 6 years ago
- ☆52Updated last year
- Multi-Domain Expert Learning☆67Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- ☆43Updated last year
- Bayesian scaling laws for in-context learning.☆15Updated 8 months ago
- Transformers at any scale☆41Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- Google Research☆46Updated 3 years ago
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆29Updated last year