runwayIA / alpaca-loraLinks
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11Updated 2 years ago
Alternatives and similar repositories for alpaca-lora
Users that are interested in alpaca-lora are comparing it to the libraries listed below
Sorting:
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆21Updated 2 years ago
- ☆16Updated 5 months ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated 2 years ago
- ☆25Updated 3 years ago
- A platform for Interactive AI-assisted Hypothesis Generation [ACL 2025]☆21Updated last month
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆24Updated 4 years ago
- ☆14Updated 2 years ago
- ☆29Updated last month
- Code base for internal reward models and PPO training☆24Updated last year
- A framework for few-shot evaluation of autoregressive language models.☆12Updated 2 months ago
- For experiments involving instruct gpt. Currently used for documenting open research questions.☆71Updated 2 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban☆15Updated 2 months ago
- Minimum Description Length probing for neural network representations☆18Updated 7 months ago
- ☆16Updated 2 years ago
- ☆44Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- My personal web page☆11Updated last week
- Based on the tree of thoughts paper☆48Updated 2 years ago
- ☆17Updated 6 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated 2 years ago
- ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802☆95Updated 2 years ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated 2 years ago
- ☆22Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last week
- ⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆17Updated last month
- Google Research☆46Updated 2 years ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated this week
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14Updated 2 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆30Updated 2 years ago
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14Updated 3 years ago