runwayIA / alpaca-lora
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11 Updated 2 years ago
Alternatives and similar repositories for alpaca-lora
Users who are interested in alpaca-lora are comparing it to the repositories listed below.
- ☆25 Updated 3 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models" ☆30 Updated 2 years ago
- Interpreting Learned Search and Planning: Reverse-engineering recurrent convolutional networks (DRC) that play Sokoban ☆14 Updated 2 weeks ago
- ☆14 Updated last year
- A script for collecting the PubMed Central dataset in a language modelling friendly format. ☆24 Updated 4 years ago
- My personal web page ☆11 Updated last week
- Code base for internal reward models and PPO training ☆25 Updated last year
- ChatGPT Participates in a Computer Science Exam (2023) ☆31 Updated 2 years ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆47 Updated last year
- For experiments involving instruct gpt. Currently used for documenting open research questions. ☆71 Updated 2 years ago
- RL algorithm: Advantage induced policy alignment ☆65 Updated last year
- Entailment self-training ☆25 Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 Updated last year
- ☆28 Updated 8 months ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code. ☆20 Updated 2 years ago
- Minimum Description Length probing for neural network representations ☆18 Updated 5 months ago
- Shakespeare transformer fine-tuned to generate positive sentiment samples using RLHF ☆9 Updated 2 years ago
- Official code for paper LIME: Learning Inductive Bias for Primitives of Mathematical Reasoning ☆29 Updated 4 years ago
- ☆16 Updated last year
- Multi-Domain Expert Learning ☆67 Updated last year
- Google Research ☆46 Updated 2 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models" ☆13 Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs ☆12 Updated 6 months ago
- Helper scripts and notes that were used while porting various nlp models ☆46 Updated 3 years ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval ☆15 Updated last year
- Code for "The Expressive Power of Low-Rank Adaptation". ☆20 Updated last year
- ☆34 Updated 2 years ago
- ☆27 Updated 2 weeks ago
- ☆44 Updated last year
- Code for Pushdown Layers from our EMNLP 2023 paper ☆28 Updated last year