runwayIA / alpaca-lora
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11Updated last year
Alternatives and similar repositories for alpaca-lora:
Users that are interested in alpaca-lora are comparing it to the libraries listed below
- ☆23Updated 3 months ago
- ☆14Updated last year
- Interpretability tools for recurrent networks that play Sokoban☆10Updated last month
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆19Updated 2 years ago
- Implementation of the DocLLM paper for Llama models.☆12Updated 2 months ago
- Benchmarking Generalization to New Tasks from Natural Language Instructions☆26Updated 3 years ago
- ☆15Updated last year
- Interview-based evaluation of LLMs☆15Updated last month
- The paper list of multilingual pre-trained models (Continual Updated).☆20Updated 8 months ago
- ☆24Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- Official implementations for (1) BlonDe: An Automatic Evaluation Metric for Document-level Machine Translation and (2) Discourse Centric …☆75Updated last year
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆12Updated 6 months ago
- ☆34Updated last year
- ☆21Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆39Updated last month
- Observe the slow deterioration of my mental sanity in the github commit history☆12Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆25Updated 10 months ago
- Official implementation of AAAI 2025 paper "Augmenting Math Word Problems via Iterative Question Composing"(https://arxiv.org/abs/2401.09…☆18Updated 2 months ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆32Updated last year
- Code for the paper "Getting the most out of your tokenizer for pre-training and domain adaptation"☆15Updated last year
- We finetune Bloomz-7b1-mt using LoRA with the chatdoctor-200k dataset at here https://huggingface.co/LinhDuong/doctorwithbloomz-7b1-mt an…☆30Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆29Updated last week
- ☆47Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆12Updated 11 months ago
- Repo for "On Learning to Summarize with Large Language Models as References"☆44Updated last year
- [COLING 2022] Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization☆25Updated 10 months ago
- Implementation for MomentumSMoE☆16Updated this week
- [ICLR 2023] PyTorch code of Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees☆23Updated last year