runwayIA / alpaca-lora
Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)
☆11Updated 2 years ago
Alternatives and similar repositories for alpaca-lora:
Users that are interested in alpaca-lora are comparing it to the libraries listed below
- Common crawl pretrained sentencepiece tokenizers for English and Japanese for various vocabulary sizes. Also development environment for …☆10Updated 3 years ago
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- ☆24Updated 2 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- ☆14Updated last year
- Interpretability tools for recurrent convolutional networks (DRC) that play Sokoban☆12Updated last month
- Conic10K: A large-scale dataset for closed-vocabulary math problem understanding. Accepted to EMNLP2023 Findings.☆25Updated last year
- ☆28Updated 5 months ago
- Data and code accompanying the paper "Intent Detection with WikiHow"☆10Updated 3 years ago
- Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"☆32Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆18Updated 2 years ago
- ☆35Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- Code for GenAug: Data Augmentation for Finetuning Text Generators.☆27Updated 3 years ago
- ☆44Updated 10 months ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11Updated 11 months ago
- ☆11Updated 2 years ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Updated 2 years ago
- Code for CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning☆23Updated 2 years ago
- Repository containing the website for the EMNLP 2023 conference☆16Updated 2 months ago
- Transformers at any scale☆41Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.☆35Updated 2 years ago
- ☆27Updated last month
- ☆22Updated last year
- Large-scale query-focused multi-document Summarization dataset☆10Updated 3 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆16Updated last year
- an implementation of paper"Retentive Network: A Successor to Transformer for Large Language Models" https://arxiv.org/pdf/2307.08621.pdf☆12Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆22Updated last year
- ☆19Updated 3 years ago