paperswithcode / galai
Model API for GALACTICA
☆2,675Updated last year
Related projects: ⓘ
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,442Updated 8 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆5,958Updated last week
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,691Updated 6 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,368Updated last month
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆3,561Updated this week
- An unnecessarily tiny implementation of GPT-2 in NumPy.☆3,183Updated last year
- ☆2,635Updated last week
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,743Updated 3 months ago
- A collection of libraries to optimise AI model performances☆8,373Updated last month
- Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM☆7,671Updated 8 months ago
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models☆2,815Updated 2 months ago
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,210Updated 3 weeks ago
- Accessible large language models via k-bit quantization for PyTorch.☆6,029Updated this week
- Supercharge Your Model Training☆5,116Updated this week
- LLM training code for Databricks foundation models☆3,964Updated this week
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,522Updated last month
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,339Updated last year
- Train transformer language models with reinforcement learning.☆9,288Updated this week
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries☆6,819Updated this week
- QLoRA: Efficient Finetuning of Quantized LLMs☆9,906Updated 3 months ago
- LLM as a Chatbot Service☆3,280Updated 9 months ago
- Cramming the training of a (BERT-type) language model into limited compute.☆1,284Updated 3 months ago
- Build, customize and control you own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sour…☆2,587Updated 5 months ago
- Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"☆1,646Updated 7 months ago
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,133Updated last month
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆1,941Updated last month
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,601Updated last year
- Let ChatGPT teach your own chatbot in hours with a single GPU!☆3,155Updated 6 months ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,049Updated 6 months ago
- An open-source framework for training large multimodal models.☆3,658Updated 2 weeks ago