paperswithcode / galai
Model API for GALACTICA
β2,699Updated last year
Alternatives and similar repositories for galai:
Users that are interested in galai are comparing it to the libraries listed below
- Cramming the training of a (BERT-type) language model into limited compute.β1,311Updated 7 months ago
- πΈ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloadingβ9,378Updated 4 months ago
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language modelsβ2,937Updated 6 months ago
- The hub for EleutherAI's work on interpretability and learning dynamicsβ2,349Updated last month
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformerβ1,624Updated last year
- OpenLLaMA, a permissively licensed open source reproduction of Meta AIβs LLaMA 7B trained on the RedPajama datasetβ7,421Updated last year
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adβ¦β6,022Updated 4 months ago
- CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.β4,982Updated this week
- Running large language models on a single GPU for throughput-oriented scenarios.β9,255Updated 3 months ago
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.β4,626Updated last month
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flβ¦β2,445Updated 5 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)β4,570Updated last year
- Training and serving large-scale neural networks with auto parallelization.β3,095Updated last year
- AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (Nβ¦β4,590Updated last month
- Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Enginβ¦β3,380Updated 10 months ago
- Accessible large language models via k-bit quantization for PyTorch.β6,568Updated this week
- 4 bits quantization of LLaMA using GPTQβ3,032Updated 6 months ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parametersβ5,801Updated 10 months ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,185Updated 7 months ago
- LLM as a Chatbot Serviceβ3,297Updated last year
- Compositional Differentiable Programming Libraryβ1,001Updated this week
- β4,355Updated 6 months ago
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMathβ9,316Updated 5 months ago
- β590Updated last year
- Alpaca dataset from Stanford, cleaned and curatedβ1,532Updated last year
- An open-source framework for training large multimodal models.β3,805Updated 4 months ago
- β515Updated 11 months ago
- Efficient few-shot learning with Sentence Transformersβ2,328Updated 2 weeks ago
- β1,528Updated last year
- Official supported Python bindings for llama.cpp + gpt4allβ1,020Updated last year