paperswithcode / galai
Model API for GALACTICA
☆2,712Updated 2 years ago
Alternatives and similar repositories for galai:
Users who are interested in galai are comparing it to the libraries listed below
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,043Updated 6 months ago
- Compositional Differentiable Programming Library☆1,021Updated this week
- A collection of libraries to optimise AI model performances☆8,372Updated 8 months ago
- Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)☆3,896Updated 9 months ago
- The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".☆1,310Updated last year
- Cramming the training of a (BERT-type) language model into limited compute.☆1,325Updated 9 months ago
- Efficient few-shot learning with Sentence Transformers☆2,424Updated 2 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,610Updated last year
- Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)☆1,252Updated 3 months ago
- Officially supported Python bindings for llama.cpp + gpt4all☆1,020Updated last year
- Creative interactive views of any dataset.☆837Updated 3 months ago
- Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch☆1,958Updated 10 months ago
- Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.☆3,973Updated 7 months ago
- Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"☆1,692Updated last year
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,429Updated 2 weeks ago
- Easily compute clip embeddings and build a clip retrieval system with them☆2,521Updated 11 months ago
- Fast & Simple repository for pre-training and fine-tuning T5-style models☆1,001Updated 7 months ago
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch☆2,110Updated 4 months ago
- ☆1,000Updated last year
- A school for camelids☆1,208Updated last year
- MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.☆923Updated last week
- [NeurIPS 22] [AAAI 24] Recurrent Transformer-based long-context architecture.☆760Updated 5 months ago
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆744Updated last year
- Monte Carlo tree search in JAX☆2,446Updated 3 months ago
- Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"☆1,059Updated last year
- Build, customize and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-sour…☆2,640Updated 6 months ago
- Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI☆2,016Updated 8 months ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆820Updated 2 years ago
- Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch☆8,217Updated 5 months ago
- A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer☆1,626Updated last year