Aleph-Alpha / magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
☆490Updated 2 weeks ago
Alternatives and similar repositories for magma:
Users who are interested in magma are comparing it to the libraries listed below.
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆530Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆309Updated 2 years ago
- Code release for "Dropout Reduces Underfitting"☆313Updated last year
- ☆351Updated 2 years ago
- Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors☆336Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆547Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆820Updated 2 years ago
- Implementation of the DeepMind Flamingo vision-language model, based on Hugging Face language models and ready for training☆166Updated last year
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆317Updated last year
- Cramming the training of a (BERT-type) language model into limited compute.☆1,331Updated 10 months ago
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".☆482Updated last year
- Language Modeling with the H3 State Space Model☆520Updated last year
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,238Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star.☆164Updated 2 years ago
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆203Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆863Updated last year
- Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)☆464Updated 2 years ago
- ☆141Updated 2 years ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆707Updated last year
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆122Updated last year
- Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"☆450Updated last year
- Ask Me Anything language model prompting☆547Updated last year
- Official repository for the paper "Instance-Conditioned GAN" by Arantxa Casanova, Marlene Careil, Jakob Verbeek, Michał Drożdżal, Adriana…☆538Updated 3 years ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆766Updated 2 years ago
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings.☆412Updated 2 years ago
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆289Updated last year
- Official Implementation of Paella https://arxiv.org/abs/2211.07292v2☆745Updated last year
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.☆310Updated 4 months ago
- Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.☆275Updated 2 years ago