Aleph-Alpha / magma

MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com

☆490

Alternatives and similar repositories for magma:

Users that are interested in magma are comparing it to the libraries listed below

lucidrains / parti-pytorch
Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch
☆530Updated last year
CarperAI / cheese
Used for adaptive human in the loop evaluation of language and embedding models.
☆309Updated 2 years ago
facebookresearch / dropout
Code release for "Dropout Reduces Underfitting"
☆313Updated last year
EleutherAI / vqgan-clip
☆351Updated 2 years ago
CasualGANPapers / Make-A-Scene
Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
☆336Updated 2 years ago
lucidrains / nuwa-pytorch
Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch
☆547Updated 2 years ago
lucidrains / PaLM-pytorch
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways
☆820Updated 2 years ago
dhansmair / flamingo-mini
Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training
☆166Updated last year
rom1504 / cc2dataset
Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...
☆317Updated last year
JonasGeiping / cramming
Cramming the training of a (BERT-type) language model into limited compute.
☆1,331Updated 10 months ago
kohjingyu / fromage
🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".
☆482Updated last year
HazyResearch / H3
Language Modeling with the H3 State Space Model
☆520Updated last year
lucidrains / flamingo-pytorch
Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch
☆1,238Updated 2 years ago
LAION-AI / laion-dreams
Aim for the moon. If you miss, you may hit a star.
☆164Updated 2 years ago
lucidrains / Mega-pytorch
Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena
☆203Updated last year
lucidrains / RETRO-pytorch
Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch
☆863Updated last year
bigscience-workshop / t-zero
Reproduce results and replicate training fo T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
☆464Updated 2 years ago
apple / ml-no-token-left-behind
☆141Updated 2 years ago
lucidrains / x-clip
A concise but complete implementation of CLIP with various experimental improvements from recent papers
☆707Updated last year
geov-ai / geov
The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆122Updated last year
r-three / t-few
Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"
☆450Updated last year
HazyResearch / ama_prompting
Ask Me Anything language model prompting
☆547Updated last year
facebookresearch / ic_gan
Official repository for the paper "Instance-Conditioned GAN" by Arantxa Casanova, Marlene Careil, Jakob Verbeek, Michał Drożdżal, Adriana…
☆538Updated 3 years ago
internet-explorer-ssl / internet-explorer
Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…
☆163Updated 2 years ago
facebookresearch / SLIP
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆766Updated 2 years ago
JD-P / simulacra-aesthetic-captions
Dataset of prompts, synthetic AI generated images, and aesthetic ratings.
☆412Updated 2 years ago
sanjeevanahilan / nanoChatGPT
A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick
☆289Updated last year
dome272 / Paella
Official Implementation of Paella https://arxiv.org/abs/2211.07292v2
☆745Updated last year
facebookresearch / Mephisto
A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.
☆310Updated 4 months ago
JoaoLages / diffusers-interpret
Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.
☆275Updated 2 years ago