Aleph-Alpha-Research / magma
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
☆489 Updated last month
Alternatives and similar repositories for magma
Users who are interested in magma are comparing it to the libraries listed below.
- ☆350 Updated 3 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, … ☆90 Updated 2 years ago
- Used for adaptive human in the loop evaluation of language and embedding models. ☆309 Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective. ☆167 Updated 3 weeks ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch ☆549 Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways ☆823 Updated 2 years ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers ☆713 Updated last year
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi… ☆163 Updated 2 years ago
- OpenAI CLIP text encoders for multiple languages! ☆802 Updated 2 years ago
- CLIP (Contrastive Language–Image Pre-training) for Italian ☆186 Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ... ☆317 Updated last year
- Contrastive Language-Image Pretraining ☆143 Updated 2 years ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch ☆1,248 Updated 2 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training ☆769 Updated 2 years ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training ☆167 Updated 2 years ago
- Code release for "Dropout Reduces Underfitting" ☆313 Updated 2 years ago
- PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022) ☆242 Updated 2 weeks ago
- Ask Me Anything language model prompting ☆546 Updated last year
- This is a summary of easily available datasets for generalized DALLE-pytorch training. ☆128 Updated 3 years ago
- An instruction-based benchmark for text improvements. ☆141 Updated 2 years ago
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use. ☆312 Updated 6 months ago
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch ☆866 Updated last year
- A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI. ☆462 Updated 3 years ago
- RUDOLPH: One Hyper-Tasking Transformer can be creative as DALL-E and GPT-3 and smart as CLIP ☆254 Updated 2 years ago
- Multi-angle c(q)uestion answering ☆458 Updated 2 years ago
- Language Modeling with the H3 State Space Model ☆519 Updated last year
- Open-AI's DALL-E for large scale training in mesh-tensorflow. ☆433 Updated 3 years ago
- Dataset of prompts, synthetic AI generated images, and aesthetic ratings. ☆412 Updated 2 years ago
- Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images. ☆277 Updated 2 years ago
- ☆130 Updated 3 years ago