Aleph-Alpha-Research / magmaLinks
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
☆490Updated last month
Alternatives and similar repositories for magma
Users that are interested in magma are comparing it to the libraries listed below
Sorting:
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆539Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆311Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆319Updated last year
- ☆352Updated 3 years ago
- Research code for pixel-based encoders of language (PIXEL)☆339Updated last month
- Ask Me Anything language model prompting☆546Updated 2 years ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆713Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆823Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.☆168Updated last month
- CLIP (Contrastive Language–Image Pre-training) for Italian☆186Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star.☆164Updated 2 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆90Updated 2 years ago
- Code release for "Dropout Reduces Underfitting"☆314Updated 2 years ago
- ☆249Updated 2 years ago
- Repository for "Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search"☆179Updated 3 years ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆229Updated 11 months ago
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆635Updated 2 years ago
- Language Modeling with the H3 State Space Model☆519Updated last year
- Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.☆379Updated last year
- Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.☆280Updated 2 years ago
- Playing around with stable diffusion. Generated images are reproducible because I save the metadata and latent information. You can gener…☆205Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆168Updated 2 years ago
- code for reproducing some of the diagrams in the paper "Multimodal Neurons in Artificial Neural Networks"☆308Updated 4 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 3 years ago
- MinImagen: A minimal implementation of the Imagen text-to-image model☆307Updated 2 years ago
- Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch☆1,259Updated 2 years ago
- Multi-angle c(q)uestion answering☆458Updated 3 years ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆872Updated last year