Aleph-Alpha-Research / magmaLinks
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
☆489Updated 3 months ago
Alternatives and similar repositories for magma
Users that are interested in magma are comparing it to the libraries listed below
Sorting:
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆540Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆307Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆320Updated last year
- ☆356Updated 3 years ago
- Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.☆278Updated 3 years ago
- Ask Me Anything language model prompting☆545Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆824Updated 3 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.☆169Updated last month
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆89Updated 2 years ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆230Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star.☆163Updated 2 years ago
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆634Updated 2 years ago
- Language Modeling with the H3 State Space Model☆518Updated 2 years ago
- Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT☆222Updated last year
- Code release for "Dropout Reduces Underfitting"☆315Updated 2 years ago
- Production-ready data processing made easy and shareable☆353Updated last year
- Research code for pixel-based encoders of language (PIXEL)☆344Updated 3 months ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Explorations into training LLMs to use clinical calculators from patient history, using open sourced models. Will start with Wells' Crite…☆315Updated 2 months ago
- ☆131Updated 3 years ago
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆146Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆550Updated 2 years ago
- Exploring finetuning public checkpoints on filter 8K sequences on Pile☆115Updated 2 years ago
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆875Updated 2 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆436Updated 2 years ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆167Updated 2 years ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- ☆250Updated 2 years ago
- Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch☆417Updated 10 months ago