Aleph-Alpha-Research / magmaLinks
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
☆489Updated 2 months ago
Alternatives and similar repositories for magma
Users that are interested in magma are comparing it to the libraries listed below
Sorting:
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆537Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆310Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆317Updated last year
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆823Updated 2 years ago
- ☆351Updated 3 years ago
- Ask Me Anything language model prompting☆547Updated 2 years ago
- Language Modeling with the H3 State Space Model☆520Updated last year
- Implementation of ChatGPT, but tailored towards primary care medicine, with the reward being able to collect patient histories in a thoro…☆314Updated last year
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆869Updated last year
- ☆130Updated 3 years ago
- Internet Explorer explores the web in a self-supervised manner to progressively find relevant examples that improve performance on a desi…☆163Updated 2 years ago
- Aim for the moon. If you miss, you may hit a star.☆165Updated 2 years ago
- ☆141Updated 2 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆90Updated 2 years ago
- Code release for "Dropout Reduces Underfitting"☆313Updated 2 years ago
- Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch☆229Updated 10 months ago
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆634Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆550Updated 2 years ago
- 🧀 Code and models for the ICML 2023 paper "Grounding Language Models to Images for Multimodal Inputs and Outputs".☆482Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.☆168Updated last month
- A crude RLHF layer on top of nanoGPT with Gumbel-Softmax trick☆291Updated last year
- Place where folks can contribute to 🤗 community events☆424Updated last year
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Simple Annotated implementation of GPT-NeoX in PyTorch☆110Updated 2 years ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆711Updated last year
- This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…☆352Updated 2 years ago
- CLIP (Contrastive Language–Image Pre-training) for Italian☆186Updated 2 years ago
- [WIP] A 🔥 interface for running code in the cloud☆85Updated 2 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆437Updated 2 years ago
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆146Updated 2 years ago