Aleph-Alpha-Research / magmaLinks
MAGMA - a GPT-style multimodal model that can understand any combination of images and language. NOTE: The freely available model from this repo is only a demo. For the latest multimodal and multilingual models from Aleph Alpha check out our website https://app.aleph-alpha.com
☆489Updated 5 months ago
Alternatives and similar repositories for magma
Users that are interested in magma are comparing it to the libraries listed below
Sorting:
- Used for adaptive human in the loop evaluation of language and embedding models.☆308Updated 2 years ago
- Implementation of Parti, Google's pure attention-based text-to-image neural network, in Pytorch☆537Updated 2 years ago
- Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...☆322Updated 2 years ago
- ☆355Updated 3 years ago
- Aim for the moon. If you miss, you may hit a star.☆164Updated 2 years ago
- Ask Me Anything language model prompting☆546Updated 2 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆828Updated 3 years ago
- Research code for pixel-based encoders of language (PIXEL)☆345Updated 5 months ago
- Multi-angle c(q)uestion answering☆456Updated 3 years ago
- [WIP] A 🔥 interface for running code in the cloud☆86Updated 2 years ago
- Language Modeling with the H3 State Space Model☆521Updated 2 years ago
- CLIP (Contrastive Language–Image Pre-training) for Italian☆185Updated 2 years ago
- DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.☆171Updated 3 months ago
- ☆141Updated 3 years ago
- O-GIA is an umbrella for research, infrastructure and projects ecosystem that should provide open source, reproducible datasets, models, …☆87Updated 2 years ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Updated 2 years ago
- Production-ready data processing made easy and shareable☆358Updated last year
- A suite of tools for managing crowdsourcing tasks from the inception through to data packaging for research use.☆312Updated last year
- Here is a collection of checkpoints for DALLE-pytorch models, from where you can keep on training or start generating images.☆146Updated 3 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆179Updated 2 years ago
- ☆249Updated 2 years ago
- Open-AI's DALL-E for large scale training in mesh-tensorflow.☆430Updated 3 years ago
- Diffusers-Interpret 🤗🧨🕵️♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.☆280Updated 3 years ago
- Implementation of RETRO, Deepmind's Retrieval based Attention net, in Pytorch☆876Updated 2 years ago
- A concise but complete implementation of CLIP with various experimental improvements from recent papers☆721Updated 2 years ago
- Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch☆549Updated 2 years ago
- Implementation of the deepmind Flamingo vision-language model, based on Hugging Face language models and ready for training☆168Updated 2 years ago
- Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpe…☆435Updated 2 years ago
- Explorations into training LLMs to use clinical calculators from patient history, using open sourced models. Will start with Wells' Crite…☆316Updated 4 months ago
- Salesforce open-source LLMs with 8k sequence length.☆723Updated 11 months ago