PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆28Apr 20, 2026Updated 2 weeks ago
Alternatives and similar repositories for MM1
Users that are interested in MM1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- A vast array of Multi-Modal Embodied Robotic Foundation Models!☆28Mar 18, 2024Updated 2 years ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Apr 20, 2026Updated 2 weeks ago
- A simple to use package to call various model providers such as openai, anthropic, and others with utmost reliability, security, and perf…☆13Oct 6, 2025Updated 6 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆55Oct 13, 2025Updated 6 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code and model for paper <Mutual Information Maximization for Effective Lip Reading>☆19Sep 4, 2020Updated 5 years ago
- Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"☆43Nov 11, 2024Updated last year
- CUDA implementation of autoregressive linear attention, with all the latest research findings☆46May 23, 2023Updated 2 years ago
- TF code for our CVPR2020 paper "Discriminative Multi-modality Speech Recognition"☆26Apr 27, 2022Updated 4 years ago
- An implementation of the base GPT-3 Model architecture from the paper by OPENAI "Language Models are Few-Shot Learners"☆20Jun 29, 2024Updated last year
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆30Apr 13, 2026Updated 3 weeks ago
- The Swarm Ecosystem☆30Aug 1, 2024Updated last year
- The project proposal template for OpenBioML community projects.☆18Feb 9, 2023Updated 3 years ago
- Face de-occlusion using 3D morphable model and generative adversarial network☆34Oct 22, 2021Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 3 years ago
- Code/data for MARG (multi-agent review generation)☆63Mar 5, 2026Updated last month
- ☆26Mar 14, 2024Updated 2 years ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆25Apr 20, 2026Updated 2 weeks ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆18Apr 27, 2026Updated last week
- Community Open Source Implementation of GPT4o in PyTorch☆29Apr 20, 2026Updated 2 weeks ago
- Generates random utf-8 strings for fuzz t�sting character encoding probl�ms☆11Aug 21, 2015Updated 10 years ago
- Segmenting a given document using recursive xy-cut algorithm.☆12Oct 9, 2018Updated 7 years ago
- A PyTorch re-implementation of Weakly Supervised Facial Action Unit Recognition through Adversarial Training☆10Apr 23, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆59Apr 20, 2026Updated 2 weeks ago
- MelGAN and Tacotron 2 in PyTorch☆11Oct 22, 2019Updated 6 years ago
- Implementation of PyTorch: "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"☆65Oct 6, 2025Updated 6 months ago
- Inference Llama 2 in one file of pure C. Nahh wait, now fresh in Julia!☆25Aug 2, 2023Updated 2 years ago
- Low-latency live streaming PoC☆11Jul 30, 2019Updated 6 years ago
- ☆17Feb 2, 2024Updated 2 years ago
- Yet another LLM☆10Apr 6, 2023Updated 3 years ago
- Smart spinner component for Qwik, to manage the duration of loading states.☆13Sep 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Aug 10, 2023Updated 2 years ago
- ☆13Jun 16, 2021Updated 4 years ago
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆31Nov 11, 2024Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆30Apr 13, 2026Updated 2 weeks ago