Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
☆16Nov 11, 2024Updated last year
Alternatives and similar repositories for VisionLLaMA
Users that are interested in VisionLLaMA are comparing it to the libraries listed below
Sorting:
- Community Implementation of the paper: "Multi-Head Mixture-of-Experts" In PyTorch☆29Jan 31, 2026Updated last month
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Nov 11, 2024Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Updated this week
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Feb 6, 2026Updated last month
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research☆52Jan 27, 2025Updated last year
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆122Jan 31, 2026Updated last month
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆19Apr 9, 2025Updated 11 months ago
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Mar 13, 2026Updated last week
- Implementation of PyTorch: "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"☆65Oct 6, 2025Updated 5 months ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆21Feb 9, 2026Updated last month
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Dec 9, 2023Updated 2 years ago
- Implementation of the premier Text to Video model from OpenAI☆56Nov 11, 2024Updated last year
- ☆11Sep 18, 2023Updated 2 years ago
- Shared edits for Discourse☆18Mar 13, 2026Updated last week
- ☆15Apr 26, 2025Updated 10 months ago
- Deep Molecular Dreaming☆26May 25, 2024Updated last year
- The next evolution of Agents☆48Updated this week
- A simple WebAssembly Linker in JavaScript☆17Jun 15, 2021Updated 4 years ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated this week
- a `cross application run time` for luau, designed to be easy to embed and sandbox☆14May 28, 2025Updated 9 months ago
- Add dynamic typing capabilities to C++☆12Jun 15, 2015Updated 10 years ago
- diffusion model baesd video-virtual-try-on☆26Feb 20, 2024Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Lua bindings for libusb☆14Jul 25, 2023Updated 2 years ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆16Feb 16, 2026Updated last month
- Azure DevOps workflow for ML☆20Mar 29, 2023Updated 2 years ago
- Publishes an automated Year in Review topic☆15Mar 13, 2026Updated last week
- Scan your Discourse uploads.☆13Updated this week
- Exploring SOTA Advanced RAG techniques: This project implements a self reflective RAG, seamlessly integrating multiple knowledge sources …☆20Jul 8, 2024Updated last year
- Breakdown of the PlayStation VR communication protocols for programmers☆15Oct 19, 2018Updated 7 years ago
- Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"☆26Updated this week
- Convert lua table to pointer, and iterate it.☆16Oct 17, 2016Updated 9 years ago
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Effortlessly Create Engaging and Informative Threads in Minutes☆14Feb 3, 2023Updated 3 years ago
- An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"☆36Jun 7, 2024Updated last year
- Thing To-Do After Install Ubuntu☆12Sep 9, 2023Updated 2 years ago
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Jan 27, 2025Updated last year
- Graphics math library for MoonGL☆16Nov 12, 2023Updated 2 years ago