Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
☆16Nov 11, 2024Updated last year
Alternatives and similar repositories for VisionLLaMA
Users that are interested in VisionLLaMA are comparing it to the libraries listed below
Sorting:
- Implementation of the Pairformer model used in AlphaFold 3☆14Feb 23, 2026Updated last week
- "Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023☆16Nov 28, 2024Updated last year
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆15Feb 23, 2026Updated last week
- Implementation of the text to video model LUMIERE from the paper: "A Space-Time Diffusion Model for Video Generation" by Google Research☆52Jan 27, 2025Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆19Apr 9, 2025Updated 10 months ago
- Train a production grade GPT in less than 400 lines of code. Better than Karpathy's verison and GIGAGPT☆16Feb 6, 2026Updated 3 weeks ago
- Implementation of MoE Mamba from the paper: "MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts" in Pytorch and Ze…☆120Jan 31, 2026Updated last month
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- Implementation of the model: "(MC-ViT)" from the paper: "Memory Consolidation Enables Long-Context Video Understanding"☆27Jan 17, 2026Updated last month
- SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network☆24Dec 9, 2023Updated 2 years ago
- Deep Molecular Dreaming☆26May 25, 2024Updated last year
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Feb 23, 2026Updated last week
- A simpler Pytorch + Zeta Implementation of the paper: "SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series…☆28Nov 11, 2024Updated last year
- Implementation of the paper: "BRAVE : Broadening the visual encoding of vision-language models"☆26Feb 6, 2026Updated 3 weeks ago
- Implementation of PyTorch: "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"☆65Oct 6, 2025Updated 4 months ago
- Demo repository showcasing how to use reusable workflows to build artifact attestations☆14Feb 16, 2026Updated 2 weeks ago
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- Self-evaluating RAG application on LangCheck docs☆11Sep 10, 2025Updated 5 months ago
- A local browser automation agent based on Microsoft Fara-7B model optimized for LM Studio inference.☆25Nov 25, 2025Updated 3 months ago
- Amplify your coding capabilities with AI - your smart co-pilot for an elevated coding experience.☆14Feb 18, 2026Updated last week
- IonQ iQuHACK 2024 Remote Challenge☆11Feb 3, 2024Updated 2 years ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆12Oct 20, 2024Updated last year
- PSI-MOD ontology for modified and unmodified amino acid residues☆14Jan 8, 2026Updated last month
- A NOMAD plugin containing base sections for material processing.☆11Jan 20, 2026Updated last month
- A framework for few-shot evaluation of autoregressive language models.☆12Jul 14, 2025Updated 7 months ago
- C and C++ to Luau compiler for Roblox.☆10Feb 6, 2024Updated 2 years ago
- ☆14Jul 2, 2023Updated 2 years ago
- gammcor code☆11Sep 25, 2025Updated 5 months ago
- Code and software used to design de novo protein nanomachines. Supplementary material for "Computational design of nanoscale rotational m…☆10Mar 19, 2022Updated 3 years ago
- ☆21Updated this week
- Focus handling and navigation library with React integration. This is a read-only mirror.☆15Dec 19, 2024Updated last year
- Implementation of the model "AudioFlamingo" from the paper: "Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dial…☆40Jan 27, 2025Updated last year
- A new repo to demonstrate tutorials for using HuggingFace on Graphcore IPUs.☆12May 3, 2023Updated 2 years ago
- Spatialyze: A Geospatial Video Analytic System with Spatial-Aware Optimizations☆11Mar 3, 2025Updated last year
- A python package for protein inference in Mass Spectrometric data analysis.☆10Jun 6, 2022Updated 3 years ago
- how to build a sentence embedding application using BentoML☆14Mar 31, 2025Updated 11 months ago
- Sample and Computation Redistribution for Efficient Face Detection☆16May 13, 2024Updated last year
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- A smaller, lighter-weight version of OpenClaw—natively multi-agent, compiles to Rust, and built on the Swarms framework and Swarms ecosys…☆35Updated this week