Implementation of Qformer from BLIP2 in Zeta Lego blocks.
☆49Nov 11, 2024Updated last year
Alternatives and similar repositories for qformer
Users that are interested in qformer are comparing it to the libraries listed below.
- A CUDA kernel for NHWC GroupNorm for PyTorch☆23Nov 15, 2024Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Mar 11, 2024Updated 2 years ago
- A simple neural network framework written in C++.☆24Sep 16, 2023Updated 2 years ago
- Implementation of the Pairformer model used in AlphaFold 3☆14Updated this week
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆17Apr 2, 2025Updated 11 months ago
- [NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training☆26Dec 5, 2023Updated 2 years ago
- Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta☆16Nov 11, 2024Updated last year
- ☆11May 9, 2023Updated 2 years ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆19Nov 18, 2024Updated last year
- ☆15Jan 12, 2026Updated 2 months ago
- ☆17Oct 6, 2025Updated 5 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- Implementation of Materials Discovery with Extreme properties via AI-Driven Combinatorial Chemistry☆10May 8, 2024Updated last year
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- RetroDFM-R: Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning☆20Nov 22, 2025Updated 4 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated 3 months ago
- Jupyter notebooks for cloud-based usage☆10Aug 26, 2023Updated 2 years ago
- ☆13Apr 16, 2022Updated 3 years ago
- ☆19Jun 29, 2025Updated 9 months ago
- ☆16Jun 9, 2023Updated 2 years ago
- Pytorch Implementation of the Model from "MIRASOL3B: A MULTIMODAL AUTOREGRESSIVE MODEL FOR TIME-ALIGNED AND CONTEXTUAL MODALITIES"☆26Jan 27, 2025Updated last year
- ☆24Dec 23, 2024Updated last year
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- ☆12Mar 16, 2022Updated 4 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Aug 23, 2023Updated 2 years ago
- Curated collection of AI for Science papers, organized by research domains. Collects and categorizes the AI for Science papers shared on the WeChat official account 【你好不吃虾】.☆34Oct 28, 2025Updated 5 months ago
- Control LLM☆22Apr 6, 2025Updated 11 months ago
- PyTorch implementation of "GAMBA: MARRY GAUSSIAN SPLATTING WITH MAMBA FOR SINGLE-VIEW 3D RECONSTRUCTION"☆65Oct 6, 2025Updated 5 months ago
- Preprocessing of datasets of chemical reactions: standardization, filtering, augmentation, tokenization, etc.☆16Sep 10, 2025Updated 6 months ago
- The first comprehensive multimodal language analysis benchmark for evaluating foundation models☆29Sep 22, 2025Updated 6 months ago
- Repo for reproducing show and tell: neural image captioning☆11Dec 12, 2018Updated 7 years ago
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆14Nov 5, 2024Updated last year
- implementation of dualformer☆25Mar 1, 2025Updated last year
- Reasoning in Space via Grounding in the World (ICLR 2025)☆50Nov 3, 2025Updated 4 months ago
- Plancraft is a Minecraft environment and agent suite to test planning capabilities in LLMs☆26Nov 7, 2025Updated 4 months ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆34Feb 21, 2025Updated last year
- Implementation of Google's USM speech model in Pytorch☆35Mar 22, 2026Updated last week
- Neural Network Crystal Synthesizability Predictor (NNCSP)☆11Aug 29, 2024Updated last year