layer6ai-labs/fusemix

Data-Efficient Multimodal Fusion on a Single GPU

☆68

Alternatives and similar repositories for fusemix

Users who are interested in fusemix are comparing it to the repositories listed below.

layer6ai-labs / fair-dp
Code accompanying the paper "Disparate Impact in Differential Privacy from Gradient Misalignment".
☆11 · Apr 4, 2023 · Updated 3 years ago
layer6ai-labs / calo-forest
A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.
☆22 · Mar 23, 2026 · Updated 4 months ago
layer6ai-labs / TabDPT-training
Training code for "TabDPT: Scaling Tabular Foundation Models on Real Data"
☆61 · Aug 3, 2025 · Updated 11 months ago
layer6ai-labs / TabDPT-inference
Inference code for "TabDPT: Scaling Tabular Foundation Models on Real Data"
☆100 · Updated this week
MNSfuxiang / MFN
A multimodal fine-grained correlation fusion network with attention mechanisms for visual-textual sentiment analysis
☆10 · Jan 13, 2024 · Updated 2 years ago
arumaekawa / DiLM
Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted to NAACL 2024 Findings).
☆28 · Feb 10, 2025 · Updated last year
LHL3341 / ContextBLIP
ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions [ACL 2024]
☆11 · May 17, 2024 · Updated 2 years ago
draw2think / harness-geometry
Implementation code for the paper "Draw2Think: Harnessing Geometry Reasoning through Constraint Engine Interaction"
☆17 · May 28, 2026 · Updated last month
layer6ai-labs / msc-sql
Text-2-SQL
☆19 · Feb 21, 2025 · Updated last year
ExplainableML / fomo_in_flux
Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]
☆62 · Dec 10, 2024 · Updated last year
gyuilLim / Youtube-scene-search-with-text
Finding scenes that you want by text automatically
☆10 · Jan 13, 2025 · Updated last year
Zi-hao-Wei / Efficient-Vision-Language-Pre-training-by-Cluster-Masking
[CVPR 2024] Improving language-visual pretraining efficiency by performing cluster-based masking on images.
☆33 · May 16, 2024 · Updated 2 years ago
BrandonHanx / FAME-ViL
[CVPR 2023 (Highlight)] FAME-ViL: Multi-Tasking V+L Model for Heterogeneous Fashion Tasks
☆56 · Sep 28, 2023 · Updated 2 years ago
m1k2zoo / negbench
Evaluation and dataset construction code for the CVPR 2025 paper "Vision-Language Models Do Not Understand Negation"
☆47 · Feb 26, 2026 · Updated 4 months ago
tanvir-utexas / PaPr
☆13 · Jul 3, 2024 · Updated 2 years ago
elisakreiss / concadia
☆16 · Jan 3, 2023 · Updated 3 years ago
layer6ai-labs / dgm_geometry
☆18 · Jan 20, 2025 · Updated last year
layer6ai-labs / lfr
Code for the ICLR'24 paper "Self-supervised Representation Learning From Random Data Projectors"
☆16 · Mar 16, 2024 · Updated 2 years ago
stoneMo / DeepAVFusion
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
☆43 · Aug 2, 2024 · Updated last year
hustvl / WeakCLIP
[IJCV 2024]
☆21 · Nov 11, 2024 · Updated last year
Dalia-Sher / Speech-Emotion-Recognition-using-BLSTM-with-Attention
We present a study of a neural network based method for speech emotion recognition, using audio-only features. In the studied scheme, the…
☆11 · Jul 24, 2024 · Updated 2 years ago
yiren-jian / BLIText
[NeurIPS 2023] Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
☆26 · Dec 5, 2023 · Updated 2 years ago
hsiehjackson / Mr.Right
Mr. Right: Multimodal Retrieval on Representation of ImaGe witH Text
☆24 · Aug 15, 2022 · Updated 3 years ago
vkhoi / cora_cvpr24
☆28 · Sep 3, 2024 · Updated last year
uvavision / SyViC
[ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data
☆13 · Sep 30, 2023 · Updated 2 years ago
QingyangZhang / QMF
[ICML 2023] Provable Dynamic Fusion for Low-Quality Multimodal Data
☆126 · Jun 28, 2025 · Updated last year
VITA-Group / RefPaint
☆15 · Jul 21, 2023 · Updated 3 years ago
friedrichor / UNITE
Official code for "Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval"
☆42 · Jul 4, 2025 · Updated last year
omipan / svl_adapter
SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models
☆21 · Jan 11, 2024 · Updated 2 years ago
mzhaoshuai / RLCF
[ICLR 2024] Test-Time RL with CLIP Feedback for Vision-Language Models.
☆102 · Oct 20, 2025 · Updated 9 months ago
Paranioar / RCAR
[TIP 2023] The code of “Plug-and-Play Regulators for Image-Text Matching”
☆34 · Apr 11, 2024 · Updated 2 years ago
bizerfr / BPNet
☆19 · Apr 26, 2026 · Updated 2 months ago
adobe-research / llava-score
☆11 · Oct 2, 2024 · Updated last year
ashawkey / grid_put
An operation trying to do the opposite of F.grid_sample
☆20 · Aug 8, 2023 · Updated 2 years ago
VinAIResearch / selfsup_pcd
Self-Supervised Learning with Multi-View Rendering for 3D Point Cloud Analysis (ACCV 2022)
☆11 · Jul 22, 2024 · Updated 2 years ago
Janie1996 / AV4SER
PyTorch implementation for Audio-Visual Domain Adaptation Feature Fusion for Speech Emotion Recognition
☆12 · Mar 20, 2022 · Updated 4 years ago
Chenguoz / Keypoints
[NN 2024] Code Release of Unsupervised Distribution-aware Keypoints Generation from 3D Point Clouds
☆11 · Feb 20, 2024 · Updated 2 years ago
nianfd / RWKV-VG
☆10 · Dec 3, 2024 · Updated last year
3DCoMPaT200 / 3DCoMPaT200
☆15 · Feb 13, 2025 · Updated last year