ShramanPramanick/VoLTA

Code release for "VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment" [TMLR, 2023]

☆12

Alternatives and similar repositories for VoLTA

Users who are interested in VoLTA are comparing it to the repositories listed below.

facebookresearch / EgoVLPv2
Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]
☆110 · Jul 2, 2024 · Updated 2 years ago
schowdhury671 / APoLLo
☆24 · Jun 12, 2024 · Updated 2 years ago
google / spiqa
Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]
☆76 · Jan 13, 2025 · Updated last year
sapmitra / IDEA_VHDL
used VHDL to implement and simulate the IDEA-algorithm (International Data Encryption Algorithm). We will test the hardware-oriented impl…
☆12 · Jan 15, 2019 · Updated 7 years ago
pierpaolomori / SemanticSegmentationFPGA
☆11 · Sep 3, 2022 · Updated 3 years ago
ivanmontero / autobot
Implementation of the paper 'Sentence Bottleneck Autoencoders from Transformer Language Models'
☆17 · Mar 14, 2022 · Updated 4 years ago
sayannag / EEG
power spectrum using fft, waveforms and amplitude spectrums for delta, alpha, gamma, beta and theta bands
☆12 · Sep 28, 2016 · Updated 9 years ago
mirthAI / GDM-VE
☆21 · Mar 11, 2025 · Updated last year
YUECHE77 / SPIN
[EMNLP 2025 Main Conference] Mitigating Hallucinations in Vision-Language Models through Image-Guided Head Suppression
☆16 · Dec 26, 2025 · Updated 6 months ago
jnzs1836 / intent-vizor
☆16 · Jul 10, 2024 · Updated 2 years ago
sahilmishra0012 / OpenDisplay
Open-source macOS display manager. DDC brightness/contrast/volume, resolution switching, window tiling, night shift, HDR, profiles, CLI. …
☆20 · Apr 3, 2026 · Updated 3 months ago
royshil / morethantechnical
Automatically exported from code.google.com/p/morethantechnical
☆17 · Mar 16, 2015 · Updated 11 years ago
chemcognition-lab / pom-mix
☆17 · Jul 4, 2025 · Updated last year
JulienGenovese / JulienGenovese
In this repository we have all the codes that we have developed
☆12 · Sep 13, 2023 · Updated 2 years ago
tanmaybinaykiya / Ask-Me-Question-Generating-Agent
Ask Me: Question Generating Agent
☆14 · Jan 10, 2019 · Updated 7 years ago
PanZaifeng / KVFlow
☆28 · Mar 12, 2026 · Updated 4 months ago
schowdhury671 / meerkat
☆35 · Jul 9, 2025 · Updated last year
hjy-u / ETOG
[ICRA 2025] A Parameter-Efficient Tuning Framework for Language-guided Object Grounding and Robot Grasping
☆13 · Feb 7, 2025 · Updated last year
shikras / d-cube
A detection/segmentation dataset with labels characterized by intricate and flexible expressions. "Described Object Detection: Liberating…
☆138 · Mar 20, 2024 · Updated 2 years ago
lgresearch / QASA
☆33 · Oct 30, 2023 · Updated 2 years ago
JinhaoLee / WCA
[ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models
☆19 · Mar 23, 2026 · Updated 4 months ago
uvadlc / uvadlc_practicals_2021
Repository for the code assignment of the Deep Learning 1 course, Fall 2021 edition
☆10 · Oct 31, 2022 · Updated 3 years ago
ArmanZarei / SliderEdit
☆33 · Apr 21, 2026 · Updated 3 months ago
LCS2-IIITD / MOMENTA
☆23 · Apr 8, 2024 · Updated 2 years ago
schlaile / concatfs
concatfs FUSE driver for easy file concatenation (like large movie files)
☆56 · Jun 16, 2023 · Updated 3 years ago
GeWu-Lab / Certifiable-Robust-Multi-modal-Training
A python implement for Certifiable Robust Multi-modal Training
☆20 · Jun 21, 2025 · Updated last year
niklasb / webkit-server
[not actively maintained] The C++ webkit-server from capybara-webkit with useful extensions and Python bindings
☆48 · Jan 4, 2021 · Updated 5 years ago
moewiee / RSNA2020-Team-VinBDI-MedicalImaging
☆10 · Oct 27, 2020 · Updated 5 years ago
platformxlab / NeuSim
An open-source simulator framework for neural processing units
☆46 · Jun 22, 2026 · Updated last month
yafeng19 / T-CORE
[CVPR 2025] PyTorch implementation of T-CORE, introduced in "When the Future Becomes the Past: Taming Temporal Correspondence for Self-su…
☆19 · Nov 4, 2025 · Updated 8 months ago
tmdemelo / pydcm
Dynamic Causal Modeling with Python
☆44 · Sep 19, 2019 · Updated 6 years ago
cherishleon / cvpr25_medical_paper
☆26 · Jun 4, 2026 · Updated last month
LinjieMu / MMXU
☆25 · Nov 27, 2025 · Updated 7 months ago
AIDASLab / Medic-AD
[CVPR 2026 Oral] Official implementation for "MEDIC-AD: Towards Medical Vision-Language Model's Clinical Intelligence"
☆28 · Apr 9, 2026 · Updated 3 months ago
ShramanPramanick / Transformer_Based_Geo-localization
This is the repository for ECCV2022 paper titled: "Where in the World is this Image? Transformer-based Geo-localization in the Wild".
☆47 · Apr 24, 2023 · Updated 3 years ago
szkocot / Adapting-Auxiliary-Losses-Using-Gradient-Similarity
Implementation "Adapting Auxiliary Losses Using Gradient Similarity" article
☆33 · Mar 1, 2019 · Updated 7 years ago
showlab / videogui
[NeurIPS 2024 D&B] VideoGUI: A Benchmark for GUI Automation from Instructional Videos
☆53 · Feb 22, 2026 · Updated 5 months ago
allenai / qasper-led-baseline
☆61 · Oct 27, 2021 · Updated 4 years ago
Holipori / EKAID
code for Expert Knowledge-Aware Image Difference Graph Representation Learning for Difference-Aware Medical Visual Question Answering
☆29 · May 30, 2025 · Updated last year