benywon/ChiQA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/benywon/ChiQA)

benywon / ChiQA

The implementations of various baselines in our CIKM 2022 paper: ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding.

☆34

Alternatives and similar repositories for ChiQA

Users that are interested in ChiQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bjoernpl / KOSMOS_reimplementation
View on GitHub
A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"
☆27Mar 3, 2023Updated 3 years ago
rmcong / CoADNet_NeurIPS20
View on GitHub
CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection
☆19Jan 8, 2021Updated 5 years ago
taotaoorange / words-matter-scene-text-for-image-classification
View on GitHub
☆10Apr 4, 2018Updated 8 years ago
SAGNIKMJR / ego-AV-spatial-correspondence
View on GitHub
[CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'
☆14Jun 16, 2024Updated 2 years ago
xiaoneil / LPNet
View on GitHub
☆13Nov 28, 2021Updated 4 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
facebookresearch / ego-env
View on GitHub
Human-centric environment representations from egocentric video
☆15Feb 5, 2026Updated 5 months ago
wzhouad / context-faithful-llm
View on GitHub
Code and data for paper "Context-faithful Prompting for Large Language Models".
☆41Mar 23, 2023Updated 3 years ago
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
BolinLai / CSTS
View on GitHub
[ECCV2024] The official implementation of "Listen to Look into the Future: Audio-Visual Egocentric Gaze Anticipation".
☆16Feb 24, 2025Updated last year
gyx-gloria / DMT
View on GitHub
Official Implementation of DMT: Dual Mean-Teacher in PyTorch.
☆10Oct 27, 2023Updated 2 years ago
runchu-tian / LongPiBench
View on GitHub
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14Dec 16, 2024Updated last year
iLearn-Lab / MM23-RTQ
View on GitHub
ACM Multimedia 2023 (Oral) - RTQ: Rethinking Video-language Understanding Based on Image-text Model
☆15Apr 7, 2026Updated 3 months ago
rtmdrr / replicability-analysis-NLP
View on GitHub
☆15Oct 19, 2020Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kai-wen-yang / IDAA
View on GitHub
[ICML2022] "Identity-Disentangled Adversarial Augmentation for Self-Supervised Learning"
☆10Jul 24, 2022Updated 4 years ago
stogiannidis / srbench
View on GitHub
Source code for the Paper "Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models"
☆19Feb 1, 2026Updated 5 months ago
SHTUPLUS / vsub
View on GitHub
The substitution of qsub.
☆12Jan 25, 2019Updated 7 years ago
PhoebusSi / SAR
View on GitHub
Code for our ACL2021 paper: "Check It Again: Progressive Visual Question Answering via Visual Entailment"
☆31Nov 24, 2021Updated 4 years ago
xiaojino / RUArt
View on GitHub
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering
☆10Nov 27, 2022Updated 3 years ago
oranshayer / BRRF
View on GitHub
Boundaries and Region Representation Fusion
☆12Mar 24, 2023Updated 3 years ago
kyocen / Graduation-Design-VQA-based-on-deep-learning
View on GitHub
毕业设计: 基于深度学习的视觉问答
☆13Jun 20, 2018Updated 8 years ago
gbmj / gbmj-timeline-cv
View on GitHub
A three-column graphical LaTeX2e resume
☆12Dec 12, 2019Updated 6 years ago
siddarth-c / FedGMA
View on GitHub
An FL algorithm inspired by FedGMA
☆11Oct 21, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zcxu-eric / Ego4d_TalkNet_ASD
View on GitHub
☆21Feb 15, 2022Updated 4 years ago
ofsoundof / CARN
View on GitHub
Tensorflow implementation of CARN
☆10Oct 3, 2018Updated 7 years ago
ksOAn6g5 / TaiSu
View on GitHub
TaiSu（太素）--a large-scale Chinese multimodal dataset（亿级大规模中文视觉语言预训练数据集）
☆192Nov 17, 2023Updated 2 years ago
Klitter / A-Bayesian-Federated-Learning-Framework-with-Online-Laplace-Approximation
View on GitHub
☆10Jul 21, 2021Updated 5 years ago
EGO4D / ego-exo4d-egopose
View on GitHub
☆18Apr 16, 2024Updated 2 years ago
JIA-Lab-research / Q-LLM
View on GitHub
This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"
☆54Jul 16, 2024Updated 2 years ago
ChanganVR / action2sound
View on GitHub
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
☆26Oct 1, 2024Updated last year
yellow-binary-tree / HawkEye
View on GitHub
Official implementation of HawkEye: Training Video-Text LLMs for Grounding Text in Videos
☆47Apr 29, 2024Updated 2 years ago
yzhangcs / master-thesis
View on GitHub
基于树形条件随机场的高阶句法分析
☆16Apr 28, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
zihuixue / AlignEgoExo
View on GitHub
Code and data release for the paper "Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Align…
☆19Apr 5, 2024Updated 2 years ago
nanduan / NLPCC-KBQA
View on GitHub
NLPCC-KBQA Dataset
☆15Dec 7, 2021Updated 4 years ago
lucasjinreal / gluon_ssd
View on GitHub
Implement SSD using Gluon in only 300 lines of codes!
☆10Nov 12, 2017Updated 8 years ago
OpenCausaLab / MORE
View on GitHub
☆15Jan 9, 2026Updated 6 months ago
ntunlp / Evaluation-of-ChatGPT
View on GitHub
A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets.
☆15Jul 10, 2023Updated 3 years ago
BapFL / code
View on GitHub
Implementation of BapFL: You can Backdoor Attack Personalized Federated Learning
☆15Sep 18, 2023Updated 2 years ago