FunQA benchmarks funny, creative, and magic videos for challenging tasks including timestamp localization, video description, reasoning, and beyond.
☆104Dec 25, 2025Updated 2 months ago
Alternatives and similar repositories for FunQA
Users that are interested in FunQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Syphus: Automatic Instruction-Response Generation Pipeline☆14Dec 14, 2023Updated 2 years ago
- Benchmarking and Analyzing Generative Data for Visual Recognition☆26Jul 25, 2023Updated 2 years ago
- Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.☆461Jul 4, 2023Updated 2 years ago
- On-Device Domain Generalization☆46Nov 9, 2022Updated 3 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- [ECCV 2022] StyleLight: HDR Panorama Generation for Lighting Estimation and Editing☆148Oct 9, 2023Updated 2 years ago
- [CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception☆124Jun 23, 2022Updated 3 years ago
- General video interaction platform based on LLMs, including Video ChatGPT☆256Jul 26, 2023Updated 2 years ago
- 🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing imp…☆3,344Mar 5, 2024Updated 2 years ago
- Toolbox for HuMMan Dataset☆126Dec 7, 2024Updated last year
- [NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"☆105Nov 9, 2023Updated 2 years ago
- [ICML 2025] Streamline Without Sacrifice - Squeeze out Computation Redundancy in LMM☆20May 22, 2025Updated 10 months ago
- Official Code for "Digital Life Project: Autonomous 3D Characters with Social Intelligence"☆44Sep 9, 2024Updated last year
- [TPAMI] Searching prompt modules for parameter-efficient transfer learning.☆238Dec 8, 2023Updated 2 years ago
- Toolbox for GTA-Human Datasets☆26Oct 9, 2024Updated last year
- A framework that allows you to apply Sparse AutoEncoder on any models☆51Jul 11, 2025Updated 8 months ago
- Code for paper "Half-Physics: Enabling Kinematic 3D Human Model with Physical Interactions". Coming soon.☆33Jul 31, 2025Updated 7 months ago
- ☆77May 4, 2025Updated 10 months ago
- Code for our IJCV paper "HumanLiff: Layer-wise 3D Human Generation with Diffusion Model"☆53May 6, 2024Updated last year
- [IJCV 2025] Code for DeepFake-Adapter: Dual-Level Adapter for DeepFake Detection☆60Dec 24, 2024Updated last year
- [ECCV2022] New benchmark for evaluating pre-trained model; New supervised contrastive learning framework.☆110Dec 8, 2023Updated 2 years ago
- ConsistentNeRF Enhances Neural Radiance Fields with 3D Consistency for Sparse View Synthesis☆75Oct 12, 2023Updated 2 years ago
- Long Context Transfer from Language to Vision☆402Mar 18, 2025Updated last year
- A local AI assistant running on your device. It turns your files into actionable memory.☆54Mar 14, 2026Updated last week
- [NeurIPS 2023] Official Code for "Towards Robust and Expressive Whole-body Human Pose and Shape Estimation"☆51Feb 13, 2026Updated last month
- [NeurIPS 2025] GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data☆85Sep 24, 2025Updated 5 months ago
- [ECCV 2022 & IJCV 2025] PyTorch code for SeqDeepFake: Detecting and Recovering Sequential DeepFake Manipulation☆150Dec 3, 2024Updated last year
- [NeurIPS 2021] ORL: Unsupervised Object-Level Representation Learning from Scene Images☆58Dec 6, 2021Updated 4 years ago
- The official repository of "Video assistant towards large language model makes everything easy"☆232Dec 24, 2024Updated last year
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated last year
- [ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.☆191Sep 26, 2025Updated 5 months ago
- Official code release for DeformToon3D: Deformable 3D Toonification from Neural Radiance Fields (ICCV 2023)☆56May 24, 2024Updated last year
- ☆27Oct 5, 2023Updated 2 years ago
- ☆157Oct 31, 2024Updated last year
- [CVPR2023] All in One: Exploring Unified Video-Language Pre-training☆281Mar 25, 2023Updated 2 years ago
- NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR'21)☆185Aug 2, 2025Updated 7 months ago
- Official Implementation of ICCV 2023 paper "StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation"☆23May 10, 2024Updated last year
- A lightweight flexible Video-MLLM developed by TencentQQ Multimedia Research Team.☆74Oct 14, 2024Updated last year
- Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".☆59Jun 27, 2023Updated 2 years ago