SMILE: A Multimodal Dataset for Understanding Laughter
☆13Jun 15, 2023Updated 2 years ago
Alternatives and similar repositories for SMILE
Users that are interested in SMILE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- ☆16Jun 5, 2022Updated 3 years ago
- ☆13Jul 20, 2024Updated last year
- showing how to use CLIP-Vip to do video search☆16Nov 16, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation)☆16Jan 18, 2024Updated 2 years ago
- Dataset splits and evaluation code for the paper "Benchmark for Compositional Text-to-Image Synthesis" (NeurIPS 2021)☆45May 3, 2022Updated 4 years ago
- ☆54Jul 31, 2022Updated 3 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆23Nov 8, 2023Updated 2 years ago
- collection of pitch (f0, fundamental frequency) detection algorithms with unified interface☆25Nov 25, 2024Updated last year
- COLA: Evaluate how well your vision-language model can Compose Objects Localized with Attributes!☆25Nov 23, 2024Updated last year
- An official codebase for "NormLens: Reading Books is Great, But Not if You Are Driving! Visually Grounded Reasoning about Defeasible Comm…☆10May 9, 2024Updated 2 years ago
- A python implementation of PROCLUS: PROjected CLUStering algorithm.☆10Jan 12, 2015Updated 11 years ago
- Official PyTorch implementation of Vision DiffMask, a post-hoc interpretation method for vision models.☆32Mar 5, 2024Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017☆14Aug 7, 2018Updated 7 years ago
- Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)☆23Jul 16, 2025Updated 10 months ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- This repository contains the Adverbs in Recipes (AIR) dataset and the code published at the CVPR 23 paper: "Learning Action Changes by Me…☆13May 25, 2023Updated 2 years ago
- ☆11Jul 31, 2022Updated 3 years ago
- We build a novel self-supervised segmentation pipeline to segment transparent liquids (clear water) placed inside transparent containers.☆26Nov 22, 2022Updated 3 years ago
- Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation☆42Dec 23, 2023Updated 2 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆12Dec 15, 2021Updated 4 years ago
- A unix pipeline utils based on LLM☆16May 15, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 4 years ago
- [Corca / OR] Solver for Multi-dimensional Multi-demand Quadratic Knapsack Problems☆12Mar 22, 2022Updated 4 years ago
- ☆31Mar 24, 2022Updated 4 years ago
- [CVPR23 Highlight] CREPE: Can Vision-Language Foundation Models Reason Compositionally?☆35Apr 27, 2023Updated 3 years ago
- ☆21Nov 24, 2022Updated 3 years ago
- Online visual analytics tool designed to investigate how attention maps in transformer models behaves, and build hypothesis on those mode…☆10Nov 10, 2021Updated 4 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆26Jul 11, 2023Updated 2 years ago
- NNVisBuilder and some cases including KD-t☆12Nov 18, 2023Updated 2 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆30Dec 1, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Labeled Movie Trailer Dataset☆16Mar 23, 2018Updated 8 years ago
- IRFL: Image Recognition of Figurative Language☆11Nov 30, 2023Updated 2 years ago
- Data repository for the VALSE benchmark.☆38Feb 15, 2024Updated 2 years ago
- [CAC2023] Bilateral Network with Residual U-blocks and Dual-Guided Attention for Real-time Semantic Segmentation☆11Nov 28, 2024Updated last year
- Temporal Compact Bilinear Pooling (TCBP)☆11May 27, 2020Updated 5 years ago
- ☆12Apr 10, 2024Updated 2 years ago
- ☆40Apr 14, 2025Updated last year