Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation (NeurIPS 2023)
☆22Oct 1, 2023Updated 2 years ago
Alternatives and similar repositories for SMILE
Users who are interested in SMILE are comparing it to the repositories listed below.
- Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)☆57Oct 28, 2024Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆33Jul 21, 2023Updated 2 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- This is the implementation of the visual model mentioned in our paper 'Automated Radiology Report Generation using Conditioned Transforme…☆10Jul 25, 2024Updated last year
- [ACM MM 2025] TimeChat-online: 80% Visual Tokens are Naturally Redundant in Streaming Videos☆117Dec 12, 2025Updated 2 months ago
- CVPR 2021 Official Pytorch Code for UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training☆34Nov 9, 2021Updated 4 years ago
- Partially Non-Autoregressive Image Captioning☆10Sep 30, 2021Updated 4 years ago
- ☆10Jul 23, 2019Updated 6 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Official repository for 'Risk of Bias in Chest Radiography Deep Learning Foundation Models'☆12Sep 27, 2023Updated 2 years ago
- Generalization in Metric Learning: Should the Embedding Layer be the Embedding Layer?☆11Jan 3, 2019Updated 7 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fou…☆30Feb 1, 2026Updated last month
- [ACM MM2025]: Unleashing the Power of Data Generation in One-Pass Outdoor LiDAR Localization☆18Oct 29, 2025Updated 4 months ago
- The Ecoacoustic Dataset from Arctic North Slope Alaska☆11May 29, 2025Updated 9 months ago
- NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks, CVPR 2022 (Oral)☆50Jan 30, 2024Updated 2 years ago
- The PyTorch implementation of DSM (EMNLP 2022).☆10Mar 26, 2024Updated last year
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated last year
- ChangeIt dataset with more than 2600 hours of video with state-changing actions published at CVPR 2022☆11Mar 23, 2022Updated 3 years ago
- Implementation of PPO for CartPole-v1☆10Jan 1, 2019Updated 7 years ago
- Running various MACHINE LEARNING algorithms and comparing their performance☆10Sep 26, 2018Updated 7 years ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- Testing Difference Target Propagation (DTP) on MNIST.☆12Oct 12, 2020Updated 5 years ago
- ☆33Jan 9, 2026Updated last month
- Simple, convenient, configurable SSH server selection for the macOS Terminal, GNOME Terminal, and iTerm2 on macOS.☆12Aug 13, 2021Updated 4 years ago
- Code for the paper "Refining Language Model with Compositional Explanation" (NeurIPS 2021)☆12Oct 25, 2021Updated 4 years ago
- Learn to find the notes on your new musical instrument☆10May 21, 2020Updated 5 years ago
- The official source code of our AAAI25 paper "D&M: Enriching E-commerce Videos with Sound Effects by Key Moment Detection and SFX Matchin…☆10Feb 9, 2025Updated last year
- Database of annotated field recording samples that can be used for training audio labelling algorithms☆10Feb 1, 2019Updated 7 years ago
- Cross-Platform Annotation Tool for Person Search Datasets☆11Aug 29, 2017Updated 8 years ago
- Accurate spatial quantification in computational pathology with multiple instance learning☆28Nov 19, 2025Updated 3 months ago
- DayBit is a text-based interactive game that uses Tornado as its backend framework.☆13Feb 25, 2016Updated 10 years ago
- Splits a PDF into color and black-and-white parts for easier printing.☆11Mar 9, 2025Updated 11 months ago
- Source code for Multi-resolution Common Fate Transform.☆12Jun 5, 2020Updated 5 years ago
- Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions☆22Feb 11, 2026Updated 2 weeks ago
- My defense presentation☆10Mar 7, 2022Updated 3 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- ☆12Jun 1, 2024Updated last year
- App for cataloguing vintage cameras, lenses, films, negatives & prints☆13Feb 13, 2026Updated 2 weeks ago