xid32 / NAACL_2025_TWMView external linksLinks
We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFMs). This plug-and-play module can be easily integrated into existing MFMs. With our TWM, nine state-of-the-art models exhibit significant performance improvements across QA, captioning, and retrieval tasks.
☆313Nov 26, 2025Updated 2 months ago
Alternatives and similar repositories for NAACL_2025_TWM
Users that are interested in NAACL_2025_TWM are comparing it to the libraries listed below
Sorting:
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆1,105Nov 26, 2025Updated 2 months ago
- ☆176Feb 21, 2025Updated 11 months ago
- ☆104Jan 24, 2025Updated last year
- PhishIntention: Phishing detection through webpage intention☆255Jan 5, 2026Updated last month
- A python package that integrate algorithms and various machine learning approaches to extract features (genes) effective for classificati…☆252Jan 15, 2026Updated last month
- Simple yet powerful Twitter data retrieval SDK with multi-language support.No Limits, No Auth Required☆183Jan 6, 2025Updated last year
- ☆98Mar 8, 2025Updated 11 months ago
- ☆247Nov 24, 2024Updated last year
- Vim mode for VSCode, run Vim/Nvim in integrated terminal with seamless switching☆120Apr 30, 2025Updated 9 months ago
- ☆297Sep 14, 2025Updated 5 months ago
- One-click training of your own GPT. Training a GPT has never been easier for beginners. / 一键预训练+SFT一个属于自己的LLM,0基础训练GPT原来可以这么简单?☆364Feb 4, 2026Updated last week
- ☆134Feb 15, 2025Updated last year
- An Workspace for HMI tools☆164Jul 11, 2024Updated last year
- ☆279Apr 29, 2025Updated 9 months ago
- kight is a static analysis tool for c/c++ programs.☆214Dec 27, 2024Updated last year
- Advanced Unsupervised Image Enhancement with GAN☆247Nov 11, 2024Updated last year
- Evaluation of Text-to-Video Generation Models: A Dynamics Perspective[NeurIPS 2024].☆274Dec 3, 2024Updated last year
- 一个超超超好用的 uniapp 开发框架:uni-plus 是由 Uniapp + Vue3 + TS + Vite + Pinia + Unocss + WotUi 驱动的跨端快速启动模板,使用 VS Code 开发,具有丰富的代码提示、错误校验、类型提醒、预先插件安装、…☆272Mar 14, 2025Updated 11 months ago
- MetaTrx: Comprehensive Cross-Species Transcriptome Analysis☆118Jun 4, 2024Updated last year
- GENERanno: A Genomic Foundation Model for Metagenomic Annotation☆306Updated this week
- A public good tool to help users verify Safe (Gnosis Safe) transactions before signing or execution.☆525May 22, 2025Updated 8 months ago
- [ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance☆148Feb 1, 2025Updated last year
- A React-based virtual avatar component for real-time gameplay analysis and emotional support. Integrate with screen capture to provide in…☆149Jan 9, 2025Updated last year
- GENERator: A Long-Context Generative Genomic Foundation Model☆445Updated this week
- Official Code of "GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering"☆112Nov 21, 2025Updated 2 months ago
- ☆75Feb 17, 2025Updated 11 months ago
- Deep Reinforcement Learning Algorithms for solving Atari 2600 Games☆143Mar 23, 2023Updated 2 years ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆533Updated this week
- 【 ICLR 2025 】I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength☆114Mar 8, 2025Updated 11 months ago
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- ☆391May 5, 2025Updated 9 months ago
- YiTu is an easy-to-use runtime to fully exploit the hybrid parallelism of different hardwares (e.g., GPU) to efficiently support the exec…☆254Jan 7, 2026Updated last month
- ☆251Feb 11, 2025Updated last year
- A curated list of awesome papers related to adversarial attacks and defenses for information retrieval. If I missed any papers, feel free…☆221Jul 11, 2024Updated last year
- Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…☆206Jan 15, 2026Updated last month
- ☆142May 8, 2024Updated last year
- Improvements to animations based on Manim, designed to facilitate the demonstration of algorithms in data structures, operating systems, …☆207Dec 15, 2025Updated 2 months ago
- ☆135May 6, 2024Updated last year
- JavaScript Runtime Environment In Embedded Device☆382Feb 3, 2026Updated last week