Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"
☆25Mar 9, 2024Updated 2 years ago
Alternatives and similar repositories for VSD
Users that are interested in VSD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Mar 2, 2026Updated 2 months ago
- ☆19Jul 5, 2023Updated 2 years ago
- Collection of evaluation code for natural language generation.☆12Jan 6, 2021Updated 5 years ago
- An extremely lightweight macOS menu bar widget with zero server-side deployment, designed to monitor the status of NVIDIA GPUs on remote …☆29Jan 12, 2026Updated 3 months ago
- ☆27Mar 20, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments☆25Apr 8, 2025Updated last year
- ☆16Mar 17, 2025Updated last year
- Lyra: A Benchmark for Turducken-Style Code Generation☆15Apr 22, 2022Updated 4 years ago
- Implementation of "GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors" (ICLR 2024)☆14May 18, 2024Updated last year
- Official implementation of the WACV 2023 paper "Benchmarking Visual Localization for Autonomous Navigation".☆24Sep 25, 2023Updated 2 years ago
- 利用BERT预训练模型进行文本生成,可用于对话、摘要、问题生成等任务。 目前支持策略,词表的插入和删除、自定义Character Embedding、随机词替换等☆10Jun 1, 2022Updated 3 years ago
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆26May 22, 2024Updated last year
- IROS☆18Aug 10, 2025Updated 8 months ago
- ☆13Sep 6, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [NeurIPS 2025] Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling☆27Dec 16, 2025Updated 4 months ago
- Learning Matchable Image Transformations☆13Sep 10, 2019Updated 6 years ago
- IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization☆12Nov 23, 2021Updated 4 years ago
- 基于 LangChain 生态与混合检索技术构建的智能化学术研究辅助平台,旨在提供高效、精准的文献深度分析能力。系统支持 PDF/TXT/DOCX 多格式学术文献上传,通过 BGE-Small-ZH 向量嵌入模型与 BM25 关键词检索融合的混合检索策略,实现跨文档语义关联…☆20Aug 28, 2025Updated 8 months ago
- The code and data for "Summary-Oriented Vision Modeling for Multimodal Abstractive Summarization"☆11May 16, 2023Updated 2 years ago
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer'☆10Jan 23, 2024Updated 2 years ago
- ☆11Jan 3, 2023Updated 3 years ago
- ☆45Mar 22, 2024Updated 2 years ago
- ☆14Jan 10, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Oct 9, 2022Updated 3 years ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models☆19Jul 12, 2023Updated 2 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning…☆11Feb 6, 2023Updated 3 years ago
- Cross-Perspective Topic Modeling☆11Oct 27, 2017Updated 8 years ago
- 吴恩达《机器学习》课后习题 Python 版 These are Exercises for Coursera's MachineLearning (by Andrew Ng) by Python.☆11Oct 26, 2018Updated 7 years ago
- Here is the repo for public scripts.☆12Jul 16, 2022Updated 3 years ago
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning.☆19Jun 7, 2024Updated last year
- ☆10Oct 4, 2022Updated 3 years ago
- Logo detection in images using SSD☆10Jul 13, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation for ACL 2024 paper "Meta-Task Prompting Elicits Embeddings from Large Language Models"☆12Jul 25, 2024Updated last year
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering☆11Mar 10, 2023Updated 3 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages"☆49Dec 7, 2022Updated 3 years ago
- [KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".☆11Jun 14, 2023Updated 2 years ago
- Dataset of spoken conversational search utterances☆14Aug 27, 2021Updated 4 years ago
- search-rattailcollagen1 created by GitHub Classroom☆10Jan 17, 2021Updated 5 years ago
- Boundaries and Region Representation Fusion☆12Mar 24, 2023Updated 3 years ago