Code for "Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text Generation"
☆25 · Mar 9, 2024 · Updated 2 years ago
Alternatives and similar repositories for VSD
Users that are interested in VSD are comparing it to the libraries listed below.
- ☆13 · Apr 3, 2026 · Updated last month
- [ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embo… ☆13 · Mar 20, 2023 · Updated 3 years ago
- Monitor Google Scholar author citation counts and track changes automatically without opening tabs. ☆69 · Updated this week
- Collection of evaluation code for natural language generation. ☆12 · Jan 6, 2021 · Updated 5 years ago
- An extremely lightweight macOS menu bar widget with zero server-side deployment, designed to monitor the status of NVIDIA GPUs on remote … ☆30 · Jan 12, 2026 · Updated 4 months ago
- ☆28 · Mar 20, 2023 · Updated 3 years ago
- Lyra: A Benchmark for Turducken-Style Code Generation ☆15 · Apr 22, 2022 · Updated 4 years ago
- Implementation of "GNeRP: Gaussian-guided Neural Reconstruction of Reflective Objects with Noisy Polarization Priors" (ICLR 2024) ☆14 · May 18, 2024 · Updated 2 years ago
- Official implementation of the WACV 2023 paper "Benchmarking Visual Localization for Autonomous Navigation". ☆24 · Sep 25, 2023 · Updated 2 years ago
- The released data for paper "Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models". ☆34 · Sep 16, 2023 · Updated 2 years ago
- Data preprocessing for IUPUI-CSRC Pedestrian Situated Intent (PSI) benchmark dataset. ☆11 · Oct 5, 2023 · Updated 2 years ago
- The implementation of Text Classification with Negative Supervision (ACL, 2020) ☆10 · Oct 8, 2020 · Updated 5 years ago
- Code for CascadeBERT, Findings of EMNLP 2021 ☆12 · Mar 30, 2022 · Updated 4 years ago
- ☆13 · Jul 20, 2022 · Updated 3 years ago
- IROS ☆17 · Aug 10, 2025 · Updated 9 months ago
- Extreme Multi-label Text Classification based on X-BERT with GCN and Clustering modules ☆11 · Nov 10, 2019 · Updated 6 years ago
- Code for the paper "Weakly-supervised learning of visual relations", ICCV17 ☆40 · Oct 20, 2017 · Updated 8 years ago
- ☆13 · Sep 6, 2022 · Updated 3 years ago
- Task-Focused Few-Shot Object Detection Benchmark ☆14 · Jun 24, 2025 · Updated 11 months ago
- [NeurIPS 2025] Few-Shot Learning from Gigapixel Images via Hierarchical Vision-Language Alignment and Modeling ☆27 · May 20, 2026 · Updated last week
- Prompt-Guided Retrieval For Non-Knowledge-Intensive Tasks ☆12 · Sep 1, 2023 · Updated 2 years ago
- Learning Matchable Image Transformations ☆13 · Sep 10, 2019 · Updated 6 years ago
- Keypoint annotation tool | Landmark-Annotation ☆14 · Jan 2, 2024 · Updated 2 years ago
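- An intelligent academic research assistant platform built on the LangChain ecosystem and hybrid retrieval, intended to provide efficient, accurate in-depth literature analysis. The system supports uploading academic literature in PDF/TXT/DOCX formats and uses a hybrid retrieval strategy that fuses the BGE-Small-ZH vector embedding model with BM25 keyword retrieval to achieve cross-document semantic association… ☆20 · Aug 28, 2025 · Updated 9 months ago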
- The official code and model for ACL 2023 paper 'mCLIP: Multilingual CLIP via Cross-lingual Transfer' ☆10 · Jan 23, 2024 · Updated 2 years ago
- ☆14 · Jan 10, 2024 · Updated 2 years ago
- ☆45 · Mar 22, 2024 · Updated 2 years ago
- ☆10 · Oct 9, 2022 · Updated 3 years ago
- GRAIN: Gradient-based Intra-attention Pruning on Pre-trained Language Models ☆19 · Jul 12, 2023 · Updated 2 years ago
- Code for "Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning" (EMNLP 2022) and "Empowering Parameter-Efficient Transfer Learning… ☆11 · Feb 6, 2023 · Updated 3 years ago
- Python versions of the exercises for Andrew Ng's Coursera course "Machine Learning". ☆11 · Oct 26, 2018 · Updated 7 years ago
- Here is the repo for public scripts. ☆12 · Jul 16, 2022 · Updated 3 years ago
- [ICCV 2023] With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning. ☆19 · Jun 7, 2024 · Updated last year
- Multi-model video-to-text by combining embeddings from Flan-T5 + CLIP + Whisper + SceneGraph. The 'backbone LLM' is pre-trained from scra… ☆54 · Apr 21, 2023 · Updated 3 years ago
- User data, including text descriptions, eyetracking data and Matlab code for visualizing it. ☆15 · Jan 12, 2016 · Updated 10 years ago
- ☆10 · Oct 4, 2022 · Updated 3 years ago
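- WeChat like collection and Moments like collection; supports multiple like-collection modes and quickly generates 1,000 likes. ☆18 · Mar 4, 2023 · Updated 3 years ago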
- [AAAI 2023] Official implementation of FiTs: Fine-grained Two-stage Training for Knowledge Base Question Answering ☆11 · Mar 10, 2023 · Updated 3 years ago
- [ICML 2022] Code and data for our paper "IGLUE: A Benchmark for Transfer Learning across Modalities, Tasks, and Languages" ☆49 · Dec 7, 2022 · Updated 3 years ago