TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)
☆15 · Jun 14, 2025 · Updated last year
Alternatives and similar repositories for topviewrs
Users who are interested in topviewrs are comparing it to the repositories listed below.
- The official repository for Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapte… ☆17 · Jan 15, 2024 · Updated 2 years ago
- Code repo for FaStfact: Faster, Stronger Long-Form Factuality Evaluations in LLMs. ☆31 · Nov 5, 2025 · Updated 7 months ago
- Survival of the Most Influential Prompts: Efficient Black-Box Prompt Search via Clustering and Pruning (Zhou et al.; EMNLP 2023 Findings) ☆17 · Feb 17, 2024 · Updated 2 years ago
- ☆21 · Jul 5, 2024 · Updated last year
- ☆23 · May 25, 2023 · Updated 3 years ago
- PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation" ☆19 · Aug 5, 2021 · Updated 4 years ago
- Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty ☆23 · Dec 11, 2023 · Updated 2 years ago
- [ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting ☆29 · Dec 16, 2024 · Updated last year
- Improving Word Translation via Two-Stage Contrastive Learning (ACL 2022). Keywords: Bilingual Lexicon Induction, Word Translation, Cross-… ☆36 · Jan 23, 2025 · Updated last year
- Test-Time Label-Shift Adaptation ☆13 · May 24, 2023 · Updated 3 years ago
- ☆19 · Jun 29, 2025 · Updated 11 months ago
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins ☆12 · Sep 20, 2024 · Updated last year
- Portal Tutorial ☆11 · Feb 3, 2018 · Updated 8 years ago
- Official PyTorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati… ☆35 · Apr 23, 2023 · Updated 3 years ago
- This repository contains 2 tools: - A py3 Lib for NLP & image-caption metrics - Code for a two-tailed t-test with paired samples. It wil… ☆18 · Apr 4, 2021 · Updated 5 years ago
- A method to automatically calibrate lidar and camera ☆21 · Jun 11, 2024 · Updated 2 years ago
- Score and Distribution Matching Policy: Advanced accelerated Visuomotor Policies via matched distillation ☆11 · May 9, 2025 · Updated last year
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation ☆68 · May 21, 2025 · Updated last year
- A multi-source cross-modal retrieval network ☆14 · Jan 8, 2024 · Updated 2 years ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024) ☆14 · Oct 3, 2024 · Updated last year
- ☆15 · May 12, 2025 · Updated last year
- Forcing Diffuse Distributions out of Language Models ☆18 · Sep 10, 2024 · Updated last year
- ☆10 · Nov 16, 2023 · Updated 2 years ago
- AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning (Zhou et al.; TACL 2024) ☆50 · Mar 17, 2024 · Updated 2 years ago
- A collection of research papers related to Natural Language Reasoning ☆10 · May 27, 2022 · Updated 4 years ago
- [NeurIPS 2024] Official code for "Variational Distillation of Diffusion Policies into Mixture of Experts" ☆17 · Dec 7, 2024 · Updated last year
- Language-Aligned Waypoint (LAW) Supervision for Vision-and-Language Navigation in Continuous Environments ☆13 · Nov 29, 2021 · Updated 4 years ago
- A human-annotated, fine-grained dataset for Vision-and-Language Navigation ☆17 · Jan 20, 2022 · Updated 4 years ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents ☆29 · Feb 17, 2026 · Updated 4 months ago
- [ICLR 2026] Official repository of "InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models". ☆116 · Feb 6, 2026 · Updated 4 months ago
- Materials for paper "Are Large Language Models Temporally Grounded?" ☆14 · Nov 16, 2023 · Updated 2 years ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems ☆37 · Nov 18, 2025 · Updated 7 months ago
- Memory experiments with LLMs ☆10 · Mar 31, 2023 · Updated 3 years ago
- Cost-sensitive multiclass classification with Adaptive Regularization of Weights ☆16 · Sep 12, 2016 · Updated 9 years ago
- RESTful API server template of Vert.x 3.x ☆13 · Oct 12, 2020 · Updated 5 years ago
- ☆21 · Aug 18, 2024 · Updated last year
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration ☆11 · Mar 11, 2024 · Updated 2 years ago
- ViBE: a hierarchical BERT model to identify viruses using metagenome sequencing data ☆11 · Sep 6, 2022 · Updated 3 years ago
- Fish4Knowledge dataset cleaning, UOE 4th Year Honours Project. ☆11 · Jun 13, 2018 · Updated 8 years ago