alwynpan / uom-comp90024
Demo Code for Subject COMP90024
☆10Updated last year
Alternatives and similar repositories for uom-comp90024:
Users that are interested in uom-comp90024 are comparing it to the libraries listed below
- official repo for paper "[CLS] Token Tells Everything Needed for Training-free Efficient MLLMs"☆11Updated 2 months ago
- Project Description☆22Updated 9 months ago
- [ICML 2024] "Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training"☆14Updated 8 months ago
- 如何做好科研写好科研文章?发顶刊顶会总结☆58Updated last year
- ☆75Updated 6 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆208Updated 3 weeks ago
- This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…☆146Updated 3 weeks ago
- It's not a list of papers, but a list of paper reading lists...☆138Updated last week
- [ICASSP 2024] VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders☆14Updated last week
- [ICML 2024] Visual-Text Cross Alignment: Refining the Similarity Score in Vision-Language Models☆16Updated 5 months ago
- [COLING 2024 (Oral)] PromISe:Releasing the Capabilities of LLMs with Prompt Introspective Search☆19Updated 5 months ago
- Official repo of VLABench, a large scale benchmark designed for fairly evaluating VLA, Embodied Agent, and VLMs.☆120Updated last week
- Yet Another Academic Homepage Template☆19Updated last month
- [TPAMI reviewing] Towards Visual Grounding: A Survey☆91Updated last week
- ☆37Updated 3 months ago
- The Official Implementation of RoboMatrix☆80Updated last month
- Awesome paper for multi-modal llm with grounding ability☆14Updated 6 months ago
- A paper list of my history reading. Robotics, Learning, Vision.☆336Updated this week
- [AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models☆190Updated last year
- (ECCV 2024) Official repository of paper "Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection"☆14Updated 2 weeks ago
- Latest Advances on Embodied Multimodal LLMs (or Vison-Language-Action Models).☆96Updated 7 months ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆145Updated 2 months ago
- ☆10Updated 4 months ago
- official implementation of "Interpreting CLIP's Image Representation via Text-Based Decomposition"☆187Updated 2 months ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆51Updated 7 months ago
- The official implementation for "Towards Physically-Realizable Adversarial Attacks in Embodied Vision Navigation"☆18Updated 3 months ago
- ☆339Updated 10 months ago