RO-ViT CVPR 2023 "Region-Aware Pretraining for Open-Vocabulary Object Detection with Vision Transformers"
☆17Aug 24, 2023Updated 2 years ago
Alternatives and similar repositories for rovit
Users that are interested in rovit are comparing it to the libraries listed below.
- ☆11Sep 15, 2023Updated 2 years ago
- Ranking-Consistent Language-Image Pretraining☆13Oct 24, 2025Updated 7 months ago
- A PyTorch implementation of Open Vocabulary Object Detection with Pseudo Bounding-Box Labels☆65Mar 27, 2023Updated 3 years ago
- ☆31Mar 2, 2023Updated 3 years ago
- ☆22Apr 27, 2024Updated 2 years ago
- Can 3D Vision-Language Models Truly Understand Natural Language?☆20Mar 28, 2024Updated 2 years ago
- ALIGN trained on COYO-dataset☆29Apr 30, 2024Updated 2 years ago
- A project demonstrating how to improve model accuracy by suppressing false positives using an assessor model☆10Aug 13, 2021Updated 4 years ago
- (NeurIPS2023) CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection☆123Apr 26, 2024Updated 2 years ago
- A project showcasing how to leverage AI coding assistants (Cursor, Claude Code, etc.) for accelerated NVIDIA DeepStream SDK application d…☆59May 14, 2026Updated last week
- [CVPRW 2025] Official repository of paper titled "Towards Evaluating the Robustness of Visual State Space Models"☆26Jun 8, 2025Updated 11 months ago
- WeChat Mini Program learning demo☆15Sep 14, 2018Updated 7 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- ☆11Feb 9, 2024Updated 2 years ago
- The official code of the ACCV 2024 paper "Image Deraining with Frequency-Enhanced State Space Model (DFSSM)"☆16Dec 6, 2025Updated 5 months ago
- [CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection☆186Oct 25, 2023Updated 2 years ago
- ☆17Mar 13, 2023Updated 3 years ago
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 Languages (ACL 2022)☆19May 17, 2022Updated 4 years ago
- Learning Open-World Object Proposals without Learning to Classify☆205Apr 8, 2022Updated 4 years ago
- ☆19Oct 4, 2024Updated last year
- ☆45Aug 14, 2023Updated 2 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- A Manchu dictionary website☆13Feb 26, 2026Updated 3 months ago
- DESSERT Efficiently Searches Sets of Embeddings via Retrieval Tables☆17Feb 21, 2024Updated 2 years ago
- Seq2seqAttGeneration, a basic implementation of text generation that uses a seq2seq attention model to generate poem series. This project…☆18Jan 11, 2021Updated 5 years ago
- The official implementation of MAGVLT: Masked Generative Vision-and-Language Transformer (CVPR'23)☆28Jan 20, 2024Updated 2 years ago
- Implementation for "Correcting Diffusion Generation through Resampling" [CVPR 2024]☆34Dec 12, 2023Updated 2 years ago
- 🈵 Collected resources to learn/study Manchu (Manchurian Language). An introduction to the Manchu language.☆19Jun 7, 2023Updated 2 years ago
- Repository of "Improving Cross-Modal Retrieval With Set of Diverse Embeddings" (CVPR'23, Highlight)☆41Nov 15, 2023Updated 2 years ago
- Wildfire detection on edge devices☆15Updated this week
- [ICCV 2023] RLIPv2: Fast Scaling of Relational Language-Image Pre-training☆137May 28, 2024Updated last year
- Official repository of paper "Subobject-level Image Tokenization" (ICML-25)☆93Jul 4, 2025Updated 10 months ago
- ☆19May 16, 2024Updated 2 years ago
- [AAAI2024] Code Release of CLIM: Contrastive Language-Image Mosaic for Region Representation☆30Feb 4, 2024Updated 2 years ago
- ☆24Jan 26, 2026Updated 4 months ago
- Example of a Retrieval-Augmented Generation with Postgres, pgvector, ollama, Llama3 and Go.☆19May 9, 2024Updated 2 years ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆45Oct 15, 2023Updated 2 years ago
- Object-Aware Distillation Pyramid for Open-Vocabulary Object Detection☆64Jan 6, 2026Updated 4 months ago