Proposed fuzzy reward model with GRPO to improve VLM's abilities in crowd counting task.
☆21Apr 11, 2025Updated 11 months ago
Alternatives and similar repositories for CrowdVLM-R1
Users that are interested in CrowdVLM-R1 are comparing it to the libraries listed below
Sorting:
- Decoupled Memory Selection for Multi-target Video Segmentation of SAM3☆39Jan 16, 2026Updated 2 months ago
- (CVPR25) Exploring Contextual Attribute Density in Referring Expression Counting☆18Dec 3, 2025Updated 3 months ago
- Segmentation assisted U-shaped multi-scale transformer for crowd counting☆22Jun 9, 2024Updated last year
- ☆25Aug 1, 2023Updated 2 years ago
- 2022年春复旦大学大二下 组成与体系结构实验☆15Feb 20, 2023Updated 3 years ago
- [SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…☆34Mar 10, 2026Updated last week
- 3D Printed Motorised Traveling Equatorial mount☆11Jul 9, 2020Updated 5 years ago
- demo code for "Video Prediction via Selective Sampling" (NeurIPS 2018)☆12Jul 15, 2020Updated 5 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Apr 30, 2021Updated 4 years ago
- PyTorch implementation for our ICLR 2025 paper State Space Model Meets Transformer: A New Paradigm for 3D Object Detection☆40Mar 27, 2025Updated 11 months ago
- BootStrap + Django3 + simpleUI 实现的个人博客☆12Sep 22, 2021Updated 4 years ago
- Unlocking the Essence of Beauty: Advanced Aesthetic Reasoning with Relative-Absolute Policy Optimization☆23Jan 27, 2026Updated last month
- ☆10Jul 21, 2023Updated 2 years ago
- ☆10Nov 6, 2024Updated last year
- ☆14Aug 31, 2023Updated 2 years ago
- Token-free Language Modeling with ByGPT5 & Friends!☆12Jul 18, 2025Updated 8 months ago
- Implementation of ReRAW: RGB-to-RAW Image Reconstruction via Stratified Sampling for Efficient Object Detection on the Edge☆28Sep 24, 2025Updated 5 months ago
- Deep Sea Robotic Imaging Simulator☆16Oct 14, 2022Updated 3 years ago
- ☆14Jul 15, 2025Updated 8 months ago
- Official implementation of "Weakly-supervised positional contrastive learning: application to cirrhosis classification", MICCAI 2023☆11Dec 16, 2025Updated 3 months ago
- Tunisian Arabish Corpus☆12Mar 12, 2024Updated 2 years ago
- some operations for reserve☆17Apr 5, 2022Updated 3 years ago
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"☆11Sep 20, 2024Updated last year
- Display something on an analog oscilloscope☆11Oct 30, 2018Updated 7 years ago
- Curated list of Moroccans publishing in the most prestigious AI conferences☆10Oct 14, 2024Updated last year
- Source code related to the research paper entitled RVENet: A Large Echocardiographic Dataset for the Deep Learning-Based Assessment of Ri…☆12Mar 10, 2024Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 2 years ago
- Latex template for poster☆12Sep 6, 2023Updated 2 years ago
- Build TVM docker image for production compilation deployments☆12Sep 7, 2021Updated 4 years ago
- ☆15Oct 20, 2023Updated 2 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- Domain adaptation framework for segmentation via reinforcement learning.☆13Oct 13, 2025Updated 5 months ago
- Implementation of Contrastive Predictive Coding for Natural Language☆10Sep 16, 2020Updated 5 years ago
- WBSR: Rethinking Imbalance in Image Super-Resolution for Efficient Inference☆13Oct 8, 2024Updated last year
- License Plate Recognition based on semantic segmentation approach using U-Net☆13Dec 5, 2019Updated 6 years ago
- Implementation of Pix2Seq in PyTorch☆10Feb 3, 2022Updated 4 years ago
- ☆10Dec 8, 2022Updated 3 years ago
- Unofficial implementation of ''BEDSR-Net: A Deep Shadow Removal from a Single Document Image'' with PyTorch☆11Jul 28, 2023Updated 2 years ago
- Diacritization of Arabic texts☆11Apr 13, 2016Updated 9 years ago