Richar-Du / VirgoView external linksLinks
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆20May 27, 2025Updated 8 months ago
Alternatives and similar repositories for Virgo
Users that are interested in Virgo are comparing it to the libraries listed below
Sorting:
- ☆23Jan 16, 2024Updated 2 years ago
- Code for ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment☆11Feb 28, 2024Updated last year
- A Python tool for fetching citations from multiple sources.☆14Apr 30, 2025Updated 9 months ago
- [ICLR 2026] P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark☆47Jun 6, 2025Updated 8 months ago
- Official Code for Contrastive Learning with Counterfactual Explanations for Radiology Report Generation (ECCV 2024)☆16Apr 3, 2025Updated 10 months ago
- ☆15Sep 23, 2024Updated last year
- Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'☆19Jul 21, 2024Updated last year
- This is the official code for "Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning"☆19Jul 30, 2025Updated 6 months ago
- ☆25Feb 2, 2025Updated last year
- Code for the paper "ICON: Improving Inter-Report Consistency in Radiology Report Generation via Lesion-aware Mixup Augmentation" (EMNLP'2…☆17Dec 11, 2024Updated last year
- The source code of "Merging Experts into One: Improving Computational Efficiency of Mixture of Experts (EMNLP 2023)":☆44Apr 7, 2024Updated last year
- Implementation of the paper "CXR-IRGen: An Integrated Vision and Language Model for the Generation of Clinically Accurate Chest X-Ray Ima…☆21Jul 2, 2024Updated last year
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆24Oct 12, 2024Updated last year
- Encourage Medical LLM to engage in deep thinking similar to DeepSeek-R1.☆26Apr 24, 2025Updated 9 months ago
- ☆64Jan 4, 2026Updated last month
- Recent Advances on MLLM's Reasoning Ability☆26Apr 11, 2025Updated 10 months ago
- Ordered or Orderless: A Revisit for Video based Person Re-Identification (T-PAMI 2020)☆27Mar 25, 2020Updated 5 years ago
- Code for the paper "RECAP: Towards Precise Radiology Report Generation via Dynamic Disease Progression Reasoning" (EMNLP'23 Findings).☆28Jun 12, 2025Updated 8 months ago
- The official GitHub page for the survey paper "A Survey of RWKV".☆30Jan 7, 2025Updated last year
- [CVPR2024] DiffusionTrack: Point set Diffussion Model for Visual Object Tracking☆39Aug 20, 2025Updated 5 months ago
- Expert-level AI radiology report evaluator☆36Apr 1, 2025Updated 10 months ago
- [ACL 2025] Exploring Compositional Generalization of Multimodal LLMs for Medical Imaging☆38Jun 4, 2025Updated 8 months ago
- ☆36Jul 1, 2024Updated last year
- ☆35Nov 22, 2022Updated 3 years ago
- Deploy PhoBERT for Abstractive Text Summarization as REST API using StreamLit, Transformers by Hugging Face and PyTorch☆33Apr 8, 2021Updated 4 years ago
- [AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆42Apr 14, 2025Updated 10 months ago
- [EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization☆37Sep 24, 2024Updated last year
- ☆67Oct 31, 2025Updated 3 months ago
- ☆10Dec 16, 2023Updated 2 years ago
- ☆13Feb 23, 2023Updated 2 years ago
- Group-Group Loss Based Global-Regional Feature Learning for Vehicle Re-Identification☆12May 10, 2022Updated 3 years ago
- Website nhận diện và trích xuất thông tin từ Chứng Minh Nhân Dân☆11Oct 6, 2022Updated 3 years ago
- [ACL 2024] Multimodal Reasoning with Multimodal Knowledge Graph (pytorch implementation)☆18Jun 2, 2025Updated 8 months ago
- ☆40Mar 15, 2023Updated 2 years ago
- Official code: "Integrating Segment Anything Model derived boundary prior and high-level semantics for cropland extraction from high-reso…☆18May 26, 2025Updated 8 months ago
- ☆13Sep 23, 2022Updated 3 years ago
- Hierarchical Vision Transformers for Disease Progression Detection in Chest X-Ray Images☆11Jan 11, 2024Updated 2 years ago
- Official implementation of "Meta-Entity Driven Triplet Mining for Aligning Medical Vision-Language Models"☆14Mar 19, 2025Updated 10 months ago
- A simple baseline for Person ReID, it achieves 3rd place in VisDA2020 challenge.☆38Aug 21, 2020Updated 5 years ago