sucv / paperCrawler
A Scrapy-based web spider that scrapes papers from top conferences and journals.
☆54Updated 9 months ago
Alternatives and similar repositories for paperCrawler
Users who are interested in paperCrawler are comparing it to the repositories listed below.
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆89Updated 2 years ago
- CVPR 2022, Robust Contrastive Learning against Noisy Views☆84Updated 3 years ago
- Meaningfully debugging model mistakes with conceptual counterfactual explanations. ICML 2022☆75Updated 3 years ago
- [ICCV 2023] CLR: Channel-wise Lightweight Reprogramming for Continual Learning☆33Updated last year
- ✨ Official code for our paper: "Uncertainty-o: One Model-agnostic Framework for Unveiling Epistemic Uncertainty in Large Multimodal Model…☆18Updated 9 months ago
- ☆92Updated 2 years ago
- Implementation for Label Relation Graphs Enhanced Hierarchical Residual Network for Hierarchical Multi-Granularity Classification☆59Updated 3 years ago
- An efficient multi-modal instruction-following data synthesis tool and the official implementation of Oasis https://arxiv.org/abs/2503.08…☆35Updated 7 months ago
- Fully Open Framework for Democratized Multimodal Reinforcement Learning.☆33Updated 3 weeks ago
- [NeurIPS 2023] Text data, code and pre-trained models for paper "Improving CLIP Training with Language Rewrites"☆287Updated last year
- code for studying OpenAI's CLIP explainability☆37Updated 4 years ago
- [ACLW'24] LMPT: Prompt Tuning with Class-Specific Embedding Loss for Long-tailed Multi-Label Visual Recognition☆56Updated last year
- Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)☆188Updated 6 months ago
- [ICCV 2023] ViLLA: Fine-grained vision-language representation learning from real-world data☆46Updated 2 years ago
- Reproduction of LLaVA-v1.5 based on Llama-3-8b LLM backbone.☆65Updated last year
- MLLMSeg: Unlocking the Potential of MLLMs in Referring Expression Segmentation via a Light-weight Mask Decoder☆49Updated 4 months ago
- LiVT PyTorch Implementation.☆73Updated 2 years ago
- [NeurIPS 2024] Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning☆71Updated 11 months ago
- CaMML: Context-Aware MultiModal Learner for Large Models (ACL 2024 SAC Award)☆15Updated 7 months ago
- AAAI 2024: Visual Instruction Generation and Correction☆95Updated last year
- [NeurIPS 2023] DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models☆49Updated last year
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…☆82Updated 10 months ago
- Task Residual for Tuning Vision-Language Models (CVPR 2023)☆75Updated 2 years ago
- ☆21Updated 3 years ago
- (Pattern Recognition Letters 2023) PyTorch implementation of "Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer"☆45Updated 2 years ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆74Updated last year
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Updated last year
- Official implementation for the paper "Prompt Pre-Training with Over Twenty-Thousand Classes for Open-Vocabulary Visual Recognition"☆259Updated last year
- [CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents☆54Updated 7 months ago
- CLIP: Connecting Text and Image (Learning Transferable Visual Models From Natural Language Supervision)☆82Updated 4 years ago