justchenhao / ChatDailyPapersLinks
Build a daily academic subscription pipeline! Get daily Arxiv papers and corresponding chatGPT summaries with pre-defined keywords. It is deployed on GitHub automated without the need for manual running locally.
☆43Updated 2 years ago
Alternatives and similar repositories for ChatDailyPapers
Users that are interested in ChatDailyPapers are comparing it to the libraries listed below
Sorting:
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆64Updated 7 months ago
- ☆72Updated 4 months ago
- MLLM @ Game☆14Updated 5 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning☆55Updated 4 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆93Updated 6 months ago
- [NeurIPS2023] Parameter-efficient Tuning of Large-scale Multimodal Foundation Model☆88Updated last year
- Code for paper: Reinforced Vision Perception with Tools☆52Updated last week
- The first attempt to replicate o3-like visual clue-tracking reasoning capabilities.☆57Updated 3 months ago
- ☆49Updated 2 months ago
- ☆33Updated 2 months ago
- Watch for idle GPUs and run your jobs: launches jobs in tmux, keeps logs/status and sends start/finish emails..☆79Updated 3 weeks ago
- Code for Retrieval-Augmented Perception (ICML 2025)☆57Updated 2 months ago
- This repository will continuously update the latest papers, technical reports, benchmarks about multimodal reasoning!☆52Updated 6 months ago
- ☆116Updated last year
- MedGen: Unlocking Medical Video Generation by Scaling Granularly-annotated Medical Videos.☆23Updated 3 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109Updated 4 months ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆57Updated 2 months ago
- ☆92Updated 9 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆143Updated 6 months ago
- [ACL 2025 Main] Multi-Agent System for Science of Science☆108Updated 2 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆86Updated 8 months ago
- Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing☆71Updated 2 months ago
- "what, how, where, and how well? a survey on test-time scaling in large language models" repository☆70Updated this week
- Collect the awesome works evolved around reasoning models like O1/R1 in visual domain☆41Updated 2 months ago
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆47Updated 11 months ago
- EMPO, A Fully Unsupervised RLVR Method☆67Updated last week
- [Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics]: VisuoThink: Empowering LVLM Reasoning with Mul…☆30Updated 2 months ago
- [AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning☆39Updated 5 months ago
- VeriGUI: Verifiable Long-Chain GUI Dataset☆81Updated 2 months ago
- [CVPR 2024] Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities☆99Updated last year