wbbeyourself / arxiv_paper_downloaderLinks
Arxiv daily paper downloader and manage papers with markdown preview.
☆39Updated last year
Alternatives and similar repositories for arxiv_paper_downloader
Users that are interested in arxiv_paper_downloader are comparing it to the libraries listed below
Sorting:
- Scaling Preference Data Curation via Human-AI Synergy☆135Updated 6 months ago
- AI Alignment: A Comprehensive Survey☆137Updated 2 years ago
- ☆125Updated last year
- [ICLR 2025] ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆130Updated 3 weeks ago
- Feeling confused about super alignment? Here is a reading list☆43Updated 2 years ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.☆163Updated 3 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆85Updated last year
- The implementation of paper "LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Fee…☆37Updated last year
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆77Updated 2 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆156Updated 6 months ago
- ☆87Updated 2 years ago
- ☆50Updated last year
- The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". And this paper is unde…☆76Updated 5 months ago
- P2P: Automated Paper-to-Poster Generation and Fine-Grained Benchmark☆46Updated 7 months ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆181Updated 10 months ago
- ☆153Updated 7 months ago
- ☆96Updated 2 years ago
- A Survey of Direct Preference Optimization (DPO)☆88Updated 6 months ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations☆142Updated 2 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆52Updated 7 months ago
- Extrapolating RLVR to General Domains without Verifiers☆190Updated 5 months ago
- VeriWeb: Verifiable Long-Chain Web Benchmark for Agentic Information-Seeking☆83Updated last month
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆389Updated 11 months ago
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)☆171Updated 2 months ago
- Token level visualization tools for large language models☆91Updated last year
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆76Updated last year
- ☆39Updated 10 months ago
- ☆53Updated 10 months ago
- [NeurIPS 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆128Updated 7 months ago
- A Comprehensive Survey on Long Context Language Modeling☆215Updated last month