wbbeyourself / arxiv_paper_downloader
Arxiv daily paper downloader and manage papers with markdown preview.
☆32Updated 7 months ago
Alternatives and similar repositories for arxiv_paper_downloader:
Users that are interested in arxiv_paper_downloader are comparing it to the libraries listed below
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models☆54Updated 2 months ago
- AI Alignment: A Comprehensive Survey☆133Updated last year
- Feeling confused about super alignment? Here is a reading list☆42Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆154Updated last month
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit…☆114Updated 7 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆45Updated 3 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated last month
- The code and data for the paper JiuZhang3.0☆40Updated 8 months ago
- A Survey on the Honesty of Large Language Models☆53Updated 2 months ago
- A Self-Training Framework for Vision-Language Reasoning☆63Updated 3 weeks ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆110Updated 3 months ago
- Open-Pandora: On-the-fly Control Video Generation☆32Updated 2 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆75Updated last year
- ☆95Updated 4 months ago
- Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process☆25Updated 6 months ago
- ☆60Updated 8 months ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models☆157Updated 8 months ago
- [ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues☆68Updated 6 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆29Updated 7 months ago
- code for Scaling Laws of RoPE-based Extrapolation☆70Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆48Updated 2 months ago
- ☆80Updated last year
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆77Updated last year
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight)☆106Updated this week
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs☆96Updated last month
- Official completion of “Training on the Benchmark Is Not All You Need”.☆29Updated last month
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆117Updated 8 months ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆127Updated last week