wbbeyourself / arxiv_paper_downloader
Daily arXiv paper downloader that manages papers with a Markdown preview.
☆33 Updated 8 months ago
Alternatives and similar repositories for arxiv_paper_downloader:
Users who are interested in arxiv_paper_downloader are comparing it to the libraries listed below
- Feeling confused about super alignment? Here is a reading list ☆42 Updated last year
- ☆80 Updated last year
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs. ☆60 Updated 5 months ago
- AI Alignment: A Comprehensive Survey ☆133 Updated last year
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models. ☆87 Updated 2 weeks ago
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆158 Updated 9 months ago
- ☆33 Updated last month
- ☆98 Updated 5 months ago
- [NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models ☆57 Updated 3 months ago
- ☆45 Updated 9 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆78 Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆165 Updated 2 months ago
- ☆64 Updated 9 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆147 Updated 6 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆47 Updated 9 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆153 Updated 9 months ago
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models ☆81 Updated 9 months ago
- [ACL 2024 Findings] MathBench: A Comprehensive Multi-Level Difficulty Mathematics Evaluation Dataset ☆97 Updated 8 months ago
- Fantastic Data Engineering for Large Language Models ☆84 Updated 3 months ago
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆161 Updated last year
- Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales ☆32 Updated last year
- [SIGIR'24] The official implementation code of MOELoRA. ☆154 Updated 8 months ago
- Reference implementation for Token-level Direct Preference Optimization (TDPO) ☆130 Updated last month
- The code and data for the paper JiuZhang3.0 ☆43 Updated 10 months ago
- Code for Scaling Laws of RoPE-based Extrapolation ☆72 Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models ☆76 Updated last year
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision". ☆52 Updated 4 months ago
- Recent advancements propelled by large language models (LLMs), encompassing an array of domains including Vision, Audio, Agent, Robotics,… ☆121 Updated 3 weeks ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆45 Updated 3 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning ☆67 Updated 4 months ago