AkihikoWatanabe / paper_notes
Paper notes, added to occasionally
☆62 Updated last week
Alternatives and similar repositories for paper_notes
Users who are interested in paper_notes are comparing it to the repositories listed below
- List of papers on Self-Correction of LLMs. ☆81 Updated 11 months ago
- ☆94 Updated last year
- FuseAI Project ☆87 Updated 10 months ago
- Reformatted Alignment ☆113 Updated last year
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning ☆41 Updated last month
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆108 Updated 9 months ago
- ☆129 Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement ☆192 Updated last year
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings ☆58 Updated 10 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools" ☆182 Updated last month
- ☆120 Updated this week
- Scalable Meta-Evaluation of LLMs as Evaluators ☆43 Updated last year
- 🚢 Data Toolkit for Sailor Language Models ☆94 Updated 9 months ago
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆212 Updated 5 months ago
- [ICLR 2025 Oral] "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free" ☆83 Updated last year
- ☆75 Updated last year
- ☆16 Updated last year
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization ☆43 Updated 9 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆145 Updated last year
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning" ☆56 Updated last year
- ☆32 Updated last year
- ☆150 Updated last year
- ☆163 Updated 2 months ago
- ☆78 Updated 9 months ago
- RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models ☆55 Updated 2 years ago
- Benchmarking LLMs with Challenging Tasks from Real Users ☆246 Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024] ☆149 Updated last year
- A curated list on the role of small models in the LLM era ☆111 Updated last year
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆84 Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆36 Updated 2 years ago