l294265421 / my-llm
All about large language models
☆51 Updated last year
Alternatives and similar repositories for my-llm
Users who are interested in my-llm are comparing it to the libraries listed below
- [ICML 2024] Selecting High-Quality Data for Training Language Models ☆188 Updated last year
- Code implementation of synthetic continued pretraining ☆133 Updated 9 months ago
- ☆105 Updated 2 months ago
- AI Alignment: A Comprehensive Survey ☆134 Updated last year
- Implementation of ICML 23 Paper: Specializing Smaller Language Models towards Multi-Step Reasoning. ☆133 Updated 2 years ago
- ☆140 Updated 2 years ago
- [ICLR 2025] 🧬 RegMix: Data Mixture as Regression for Language Model Pre-training (Spotlight) ☆171 Updated 7 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation. ☆139 Updated 5 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know? ☆83 Updated last year
- Project for the paper entitled `Instruction Tuning for Large Language Models: A Survey` ☆188 Updated last month
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆163 Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct ☆186 Updated 8 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning ☆176 Updated 3 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models ☆266 Updated last year
- Fantastic Data Engineering for Large Language Models ☆90 Updated 9 months ago
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$ ☆48 Updated 11 months ago
- Benchmarking Complex Instruction-Following with Multiple Constraints Composition (NeurIPS 2024 Datasets and Benchmarks Track) ☆94 Updated 7 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"… ☆80 Updated 2 years ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing". ☆82 Updated 8 months ago
- MathEval is a benchmark dedicated to the holistic evaluation on mathematical capacities of LLMs. ☆83 Updated 10 months ago
- Code for ACL2023 paper: Pre-Training to Learn in Context ☆107 Updated last year
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆51 Updated 4 months ago
- ☆83 Updated last year
- Code for "Small Models are Valuable Plug-ins for Large Language Models" ☆131 Updated 2 years ago
- EMNLP'23 survey: a curation of awesome papers and resources on refreshing large language models (LLMs) without expensive retraining. ☆135 Updated last year
- ☆96 Updated 9 months ago
- Generative Judge for Evaluating Alignment ☆246 Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆69 Updated 4 months ago
- Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models) ☆193 Updated last year
- Repository for Label Words are Anchors: An Information Flow Perspective for Understanding In-Context Learning ☆165 Updated last year