susumuota / nano-askllmLinks
Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.
☆12Updated last year
Alternatives and similar repositories for nano-askllm
Users that are interested in nano-askllm are comparing it to the libraries listed below
Sorting:
- ☆25Updated 10 months ago
- This is the official repository for Inheritune.☆113Updated 7 months ago
- List of papers on Self-Correction of LLMs.☆76Updated 9 months ago
- ☆15Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Unofficial Implementation of Evolutionary Model Merging☆39Updated last year
- FuseAI Project☆87Updated 8 months ago
- ☆74Updated last year
- Unofficial implementation of AlpaGasus☆93Updated 2 years ago
- ☆127Updated 11 months ago
- たまに追加される論文メモ☆49Updated this week
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models☆119Updated last year
- Teacher - student distillation using DeepSpeed☆19Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)☆148Updated last year
- ☆154Updated last year
- Code for KaLM-Embedding models☆91Updated 2 months ago
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆80Updated last year
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆190Updated last year
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆56Updated last year
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023☆36Updated last year
- RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models☆55Updated last year
- Reformatted Alignment☆113Updated last year
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Updated 10 months ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆55Updated 11 months ago
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆45Updated 3 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆49Updated last month
- ☆23Updated last year
- [NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.☆153Updated last year
- ☆69Updated 11 months ago
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 7 months ago