zhaochenyang20 / Prompt2Model-Self-Guide
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆25Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for Prompt2Model-Self-Guide
- We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆43Updated 2 weeks ago
- ☆50Updated 3 weeks ago
- ☆56Updated 2 weeks ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆46Updated last month
- FuseAI Project☆76Updated 2 months ago
- Code implementation of synthetic continued pretraining☆54Updated last month
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆52Updated 6 months ago
- ☆31Updated 7 months ago
- ☆78Updated 6 months ago
- ☆15Updated 3 months ago
- ☆34Updated 2 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆30Updated last month
- ☆37Updated last week
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆36Updated 8 months ago
- Codebase for Instruction Following without Instruction Tuning☆29Updated last month
- Reformatted Alignment☆112Updated last month
- ☆37Updated 4 months ago
- ☆77Updated last month
- The paper list of multilingual pre-trained models (Continual Updated).☆17Updated 4 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆32Updated 9 months ago
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆72Updated 7 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆37Updated 4 months ago
- The code and data of DPA-RAG☆49Updated last month
- Fantastic Data Engineering for Large Language Models☆49Updated 3 months ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆34Updated 10 months ago
- ☆13Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆123Updated 2 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆67Updated 5 months ago
- ☆55Updated this week
- ☆48Updated 8 months ago