Julia-LiuJ / NLFT
The official implementation of Natural Language Fine-Tuning
☆39Updated last month
Alternatives and similar repositories for NLFT:
Users who are interested in NLFT are comparing it to the repositories listed below
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆283Updated 9 months ago
- A Self-Training Framework for Vision-Language Reasoning☆63Updated 3 weeks ago
- WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge☆113Updated 3 months ago
- ☆123Updated 6 months ago
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆104Updated last month
- DPO training for Tongyi Qianwen (Qwen)☆31Updated 4 months ago
- ☆92Updated 7 months ago
- ☆61Updated this week
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆110Updated 3 months ago
- Efficient Multimodal Large Language Models: A Survey☆312Updated 6 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆129Updated 5 months ago
- ☆140Updated 5 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆347Updated last month
- ☆471Updated last month
- ☆91Updated 5 months ago
- Using Clash on Linux without sudo privileges☆61Updated 3 months ago
- ☆45Updated 4 months ago
- Pretrain, decay, and SFT a CodeLLM from scratch 🧙‍♂️☆36Updated 9 months ago
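- An efficient, fast arXiv paper crawler that downloads information on papers matching a specified time range, topic, and keywords to local storage, and translates their titles and abstracts into Chinese.☆69Updated 5 months ago
- Notes on multimodal knowledge for large language model (LLM) algorithm (application) engineers☆121Updated 9 months ago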
- A series of technical reports on Slow Thinking with LLM☆398Updated last week
- ☆318Updated last week
- This repository collects awesome surveys, resources, and papers on Lifelong Learning for Large Language Models. (Updated Regularly)☆38Updated last week
- ☆106Updated 6 months ago
- Survey on Data-centric Large Language Models☆76Updated 7 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆128Updated last week
- An RLHF Infrastructure for Vision-Language Models☆159Updated 3 months ago
- The related works and background techniques about OpenAI o1☆210Updated last month