pphuc25 / distil-cdLinks
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
☆35Updated last year
Alternatives and similar repositories for distil-cd
Users that are interested in distil-cd are comparing it to the libraries listed below
Sorting:
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆27Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated last year
- ⚡ LLaMA-2 model experiment☆12Updated last year
- Who's Who: Large Language Models Meet Knowledge Conflicts in Practice (EMNLP 2024 Findings)☆10Updated 4 months ago
- ☆14Updated 2 years ago
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆40Updated 2 weeks ago
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆33Updated 9 months ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models☆52Updated 4 months ago
- ☆18Updated 6 months ago
- ☆70Updated last year
- ☆34Updated last month
- Model merging is a highly efficient approach for long-to-short reasoning.☆65Updated 3 weeks ago
- Pioneering in Vietnamese Multimodal Large Language Model☆47Updated 5 months ago
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆54Updated 7 months ago
- contrastive decoding☆201Updated 2 years ago
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆63Updated 6 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆222Updated 3 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 11 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆112Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆160Updated this week
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆78Updated 5 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆73Updated last month
- ☆24Updated 3 months ago
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆105Updated 4 months ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆38Updated last year
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don't…☆114Updated 11 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning☆70Updated 7 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆89Updated 7 months ago
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆63Updated 8 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year