pphuc25 / distil-cdLinks
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
☆35Updated last year
Alternatives and similar repositories for distil-cd
Users that are interested in distil-cd are comparing it to the libraries listed below
Sorting:
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆96Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆26Updated last year
- Who's Who: Large Language Models Meet Knowledge Conflicts in Practice (EMNLP 2024 Findings)☆10Updated 4 months ago
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆111Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆61Updated 5 months ago
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…☆14Updated 5 months ago
- ☆14Updated 2 years ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆26Updated last year
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆39Updated last month
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆33Updated 8 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆89Updated 7 months ago
- Multilingual Large Language Models Evaluation Benchmark☆122Updated 9 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 11 months ago
- ⚡ LLaMA-2 model experiment☆12Updated last year
- ☆18Updated 6 months ago
- ☆70Updated last year
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆37Updated last year
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆78Updated 4 months ago
- Organize the Web: Constructing Domains Enhances Pre-Training Data Curation☆52Updated last month
- ☆79Updated 4 months ago
- Model merging is a highly efficient approach for long-to-short reasoning.☆56Updated this week
- Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]☆106Updated 3 months ago
- Official Code Repository for LM-Steer Paper: "Word Embeddings Are Steers for Language Models" (ACL 2024 Outstanding Paper Award)☆112Updated 8 months ago
- My own implementation of "Fast Inference from Transformers via Speculative Decoding"☆10Updated last year
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆147Updated 2 months ago
- [ACL-25] We introduce ScaleQuest, a scalable, novel and cost-effective data synthesis method to unleash the reasoning capability of LLMs.☆63Updated 7 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆157Updated 9 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆76Updated 6 months ago
- ☆259Updated last year
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆26Updated last month