pphuc25 / distil-cd
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
☆35Updated last year
Alternatives and similar repositories for distil-cd
Users that are interested in distil-cd are comparing it to the libraries listed below
Sorting:
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆37Updated 2 weeks ago
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…☆15Updated 4 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆95Updated last year
- ☆69Updated last year
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated last year
- ☆13Updated 2 years ago
- Who's Who: Large Language Models Meet Knowledge Conflicts in Practice (EMNLP 2024 Findings)☆10Updated 3 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 10 months ago
- Pioneering in Vietnamese Multimodal Large Language Model☆47Updated 3 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆215Updated 2 months ago
- ⚡ LLaMA-2 model experiment☆12Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆87Updated 6 months ago
- Machine Reading Comprehension special for the Vietnamese language☆40Updated 3 years ago
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆32Updated 7 months ago
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆23Updated 3 weeks ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆167Updated 9 months ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆56Updated last year
- ☆147Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆109Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆151Updated 8 months ago
- ☆257Updated last year
- contrastive decoding☆199Updated 2 years ago
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆123Updated last year
- ☆78Updated 4 months ago
- Large Language Models Can Self-Improve in Long-context Reasoning☆69Updated 5 months ago
- Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"☆68Updated 7 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"☆75Updated 5 months ago
- Multilingual Large Language Models Evaluation Benchmark☆123Updated 8 months ago
- ☆180Updated last year