pphuc25 / distil-cd
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
☆33Updated 11 months ago
Alternatives and similar repositories for distil-cd:
Users that are interested in distil-cd are comparing it to the libraries listed below
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆92Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆21Updated 7 months ago
- ⚡ LLaMA-2 model experiment☆13Updated last year
- ☆64Updated 9 months ago
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆35Updated 3 weeks ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 9 months ago
- ☆12Updated 2 years ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated 9 months ago
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆31Updated 4 months ago
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…☆13Updated last month
- Machine Reading Comprehension special for the Vietnamese language☆40Updated 2 years ago
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated 3 weeks ago
- [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia☆158Updated 6 months ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆54Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆69Updated last year
- Use LoRA technique to improve training Large Language Model☆12Updated last year
- ☆75Updated last month
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆56Updated 4 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆84Updated 3 months ago
- ☆14Updated last year
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆141Updated 5 months ago
- ☆17Updated 2 months ago
- [NeurIPS 2024 Main Track] Code for the paper titled "Instruction Tuning With Loss Over Instructions"☆35Updated 8 months ago
- Supervised Instruction Finetuning for Mistral 7B using Dolly-15K dataset☆8Updated 9 months ago
- Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.☆77Updated last year
- ☆165Updated last year
- ☆64Updated 2 weeks ago
- [ICLR 2025] 🚀 CodeMMLU Evaluator: A framework for evaluating LM models on CodeMMLU MCQs benchmark.☆17Updated 2 months ago
- Multilingual Large Language Models Evaluation Benchmark☆117Updated 6 months ago