pphuc25 / distil-cd
Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation
☆33Updated last year
Alternatives and similar repositories for distil-cd:
Users that are interested in distil-cd are comparing it to the libraries listed below
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆94Updated last year
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆35Updated last month
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 9 months ago
- ⚡ LLaMA-2 model experiment☆13Updated last year
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…☆13Updated 2 months ago
- RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)☆30Updated 5 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 8 months ago
- ☆13Updated 2 years ago
- ☆64Updated 10 months ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023☆55Updated last year
- ☆78Updated this week
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated last month
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆78Updated 8 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆86Updated 4 months ago
- ☆75Updated last month
- ☆14Updated last year
- Machine Reading Comprehension special for the Vietnamese language☆40Updated 2 years ago
- ☆167Updated last year
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving☆24Updated 10 months ago
- ☆253Updated last year
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆77Updated last month
- [NeurIPS DB Track, 2024] MATH-Vision dataset and code to measure multimodal mathematical reasoning capabilities.☆89Updated this week
- MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria☆64Updated 4 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆193Updated 5 months ago
- [EMNLP 2023] The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation☆89Updated 6 months ago
- ☆17Updated 3 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆72Updated last month
- Training code for Baby-Llama, our submission to the strict-small track of the BabyLM challenge.☆78Updated last year