wutaiqiang / LLM_KD_AKLView external linksLinks
☆22Oct 22, 2024Updated last year
Alternatives and similar repositories for LLM_KD_AKL
Users that are interested in LLM_KD_AKL are comparing it to the libraries listed below
Sorting:
- LongAttn :Selecting Long-context Training Data via Token-level Attention☆15Jul 16, 2025Updated 7 months ago
- ☆21Jul 3, 2025Updated 7 months ago
- ☆16Mar 22, 2024Updated last year
- ☆16Jul 23, 2024Updated last year
- Research work aimed at addressing the problem of modeling infinite-length context☆46Dec 18, 2025Updated last month
- ☆21Sep 5, 2023Updated 2 years ago
- AI Challenger 2018 阅读理解赛道代码分享☆20Dec 6, 2018Updated 7 years ago
- RAN: Recurrent Attention Networks for Long-text Modeling | Findings of ACL23☆23Aug 12, 2023Updated 2 years ago
- Generate the WizardCoder Instruct from the CodeAlpaca☆21Jun 27, 2023Updated 2 years ago
- OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…☆25May 10, 2024Updated last year
- Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…☆61Aug 26, 2025Updated 5 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆39Jan 12, 2024Updated 2 years ago
- ☆34Sep 14, 2024Updated last year
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆250Mar 13, 2025Updated 11 months ago
- ☆41Nov 30, 2023Updated 2 years ago
- ☆11Aug 20, 2025Updated 5 months ago
- ☆11Aug 3, 2023Updated 2 years ago
- pytorch版损失函数,改写自科学空间文章,【通过互信息思想来缓解类别不平衡问题】、【将“softmax+交叉熵”推广到多标签分类问题】☆12Aug 22, 2021Updated 4 years ago
- This repo is the artifact of FUEL☆13Dec 2, 2025Updated 2 months ago
- [EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions☆12Dec 18, 2023Updated 2 years ago
- ☆12Nov 25, 2023Updated 2 years ago
- ☆87Dec 29, 2023Updated 2 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 3 years ago
- Source code for ISSTA'24 paper "AI Coders Are Among Us: Rethinking Programming Language Grammar Towards Efficient Code Generation"☆12Oct 21, 2024Updated last year
- This repository contains 4000 vulnerable hardware designs. Currently this is in Jsonl format for directly using it for fine-tuning LLMs. …☆20Mar 25, 2025Updated 10 months ago
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆11Dec 30, 2024Updated last year
- The sparse Bayesian learning sandbox☆11Jul 4, 2021Updated 4 years ago
- Python3入门机器学习 经典算法与应用 学习☆11Nov 9, 2018Updated 7 years ago
- The code for AAAI 2025 “Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation”☆15Jan 3, 2025Updated last year
- Official PyTorch implementation of DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs (ICML 2025 Oral)☆57Jun 27, 2025Updated 7 months ago
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆201Nov 30, 2025Updated 2 months ago
- MetaLadder: Ascending Mathematical Solution Quality via Analogical-Problem Reasoning Transfer (EMNLP 2025)☆11Apr 18, 2025Updated 9 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆46Aug 13, 2025Updated 6 months ago
- DocChecker: Bootstrapping Code-Text Pretrained Language Model to Detect Inconsistency Between Code and Comment☆15Jan 23, 2024Updated 2 years ago
- [NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification☆14Jul 19, 2021Updated 4 years ago
- Learning MLPs to replace GNN☆10Jun 3, 2023Updated 2 years ago
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- 本项目提供了面向中文的XLNet预训练模型,旨在丰富中文自然语言处理资源,提供多元化的中文预训练模型选择。 我们欢迎各位专家学者下载使用,并共同促进和发展中文资源建设。☆11May 30, 2023Updated 2 years ago
- The code of COMMA: Modeling Relationship among Motivations, Emotions and Actions in Language-based Human Activities. https://aclanthology…☆12Oct 12, 2022Updated 3 years ago