wutaiqiang/LLM_KD_AKL

☆22

Alternatives and similar repositories for LLM_KD_AKL

Users who are interested in LLM_KD_AKL are comparing it to the repositories listed below.

yang3121099 / LLM-Neo
Code for the paper "LLM-Neo: Parameter Efficient Knowledge Distillation for Large Language Models"
☆15 • Mar 2, 2025 • Updated last year
swtheing / PF-PPO-RLHF
☆34 • Sep 14, 2024 • Updated last year
Utaotao / ProFit
☆35 • Jan 20, 2026 • Updated 6 months ago
RUCAIBox / RLMEC
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆39 • Jan 12, 2024 • Updated 2 years ago
songmzhang / DSKD
Repo for the EMNLP'24 Paper "Dual-Space Knowledge Distillation for Large Language Models". A general white-box KD framework for both same…
☆63 • Mar 21, 2026 • Updated 4 months ago
jongwooko / distillm
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
☆266 • Mar 13, 2025 • Updated last year
AboveParadise / LLMCBench
☆28 • Dec 2, 2024 • Updated last year
juncaili / The-2nd-Chinese-Frame-Semantic-Parsing
☆17 • Mar 22, 2024 • Updated 2 years ago
DamienIrving / ocean-analysis
Code used for analysis and visualization of ocean model data during my postdoc
☆12 • Mar 1, 2023 • Updated 3 years ago
wzhuang-xmu / LoSA
[ICLR 2025] Official implementation of paper "Dynamic Low-Rank Sparse Adaptation for Large Language Models".
☆25 • Mar 16, 2025 • Updated last year
ShaojieJiang / tldr
Source code repo for the paper "TLDR: Token Loss Dynamic Reweighting for Reducing Repetitive Utterance Generation"
☆10 • Aug 11, 2023 • Updated 2 years ago
renjunxiang / oqmrc_2018
Code release for the AI Challenger 2018 reading comprehension track
☆20 • Dec 6, 2018 • Updated 7 years ago
swtheing / WizardCoder_Instruct_Generator
Generate the WizardCoder Instruct from the CodeAlpaca
☆21 • Jun 27, 2023 • Updated 3 years ago
Lyun0912-wu / LongAttn
LongAttn: Selecting Long-context Training Data via Token-level Attention
☆15 • Jul 16, 2025 • Updated last year
GanjinZero / GTS
Code for Unsupervised multi-granular Chinese word segmentation and term discovery via graph partition [JBI]
☆16 • Jan 28, 2022 • Updated 4 years ago
wyzjack / CNTP
[ACL 2025] Cautious Next Token Prediction
☆16 • Jul 24, 2025 • Updated last year
agarwalishika / DELIFT
☆16 • Feb 21, 2025 • Updated last year
OpenNLG / OpenBA-v2
OpenBA-V2: 3B LLM (Large Language Model) with T5 architecture, utilizing model pruning technique and continuing pretraining from OpenBA-1…
☆25 • May 10, 2024 • Updated 2 years ago
abdelfattah-lab / TokenButler
☆27 • May 12, 2026 • Updated 2 months ago
ZKI-PH-ImageAnalysis / Next-Generation-Loss
☆12 • Jan 8, 2025 • Updated last year
GAIR-NLP / BeHonest
BeHonest: Benchmarking Honesty in Large Language Models
☆35 • Aug 15, 2024 • Updated last year
IIGROUP / AutoIE2
[NLPCC 2021] Shared Task on AutoIE2: Sub-Event Identification
☆14 • Jul 19, 2021 • Updated 5 years ago
TianHongZXY / qaap
[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions
☆12 • Dec 18, 2023 • Updated 2 years ago
ghwang-s / abkd
ICML 2025 Oral: ABKD: Pursuing a Proper Allocation of the Probability Mass in Knowledge Distillation via α-β-Divergence
☆46 • Aug 8, 2025 • Updated 11 months ago
RyanLiu112 / compute-optimal-tts
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
☆288 • Feb 19, 2025 • Updated last year
nambo / menu-rag
Beyond Basic RAG, Empowering Real-Time Deep Research
☆20 • Sep 12, 2025 • Updated 10 months ago
Adaxry / Post-Instruction
☆21 • Sep 5, 2023 • Updated 2 years ago
ModelTC / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆39 • Mar 11, 2024 • Updated 2 years ago
danyang-liu / NewsGraphRec
☆14 • Sep 11, 2019 • Updated 6 years ago
yongzhuo / pytorch-loss
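PyTorch loss functions, adapted from the Scientific Spaces (科学空间) articles "Mitigating class imbalance through the idea of mutual information" and "Generalizing 'softmax + cross-entropy' to multi-label classification"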
☆12 • Aug 22, 2021 • Updated 4 years ago
baoy-nlp / DSS-VAE-pytorch
Generating Sentences from Disentangled Syntactic and Semantic Spaces
☆11 • Jun 24, 2019 • Updated 7 years ago
hemingkx / TokenSkip
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆224 • Nov 30, 2025 • Updated 7 months ago
gcosne / OceanographyProject
Today, satellites provide a surface signature of the temperature with a high spatial frequency, i.e. a good horizontal resolution but a low …
☆13 • Oct 23, 2019 • Updated 6 years ago
rlin27 / DeBut
Code for the paper "Deformable Butterfly: A Highly Structured and Sparse Linear Transform".
☆16 • Nov 1, 2021 • Updated 4 years ago
MIRALab-USTC / LLM-AttentionPredictor
The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch…
☆29 • Jul 15, 2025 • Updated last year
lancopku / DAN
[Findings of EMNLP 2022] Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks
☆13 • Feb 26, 2023 • Updated 3 years ago
schauppi / Self-Rewarding-Language-Models
☆50 • May 13, 2024 • Updated 2 years ago
StarDewXXX / Awesome-Hybrid-CoT-Reasoning
☆62 • Jun 7, 2025 • Updated last year
Zce1112zslx / IKE
☆41 • Nov 30, 2023 • Updated 2 years ago