liyucheng09/llm-compressive

Longitudinal Evaluation of LLMs via Data Compression

☆32

Alternatives and similar repositories for llm-compressive

Users that are interested in llm-compressive are comparing it to the libraries listed below.

liyucheng09 / LatestEval
Latest Evaluation Toolkit (LatestEval). Assessing the language models with latest, uncontaminated materials.
☆29 · Feb 17, 2025 · Updated last year
yuzhenmao / IceFormer
Implementation for IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).
☆25 · Jun 9, 2026 · Updated last month
SkyworkAI / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆17 · Jun 3, 2024 · Updated 2 years ago
iboB / git-lfs-download
Download full or partial git-lfs repos without temporarily using 2x disk space
☆32 · Oct 13, 2023 · Updated 2 years ago
shreyansh26 / An-Empirical-Model-of-Large-Batch-Training
An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST
☆11 · Nov 19, 2022 · Updated 3 years ago
cnunlp / Chinese-Simile-Recognition-Dataset
A Chinese simile recognition dataset of "Xiang".
☆24 · Oct 5, 2022 · Updated 3 years ago
trestad / mitigating-reversal-curse
Code for paper 'Are We Falling in a Middle-Intelligence Trap? An Analysis and Mitigation of the Reversal Curse'
☆14 · Aug 2, 2024 · Updated last year
feifeibear / ChituAttention
Quantized Attention on GPU
☆45 · Nov 22, 2024 · Updated last year
ClubieDong / QAQ-KVCacheQuantization
QAQ: Quality Adaptive Quantization for LLM KV Cache
☆55 · Mar 27, 2024 · Updated 2 years ago
abacusai / smaug
☆77 · Feb 22, 2024 · Updated 2 years ago
cofe-ai / Mu-scaling
Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales
☆32 · Jul 17, 2023 · Updated 3 years ago
dunzeng / MORE
Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment
☆16 · Aug 6, 2024 · Updated last year
BBuf / flash-rwkv
☆32 · May 26, 2024 · Updated 2 years ago
co0ontty / pocdb
my poc
☆16 · Oct 28, 2020 · Updated 5 years ago
hal-314 / fastai-batch-size-finder
Implementation of OpenAI paper with Simple Noise Scale on Fastai V2
☆19 · Apr 16, 2021 · Updated 5 years ago
0xWelt / VibeRL
VibeRL is a Reinforcement Learning framework built essentially through vibe coding with Kimi K2.
☆17 · Jul 20, 2026 · Updated last week
recursal / minmodmon
Mini Model Daemon
☆13 · Nov 9, 2024 · Updated last year
hpc203 / CoupledTPS-opencv-dnn
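Deploys CoupledTPS with OpenCV, covering three models: portrait correction, rectangling of images with irregular boundaries, and rotated-image correction. As before, both C++ and Python versions are included.
☆21 · Jul 4, 2024 · Updated 2 years ago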
zhzihao / WikiGenBench
WIKIGENBENCH: Exploring Full-length Wikipedia Generation under Real-World Scenario (COLING 2025)
☆13 · Jan 5, 2025 · Updated last year
zxytim / arithmetic-encoding-compression
☆11 · Apr 3, 2023 · Updated 3 years ago
computer-animation-perception-group / DeepDance_train
Training code repo of the paper "DeepDance: Music-to-Dance Motion Choreography with Adversarial Learning"
☆11 · May 18, 2021 · Updated 5 years ago
chenllliang / CTDNN
MMM 2021: Crossed-Time Delay Neural Network for Speaker Recognition
☆11 · Dec 4, 2021 · Updated 4 years ago
pany8125 / ShareGPTQAExtractor-mnbvc
MNBVC project: ShareGPT corpus cleaning
☆16 · Oct 4, 2023 · Updated 2 years ago
jiamingkong / rwkv_reward
Training a reward model for RLHF using RWKV.
☆15 · Jun 5, 2023 · Updated 3 years ago
youkaichao / mnist-wrong-test
Test images with inappropriate labels in the MNIST dataset
☆10 · Mar 3, 2018 · Updated 8 years ago
ssbuild / aigc_evals
aigc evals
☆10 · Dec 2, 2023 · Updated 2 years ago
JiangYanting / English_books_classification_Program
A small program that automatically labels English-language literature with Chinese Library Classification codes
☆13 · Oct 29, 2024 · Updated last year
Triang-jyed-driung / rwkv7mini
RWKV-7 mini
☆12 · Mar 29, 2025 · Updated last year
choe-hyonsu-gabrielle / korean-amr-corpus
Korean Abstract Meaning Representation (AMR) Corpus
☆10 · Feb 27, 2022 · Updated 4 years ago
Wangpeiyi9979 / HCL-Text2AMR
Code for ACL22 short paper "Hierarchical Curriculum Learning for AMR Parsing"
☆13 · Jun 1, 2022 · Updated 4 years ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11 · Mar 1, 2024 · Updated 2 years ago
pkunlp-icler / MLS
Source code of our paper "Focus on the Target’s Vocabulary: Masked Label Smoothing for Machine Translation" @ ACL 2022
☆13 · Apr 13, 2022 · Updated 4 years ago
Bruce-Lee-LY / decoding_attention
Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.
☆48 · Jun 11, 2025 · Updated last year
SmerkyG / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆16 · Dec 9, 2025 · Updated 7 months ago
tuhinjubcse / MetaphorGenNAACL2021
Code for MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding
☆11 · May 2, 2022 · Updated 4 years ago
chenllliang / ParetoMNMT
Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023
☆17 · Sep 27, 2023 · Updated 2 years ago
qianyu-wang-flexport / ABSA_AE_BERT_Pytorch
Uses a BERT model for multi-task learning, including the ABSA (aspect-based sentiment analysis) task and the AE (aspect extraction) task
☆10 · May 31, 2019 · Updated 7 years ago
mayank31398 / ladder-residual-inference
☆14 · Jul 13, 2025 · Updated last year
xmk2222 / TsinghuaDailyReport
Automatic daily submission of the Tsinghua University student health and travel status report
☆14 · Jan 30, 2021 · Updated 5 years ago