USTC-StarTeam/ZIP

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/USTC-StarTeam/ZIP)

USTC-StarTeam / ZIP

arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.

☆28

Alternatives and similar repositories for ZIP

Users that are interested in ZIP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liutianlin0121 / decoding-time-realignment
View on GitHub
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21Jun 17, 2024Updated 2 years ago
yichengchen24 / DataChef
View on GitHub
☆25Feb 12, 2026Updated 5 months ago
tbh-98 / Hypergraph-MLP
View on GitHub
☆20Jan 9, 2024Updated 2 years ago
NJUDeepEngine / CAEF
View on GitHub
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11Oct 11, 2024Updated last year
zeroxleo / HyperGT
View on GitHub
The implementation of ICASSP 2024 lecture presentation paper "Hypergraph Transformer for Semi-Supervised Classification"
☆21Nov 23, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Farseer-Scaling-Law / Farseer
View on GitHub
☆21Jun 12, 2025Updated last year
yichengchen24 / MIG
View on GitHub
[ACL2025 Findings] Official code for MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Spac…
☆28Aug 30, 2025Updated 10 months ago
USTC-StarTeam / ChemEval
View on GitHub
ICLR 2026 | ChemEval: 4-level, 13-dimension, 62-task text/multimodal chemistry benchmark for evaluating LLMs and MLLMs.
☆33Jun 9, 2026Updated last month
yilunzhao / RobuT
View on GitHub
Data and code for ACL 2023 paper "RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations"
☆15Feb 8, 2024Updated 2 years ago
PKU-AICare / ConfAgents
View on GitHub
ConfAgents: A Conformal-Guided Multi-Agent Framework for Cost-Efficient Medical Diagnosis
☆15Updated this week
PKU-ML / LongPPL
View on GitHub
Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"
☆116Oct 11, 2025Updated 9 months ago
JayZhang42 / SLED
View on GitHub
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433
☆122Dec 5, 2024Updated last year
USTC-StarTeam / DR4SR
View on GitHub
KDD 2024 Best Student Paper | DR4SR: dataset regeneration for sequential recommendation.
☆73Jun 10, 2026Updated last month
BaiTheBest / SparseLLM
View on GitHub
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
☆70Mar 27, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
THUDM / MoELoRA_Riemannian
View on GitHub
Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)
☆39Apr 2, 2025Updated last year
jtonglet / Numerical-Hybrid-QA-Literature
View on GitHub
A list of Numerical Multimodal reasoning papers and their implementation
☆11May 13, 2024Updated 2 years ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
BaiTheBest / SRDML
View on GitHub
GitHub Repository for KDD 2022 paper "Saliency-Regularized Deep Multi-Task Learning"
☆12Sep 26, 2023Updated 2 years ago
Qwen-Applications / GD2PO
View on GitHub
☆20Jun 16, 2026Updated last month
PasaLab / NAS-CTR
View on GitHub
☆12Oct 31, 2022Updated 3 years ago
princeton-nlp / ELIZA-Transformer
View on GitHub
[NAACL 2025] Representing Rule-based Chatbots with Transformers
☆23Feb 9, 2025Updated last year
yakuza8 / first-order-predicate-logic-theorem-prover
View on GitHub
Autonomous Theorem Prover for First Order Predicate Logic
☆12Jun 29, 2020Updated 6 years ago
tanzelin430 / The-Scaling-Law-for-Reinforcement-Learning
View on GitHub
[ACL2026]Code Repo for paper "Scaling Behaviors of LLM Reinforcement Learning Post-Training"
☆24Jul 1, 2026Updated 3 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
hpcaitech / transformers
View on GitHub
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
☆11Nov 19, 2024Updated last year
lorelupo / divide-and-rule
View on GitHub
☆12Oct 17, 2022Updated 3 years ago
syoyo / dynamic_bitset
View on GitHub
Simple dynamic bitset template class
☆12Nov 8, 2019Updated 6 years ago
OrangeInSouth / DeePEn
View on GitHub
A method of ensemble learning for heterogeneous large language models.
☆62Aug 7, 2024Updated last year
Olivia-fsm / DoGE
View on GitHub
Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"
☆21Feb 29, 2024Updated 2 years ago
CASIA-LM / MoDS
View on GitHub
☆153Apr 16, 2024Updated 2 years ago
foundation-multimodal-models / CAL
View on GitHub
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
☆58Sep 26, 2024Updated last year
hengzzzhou / ReSo
View on GitHub
☆25Jan 29, 2026Updated 5 months ago
zhaoxlpku / SubgoalXL
View on GitHub
☆26Aug 23, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Bai-YT / AdaptiveSmoothing
View on GitHub
Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".
☆10Feb 6, 2024Updated 2 years ago
ByteDance-Seed / DATAMASK
View on GitHub
Joint Selection for Large-Scale Pre-Training Data via Policy Gradient-based Mask Learning
☆21Jan 4, 2026Updated 6 months ago
wangf3014 / Patch_Scaling
View on GitHub
Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
☆25Feb 25, 2025Updated last year
usaito / unbiased-pairwise-rec
View on GitHub
(ICTIR2020) "Unbiased Pairwise Learning from Biased Implicit Feedback"
☆19Nov 21, 2022Updated 3 years ago
HauffQian / DGAP
View on GitHub
☆14May 13, 2025Updated last year
RulinShao / RAG-evaluation-harnesses
View on GitHub
An evaluation suite for Retrieval-Augmented Generation (RAG).
☆25Apr 26, 2025Updated last year
keep-smile-001 / opentqa
View on GitHub
opentqa is a open framework of the textbook question answering, which includes xtqa, mcan, cmr, mfb, mutan.
☆11Mar 27, 2021Updated 5 years ago