☆11Sep 20, 2024Updated last year
Alternatives and similar repositories for shadow_llm
Users who are interested in shadow_llm are comparing it to the libraries listed below
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…☆21Sep 10, 2024Updated last year
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 8 months ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆26Apr 15, 2025Updated 10 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆13Apr 29, 2025Updated 10 months ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆33Feb 26, 2026Updated last week
- Enhanced Explainable Neural Network☆10Dec 25, 2021Updated 4 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Code for the paper "SMACE: A New Method for the Interpretability of Composite Decision Systems", ECML 2022☆15Apr 17, 2023Updated 2 years ago
- ☆39Aug 27, 2024Updated last year
- ☆11Dec 14, 2022Updated 3 years ago
- Generic library for neural collapse and several derivative works on the phenomenon.☆18Apr 14, 2025Updated 10 months ago
- [FCCM 2023] PASTA: Programming and Automation Support for Scalable Task-Parallel HLS Programs on Modern Multi-Die FPGAs☆13Jun 26, 2025Updated 8 months ago
- Software package for intertemporal pricing optimization under reference effects and consumer heterogeneity estimation. Please see README.…☆10Mar 7, 2024Updated last year
- POSTECH: Compiler Construction (Spring 2022)☆11Mar 10, 2023Updated 2 years ago
- Multi-resource Dynamic Coordinated Planning of Flexible Distribution Network☆15Jun 11, 2024Updated last year
- ☆10Oct 26, 2022Updated 3 years ago
- This repository reproduces the results in the paper "How expressive are transformers in spectral domain for graphs?" (published in TMLR)☆12Jul 10, 2022Updated 3 years ago
- Deep Generative Model (Torch)☆11Apr 19, 2016Updated 9 years ago
- Temporal summarization framework☆10Dec 4, 2023Updated 2 years ago
- ☆11Jan 13, 2026Updated last month
- Repository for the DPP'23 course☆11May 2, 2024Updated last year
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆26Jun 16, 2025Updated 8 months ago
- ☆10Apr 15, 2022Updated 3 years ago
- Pluralsight Reporting API - Python Client☆11Apr 13, 2017Updated 8 years ago
- ☆10Aug 16, 2023Updated 2 years ago
- ☆11Oct 28, 2021Updated 4 years ago
- ☆13Feb 4, 2025Updated last year
- ☆12Feb 27, 2023Updated 3 years ago
- Comparing sequential forecasters via confidence sequences & e-processes☆11Oct 24, 2023Updated 2 years ago
- Full FPGA implementation of the dual-channel voice FM receiver (Problem G, 2019 National Undergraduate Electronic Design Contest)☆17Apr 15, 2020Updated 5 years ago
- ☆52Nov 5, 2024Updated last year
- solver for discrete Mixed Observable Markov Decision Processes☆11Oct 30, 2020Updated 5 years ago
- ☆10Apr 24, 2024Updated last year
- SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks☆14Mar 2, 2023Updated 3 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo…☆13Nov 1, 2022Updated 3 years ago
- JSSP dataset for LLMs☆17May 29, 2025Updated 9 months ago
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆55Jul 16, 2025Updated 7 months ago
- A novel incremental hierarchical clustering algorithm (KDD 22)☆10Aug 31, 2023Updated 2 years ago
- Deep Large-Scale Inference Using Knockoffs☆11Nov 4, 2021Updated 4 years ago