TUDa-HWAI / Basis_Sharing
☆11 · Updated 9 months ago
Alternatives and similar repositories for Basis_Sharing
Users that are interested in Basis_Sharing are comparing it to the libraries listed below
- Official implementation for "Pruning Large Language Models with Semi-Structural Adaptive Sparse Training" (AAAI 2025) ☆12 · Updated 2 weeks ago
- ☆13 · Updated 7 months ago
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts". ☆19 · Updated 8 months ago
- ☆14 · Updated 7 months ago
- KV cache compression via sparse coding ☆11 · Updated 2 months ago
- [ICLR 2025] ELICIT: LLM Augmentation Via External In-context Capability ☆11 · Updated 4 months ago
- Generic library for neural collapse and several derivative works on the phenomenon. ☆12 · Updated 3 months ago
- This is the official pytorch implementation for paper: Filter, Correlate, Compress: Training-Free Token Reduction for MLLM Acceleration ☆15 · Updated 4 months ago
- Official code for the paper "HEXA-MoE: Efficient and Heterogeneous-Aware MoE Acceleration with Zero Computation Redundancy" ☆13 · Updated 4 months ago
- ☆10 · Updated this week
- ☆11 · Updated 6 months ago
- Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents ☆13 · Updated 7 months ago
- ☆24 · Updated 2 months ago
- Confidence Regulation Neurons in Language Models (NeurIPS 2024) ☆10 · Updated 5 months ago
- ☆16 · Updated 6 months ago
- ☆61 · Updated last month
- 2D-TPE: Two-Dimensional Positional Encoding Enhances Table Understanding for Large Language Models (WWW 2025) ☆10 · Updated 3 months ago
- ChatCoach is a fitness correction system based on pose estimation and large language models (LLMs). The primary goal is to provide fitnes… ☆8 · Updated 8 months ago
- A code sample demonstrating how to share and rebuild a PyTorch GPU tensor via its pointer/reference between different processes. ☆12 · Updated 10 months ago
- ☆12 · Updated 3 months ago
- Repo for Anonymous purpose, pls don't distribute ☆10 · Updated 9 months ago
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition ☆14 · Updated 3 months ago
- In this repository, we delve into the basic concepts of Python from scratch. We explored everything from a beginner perspective who sta… ☆8 · Updated 9 months ago
- This is a web API for a book site ☆9 · Updated 8 months ago
- ☆16 · Updated 3 months ago
- ☆26 · Updated 3 months ago
- Chrome extension designed to calculate the weighted average of grades from your university grade summary page. ☆9 · Updated this week
- Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs (EMNLP 2024) ☆10 · Updated 8 months ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration ☆52 · Updated 4 months ago
- Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity" ☆74 · Updated last week