3% Is All You Need: Breaking TurboQuant's Compression Limit via Spectral Structure
☆133Apr 7, 2026Updated last month
Alternatives and similar repositories for spectralquant
Users that are interested in spectralquant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository provides a framework to serve LLM(Large Language Model) based applications such as Chatbot.☆18Apr 20, 2023Updated 3 years ago
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- ☆10Nov 12, 2024Updated last year
- ☆28Feb 11, 2026Updated 3 months ago
- Language modeling with linear-cost context☆118Sep 25, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🚀 [ICLR '25] RocketEval: Efficient Automated LLM Evaluation via Grading Checklist☆16Aug 21, 2025Updated 8 months ago
- Go implementation of the Gun distributed graph database☆11Feb 26, 2019Updated 7 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆18Nov 4, 2025Updated 6 months ago
- Play Chrome's Dinosaur Game with Reinforcement Learning☆13Jun 25, 2023Updated 2 years ago
- ☆22Jan 13, 2025Updated last year
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆16Feb 4, 2025Updated last year
- Understanding deep networks and large models.☆28Jan 23, 2026Updated 3 months ago
- ☆17Feb 23, 2025Updated last year
- LMDB Adapter for gunDB☆14Dec 8, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Neural text to speech system that uses eSpeak as a text/phoneme front-end☆16Oct 20, 2021Updated 4 years ago
- Open-source web scraping API. Turn any website into clean markdown or structured JSON. Anti-detect browser, proxy auto-selection, self-ho…☆102May 6, 2026Updated 2 weeks ago
- 🎓Automatically Update circult-eda-mlsys-tinyml Papers Daily using Github Actions (Update Every 8th hours)☆10Updated this week
- ☆21Jun 12, 2024Updated last year
- MoE-Visualizer is a tool designed to visualize the selection of experts in Mixture-of-Experts (MoE) models.☆16Apr 8, 2025Updated last year
- This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…☆14Feb 25, 2026Updated 2 months ago
- Android Photo/Video Recording/Capture/Effects via OpenGL☆10Feb 21, 2021Updated 5 years ago
- A whisper repo for TPU☆11Jun 4, 2024Updated last year
- PAct: Part-Decomposed Single-View Articulated Object Generation☆51May 12, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Integration test of Verilog AXI modules (https://github.com/alexforencich/verilog-axi) with LiteX.☆17Dec 19, 2022Updated 3 years ago
- Chameleon: A Multiplier-Free Temporal Convolutional Network Accelerator for End-to-End Few-Shot and Continual Learning from Sequential Da…☆27Mar 5, 2026Updated 2 months ago
- Firecracker VM orchestration for Claude Code sessions☆29Mar 30, 2026Updated last month
- GunDB HTTP/HTTPS Server and API☆19Feb 15, 2018Updated 8 years ago
- Infusion: Preventing Customized Text-to-Image Diffusion from Overfitting☆14Dec 19, 2025Updated 5 months ago
- 适用于sophon bm1684x,基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答☆14Jun 5, 2024Updated last year
- ☆271Apr 15, 2026Updated last month
- Used FPGA board and System Verilog to design controller, DMA, pipelined SIMD processor, and GEMM accelerator☆12Aug 26, 2023Updated 2 years ago
- SkinTokens: A Learned Compact Representation for Unified Autoregressive Rigging☆91May 12, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆17May 27, 2019Updated 6 years ago
- [ECCV 2024] CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs☆18Jul 2, 2024Updated last year
- A Nestjs Angular Graphql Arangodb starter TypeScript project。☆15Jul 29, 2020Updated 5 years ago
- [ICML 2021] "Double-Win Quant: Aggressively Winning Robustness of Quantized DeepNeural Networks via Random Precision Training and Inferen…☆16Feb 13, 2022Updated 4 years ago
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- Structured pruning and bias visualization for Large Language Models. Tools for LLM optimization and fairness analysis.☆39Updated this week
- Can Language Models Solve Olympiad Programming?☆124Jan 14, 2025Updated last year