Code for training the sparsity predictor on LLaMA.
☆18 · May 12, 2024 · Updated last year
Alternatives and similar repositories for DejaVu_predictor
Users that are interested in DejaVu_predictor are comparing it to the libraries listed below.
- GPU operators for sparse tensor operations ☆36 · Mar 11, 2024 · Updated 2 years ago
- FlexOS: Towards Flexible OS Isolation (ASPLOS'22) Artifact Evaluation Repository ☆19 · Apr 2, 2022 · Updated 4 years ago
- ☆11 · Sep 20, 2024 · Updated last year
- [DATE'2025, TCAD'2025] Terafly: A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs ☆36 · Nov 13, 2025 · Updated 5 months ago
- Deferred Continuous Batching in Resource-Efficient Large Language Model Serving (EuroMLSys 2024) ☆19 · May 28, 2024 · Updated last year
- ☆17 · Mar 8, 2025 · Updated last year
- The 1st Spiking Transformer Benchmark (NeurIPS 2025) ☆19 · Dec 29, 2025 · Updated 4 months ago
- This is a repo to store circuit design datasets ☆19 · Jan 17, 2024 · Updated 2 years ago
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao. ☆13 · Nov 8, 2021 · Updated 4 years ago
- ☆19 · Aug 6, 2021 · Updated 4 years ago
- Implementation of 'The Forward-Forward Algorithm: Some Preliminary Investigations', Hinton 2022 ☆14 · Dec 6, 2022 · Updated 3 years ago
- ICLR2023 - Tailoring Language Generation Models under Total Variation Distance ☆21 · Feb 8, 2023 · Updated 3 years ago
- ☆13 · Feb 24, 2020 · Updated 6 years ago
- [HPCA 2026 Best Paper Candidate] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language … ☆51 · Feb 8, 2026 · Updated 3 months ago
- ☆24 · Aug 15, 2023 · Updated 2 years ago
- TVMScript kernel for deformable attention ☆25 · Dec 15, 2021 · Updated 4 years ago
- Repository for Skill Set Optimization ☆14 · Jul 26, 2024 · Updated last year
- PyTorch implementation of the ExStream method from our ICRA-2019 paper "Memory Efficient Experience Replay for Streaming Learning" ☆22 · Nov 26, 2019 · Updated 6 years ago
- [FPL'24] This repository contains the source code for the paper “Revealing Untapped DSP Optimization Potentials for FPGA-based Systolic M… ☆22 · May 6, 2024 · Updated 2 years ago
- ☆13 · Oct 2, 2024 · Updated last year
- ☆19 · May 4, 2023 · Updated 3 years ago
- Benchmarking Deepseek R1 API response speeds across different providers for performance comparison. ☆10 · Feb 15, 2025 · Updated last year
- [TVLSI'23] This repository contains the source code for the paper "FireFly: A High-Throughput Hardware Accelerator for Spiking Neural Net… ☆24 · Apr 4, 2024 · Updated 2 years ago
- ☆164 · Feb 15, 2025 · Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆39 · Jun 11, 2025 · Updated 10 months ago
- ☆120 · Nov 17, 2023 · Updated 2 years ago
- ☆21 · Apr 29, 2026 · Updated last week
- natural annotated text-category pairs for text classification ☆10 · Sep 10, 2021 · Updated 4 years ago
- The code used to train and run inference with MMDocIR ☆33 · May 29, 2025 · Updated 11 months ago
- ☆78 · Dec 16, 2025 · Updated 4 months ago
- Model to predict kinase-ligand pKi values. ☆12 · Jul 6, 2023 · Updated 2 years ago
- ☆27 · Jan 22, 2023 · Updated 3 years ago
- ☆27 · Aug 18, 2019 · Updated 6 years ago
- ☆14 · May 25, 2022 · Updated 3 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors ☆10 · Apr 14, 2022 · Updated 4 years ago
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r… ☆13 · Mar 4, 2024 · Updated 2 years ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers ☆14 · Jun 7, 2024 · Updated last year
- ☆13 · Apr 15, 2024 · Updated 2 years ago
- A byte-level decoder architecture that matches the performance of tokenized Transformers. ☆68 · Apr 24, 2024 · Updated 2 years ago