Code for training a sparsity predictor on LLaMA.
☆18May 12, 2024Updated last year
Alternatives and similar repositories for DejaVu_predictor
Users that are interested in DejaVu_predictor are comparing it to the libraries listed below.
- GPU operators for sparse tensor operations☆35Mar 11, 2024Updated 2 years ago
- ☆11Sep 20, 2024Updated last year
- Full FPGA implementation of a dual-channel voice FM receiver (Problem G of the 2019 National Undergraduate Electronic Design Contest)☆18Apr 15, 2020Updated 5 years ago
- Orthogonal Matching Pursuit, parallelized on both CPU and GPU. 100x+ Speedup☆16Mar 8, 2026Updated 3 weeks ago
- ☆17Mar 8, 2025Updated last year
- The 1st Spiking Transformer Benchmark (NeurIPS 2025)☆19Dec 29, 2025Updated 3 months ago
- This is a repo to store circuit design datasets☆19Jan 17, 2024Updated 2 years ago
- AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.☆13Nov 8, 2021Updated 4 years ago
- Mirror of http://gitlab.hpcrl.cse.ohio-state.edu/chong/ppopp19_ae, refactoring for understanding☆16Oct 20, 2021Updated 4 years ago
- RapidLayout: Fast Hard Block Placement of FPGA-Optimized Systolic Arrays using Evolutionary Algorithms☆18Nov 26, 2020Updated 5 years ago
- This is a curated list of "Continual Learning with Pretrained Models" research.☆19May 29, 2025Updated 10 months ago
- implementation of 'The Forward-Forward Algorithm: Some Preliminary Investigations', Hinton 2022☆14Dec 6, 2022Updated 3 years ago
- [HPCA 2026 Best Paper Candidate] Official implementation of "Focus: A Streaming Concentration Architecture for Efficient Vision-Language …☆43Feb 8, 2026Updated last month
- TVMScript kernel for deformable attention☆25Dec 15, 2021Updated 4 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- LLMA = LLM + arithmetic coder: uses an LLM to compress text data aggressively, achieving a very high compression ratio.☆22Nov 24, 2024Updated last year
- ☆19May 4, 2023Updated 2 years ago
- Benchmarking Deepseek R1 API response speeds across different providers for performance comparison.☆10Feb 15, 2025Updated last year
- [TVLSI'23] This repository contains the source code for the paper "FireFly: A High-Throughput Hardware Accelerator for Spiking Neural Net…☆24Apr 4, 2024Updated last year
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆11Apr 14, 2022Updated 3 years ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆39Jun 11, 2025Updated 9 months ago
- ☆63Dec 16, 2025Updated 3 months ago
- Beyond Empathy: Integrating Diagnostic and Therapeutic Reasoning with Large Language Models for Mental Health Counseling☆34Jan 24, 2026Updated 2 months ago
- The code used to train and run inference with MMDocIR☆32May 29, 2025Updated 10 months ago
- Computational predictor of protein intrinsic disorder and its functions☆10Dec 4, 2023Updated 2 years ago
- ☆27Jan 22, 2023Updated 3 years ago
- ☆26Aug 18, 2019Updated 6 years ago
- ☆14May 25, 2022Updated 3 years ago
- OPUS-Rota4: A Gradient-Based Protein Side-Chain Modeling Framework Assisted by Deep Learning-Based Predictors☆10Apr 14, 2022Updated 3 years ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆28Feb 26, 2023Updated 3 years ago
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated 2 years ago
- Control LLM generation format efficiently. A simple version of microsoft/aici in vllm and transformers☆14Jun 7, 2024Updated last year
- ☆13Apr 15, 2024Updated last year
- A byte-level decoder architecture that matches the performance of tokenized Transformers.☆67Apr 24, 2024Updated last year
- A parametric RTL code generator of an efficient integer MxM Systolic Array implementation for Xilinx FPGAs.☆34Aug 28, 2025Updated 7 months ago
- ☆18Mar 3, 2025Updated last year
- RES via complex-valued DNN☆25Sep 3, 2021Updated 4 years ago
- ☆28Dec 2, 2024Updated last year
- The repository for our paper: Neighboring Perturbations of Knowledge Editing on Large Language Models☆16May 4, 2024Updated last year