wilyub / VeriThoughtsLinks
The first large scale formally verified reasoning dataset for Verilog
☆18Updated 7 months ago
Alternatives and similar repositories for VeriThoughts
Users that are interested in VeriThoughts are comparing it to the libraries listed below
Sorting:
- LLM4HWDesign Starting Toolkit☆19Updated last year
- [ICLR 2025] TidalDecode: A Fast and Accurate LLM Decoding with Position Persistent Sparse Attention☆50Updated 5 months ago
- Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs☆27Updated last year
- Sirius, an efficient correction mechanism, which significantly boosts Contextual Sparsity models on reasoning tasks while maintaining its…☆21Updated last year
- ☆52Updated last year
- ☆15Updated last year
- First Latency-Aware Competitive LLM Agent Benchmark☆26Updated 7 months ago
- MAGE: A Multi-Agent Engine for Automated RTL Code Generation☆82Updated 9 months ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆50Updated last year
- Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference☆47Updated last year
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference☆55Updated last year
- ☆34Updated 9 months ago
- The official implementation of the DAC 2024 paper GQA-LUT☆20Updated last year
- ☆40Updated last year
- TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators☆108Updated 6 months ago
- LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification☆73Updated 5 months ago
- [NeurIPS'23] Speculative Decoding with Big Little Decoder☆96Updated last year
- [NeurIPS'25 Spotlight] Adaptive Attention Sparsity with Hierarchical Top-p Pruning☆83Updated last month
- ☆49Updated 7 months ago
- ☆39Updated last year
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM tra…☆45Updated 3 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆39Updated 11 months ago
- AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference☆20Updated 11 months ago
- Residual vector quantization for KV cache compression in large language model☆10Updated last year
- ☆13Updated last year
- Autocomp: AI-Driven Code Optimizer for Tensor Accelerators☆59Updated last week
- Accelerating Large-Scale Reasoning Model Inference with Sparse Self-Speculative Decoding☆74Updated last month
- ☆53Updated 4 months ago
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning☆119Updated last year
- ☆33Updated last year