FPGA-based HyperLogLog Accelerator
☆12Jul 13, 2020Updated 5 years ago
Alternatives and similar repositories for fpga-hyperloglog
Users who are interested in fpga-hyperloglog are comparing it to the repositories listed below
- ☆11Apr 3, 2023Updated 2 years ago
- Towards Hardware and Software Continuous Integration☆13Jun 8, 2020Updated 5 years ago
- BiSUNA framework specialized to compile for the Xilinx Alveo U50☆13Dec 3, 2020Updated 5 years ago
- ☆14Feb 14, 2022Updated 4 years ago
- ☆13Jun 6, 2022Updated 3 years ago
- ☆36Jan 21, 2021Updated 5 years ago
- Xilinx Alveo Graph Analytics Product repository☆14May 18, 2022Updated 3 years ago
- SmartNIC☆14Dec 13, 2018Updated 7 years ago
- Official GitHub repository for the paper "Towards timeout-less transport in commodity datacenter networks."☆16Oct 12, 2021Updated 4 years ago
- Multi-armed bandit algorithm with tensorflow and 11 policies☆16Dec 27, 2022Updated 3 years ago
- C++/MPI proxies for distributed training of deep neural networks.☆15Jun 18, 2022Updated 3 years ago
- TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.☆14Nov 23, 2024Updated last year
- A crypto accelerator written for HLS to an FPGA that actually makes it slower than running it on your computer☆18Dec 11, 2018Updated 7 years ago
- ☆19Dec 3, 2019Updated 6 years ago
- DOSA: Differentiable Model-Based One-Loop Search for DNN Accelerators☆19Oct 10, 2024Updated last year
- ☆21Dec 9, 2018Updated 7 years ago
- Johnson-Lindenstrauss transform (JLT), random projections (RP), fast Johnson-Lindenstrauss transform (FJLT), and randomized Hadamard tran…☆23Jul 11, 2023Updated 2 years ago
- ☆14Sep 27, 2021Updated 4 years ago
- ☆20Nov 12, 2025Updated 3 months ago
- Manages vllm-nccl dependency☆17Jun 3, 2024Updated last year
- A new kind of hardware decompressor for Snappy decompression. Much faster than the existing software one.☆24Jun 27, 2023Updated 2 years ago
- DAMON-based Optimal Operation Schemes☆17Sep 5, 2024Updated last year
- Sources for the Multi-Clock system as described in the paper: MULTI-CLOCK: Dynamic Tiering for Hybrid Memory Systems, HPCA 2022.☆19Mar 21, 2022Updated 3 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Mar 13, 2023Updated 2 years ago
- Kernel Machine Library - fast GPU SVM in .NET. Implemented kernels on CPU and GPU (Linear, RBF, Chi-Square, Exp Chi-Square). Library includes…☆27Mar 31, 2017Updated 8 years ago
- Beyond KV Caching: Shared Attention for Efficient LLMs☆20Jul 19, 2024Updated last year
- The accelerometer analytical model published in ASPLOS 2020 (Accelerometer: Understanding Acceleration Opportunities for Data Center Overh…☆16Jan 18, 2020Updated 6 years ago
- Network Traffic Transformer to learn network dynamics from packet traces. Learn fundamental dynamics with pre-training and fine-tune to m…☆23Jan 17, 2024Updated 2 years ago
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 …☆23Oct 9, 2020Updated 5 years ago
- ☆22Feb 18, 2025Updated last year
- The Artifact of NeoMem: Hardware/Software Co-Design for CXL-Native Memory Tiering☆63Aug 11, 2024Updated last year
- An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search☆25Jul 20, 2019Updated 6 years ago
- ☆24May 6, 2022Updated 3 years ago
- Manually implemented quantization-aware training☆23Oct 12, 2022Updated 3 years ago
- ☆27Mar 2, 2023Updated 3 years ago
- An FPGA-based NetTLP adapter☆27Mar 10, 2020Updated 5 years ago
- Distributed Accelerator OS☆64Apr 6, 2022Updated 3 years ago
- zMonkey is an open-source 200G network impairment emulator tool☆23Mar 8, 2022Updated 3 years ago
- Artifact evaluation repo for EuroSys'24.☆29Nov 7, 2023Updated 2 years ago