dimdano / faiss-fpga
An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search
☆20 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for faiss-fpga
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions ☆15 · Updated 3 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches ☆14 · Updated 5 years ago
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of… ☆15 · Updated 4 years ago
- FPGA-based HyperLogLog Accelerator ☆12 · Updated 4 years ago
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators ☆10 · Updated 9 years ago
- Emulating DMA Engines on GPUs for Performance and Portability ☆34 · Updated 9 years ago
- doppioDB - A hardware accelerated database ☆48 · Updated 7 years ago
- Polyhedral High-Level Synthesis in MLIR ☆29 · Updated last year
- A quantitative performance comparison among DL compilers on CNN models ☆75 · Updated 4 years ago
- Public Release of Stream-Dataflow ☆14 · Updated 5 years ago
- Benchmark suite containing cache filtered traces for use with Ramulator. These include some of the workloads used in our SIGMETRICS 2019 … ☆19 · Updated 4 years ago
- A Distributed Multi-GPU System for Fast Graph Processing ☆63 · Updated 6 years ago
- An IR for efficiently simulating distributed ML computation ☆25 · Updated 10 months ago
- Tacker: Tensor-CUDA Core Kernel Fusion for Improving the GPU Utilization while Ensuring QoS ☆17 · Updated 2 years ago
- Modified version of PyTorch able to work with changes to GPGPU-Sim ☆45 · Updated 2 years ago
- Artifacts for SOSP'19 paper Optimizing Deep Learning Computation with Automatic Generation of Graph Substitutions ☆21 · Updated 2 years ago
- Artifact of ASPLOS'23 paper entitled: GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference ☆16 · Updated last year
- HeteroCL-MLIR dialect for accelerator design ☆40 · Updated 2 months ago
- A Language for Closed-form High-level ARchitecture Modeling ☆19 · Updated 4 years ago
- An Architecture-level Fault Injection Tool for GPU Application Resilience Evaluations ☆16 · Updated 4 years ago
- Experiments evaluating preemption on the NVIDIA Pascal architecture ☆18 · Updated 8 years ago
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.) ☆29 · Updated 6 months ago
- GVProf: A Value Profiler for GPU-based Clusters ☆48 · Updated 8 months ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆19 · Updated last year
- A simple script to plot the Roofline model for given HW platforms and applications ☆9 · Updated 3 months ago
- Fibertree emulator ☆12 · Updated 2 weeks ago