dimdano / faiss-fpgaLinks
An FPGA integration and acceleration of the popular FAISS framework for approximate similarity search
☆24Updated 6 years ago
Alternatives and similar repositories for faiss-fpga
Users that are interested in faiss-fpga are comparing it to the libraries listed below
Sorting:
- MLSys 2021 paper: MicroRec: efficient recommendation inference by hardware and data structure solutions☆18Updated 4 years ago
- Alveo Collective Communication Library: MPI-like communication operations for Xilinx Alveo accelerators☆97Updated 2 months ago
- SMASH is a hardware-software cooperative mechanism that enables highly-efficient indexing and storage of sparse matrices. The key idea of…☆17Updated 5 years ago
- PTX-EMU is a simple emulator for CUDA program.☆34Updated 5 months ago
- ☆22Updated 7 months ago
- doppioDB - A hardware accelerated database☆49Updated 8 years ago
- Productive and portable performance programming across spatial architectures (FPGAs, etc.) and vector architectures (GPUs, etc.)☆31Updated last year
- ETHZ Heterogeneous Accelerated Compute Cluster.☆37Updated 5 months ago
- ☆19Updated 6 years ago
- Modified version of PyTorch able to work with changes to GPGPU-Sim☆56Updated 2 years ago
- Public Release of Stream-Dataflow☆14Updated 6 years ago
- ☆65Updated 4 years ago
- This is an read-only mirror of the gem5 simulator. The upstream repository is stored in https://gem5.googlesource.com, code reviews shoul…☆12Updated 5 years ago
- ☆23Updated 4 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Updated 6 years ago
- Emulating DMA Engines on GPUs for Performance and Portability☆41Updated 10 years ago
- Virtualized Accelerator Orchestration for Multi-Tenant Workloads☆19Updated 10 months ago
- ☆14Updated 3 years ago
- FPGA-based HyperLogLog Accelerator☆12Updated 5 years ago
- A Vector Caching Scheme for Streaming FPGA SpMV Accelerators☆10Updated 10 years ago
- hardware (ASIC) DEFLATE designed for low-latency page-granularity memory compression and implemented in Chisel☆14Updated 10 months ago
- A parallel and distributed simulator for thousand-core chips☆25Updated 7 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo☆17Updated 2 years ago
- TensorCore Vector Processor for Deep Learning - Google Summer of Code Project☆22Updated 4 years ago
- ☆25Updated last year
- SmartNIC☆14Updated 6 years ago
- ☆19Updated 5 years ago
- Learn NVDLA by SOMNIA☆43Updated 5 years ago
- PARADE: A Cycle-Accurate Full-System Simulation Platform for Accelerator-Rich Architectural Design and Exploration☆49Updated 3 years ago
- Source code of the simulator used in the Mosaic paper from MICRO 2017: "Mosaic: A GPU Memory Manager with Application-Transparent Support…☆49Updated 7 years ago