[DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs
☆28Nov 13, 2025Updated 3 months ago
Alternatives and similar repositories for TeraFly
Users that are interested in TeraFly are comparing it to the libraries listed below
Sorting:
- An AlphaZero engine for Saiblo Connect4, featuring a pure Python implementation of key KataGo techniques.☆15Updated this week
- Unsupervised muti-metric fusion for Full-Reference (FR) Image Quality Assessment (IQA)☆11Jul 11, 2014Updated 11 years ago
- VST that combines the classic mdaPiano and EPiano in a new plug-in☆16Oct 10, 2025Updated 4 months ago
- a toy mock server based on anyproxy☆17Mar 18, 2019Updated 6 years ago
- [FPL'24] This repository contains the source code for the paper “Revealing Untapped DSP Optimization Potentials for FPGA-based Systolic M…☆21May 6, 2024Updated last year
- A parametric RTL code generator of an efficient integer MxM Systolic Array implementation for Xilinx FPGAs.☆31Aug 28, 2025Updated 6 months ago
- [CVPR 2025] Implementation of "Forensics-Bench: A Comprehensive Forgery Detection Benchmark Suite for Large Vision Language Models"☆36Apr 28, 2025Updated 10 months ago
- [Spotlight ICLR 2023 paper] Continual evaluation for lifelong learning with neural networks, identifying the stability gap.☆30Apr 2, 2023Updated 2 years ago
- [DATE'25, ICCAD'25] An embedded FPGA-based LLM accelerator capable of supporting Llama2-7B☆79Jan 6, 2026Updated last month
- A simulator for SK hynix AiM PIM architecture based on Ramulator 2.0☆59Jul 22, 2025Updated 7 months ago
- Never lose context again with a persistent, queryable memory system for AI agents and development teams.☆18Jan 29, 2026Updated last month
- ☆11Jan 21, 2021Updated 5 years ago
- 2019年全国大学生电子设计大赛G题双路语音调频接收机的FPGA全实现☆17Apr 15, 2020Updated 5 years ago
- OpenExSys_NoC a mesh-based network on chip IP.☆20Dec 1, 2023Updated 2 years ago
- A lightweight, high-performance deep learning inference framework built in Rust. Zen-Infer provides a clean, modular architecture for dep…☆20Jul 31, 2025Updated 7 months ago
- ☆37Nov 11, 2018Updated 7 years ago
- ☆12Sep 18, 2024Updated last year
- MCP server for Fluent (ServiceNow SDK)☆18Feb 9, 2026Updated 3 weeks ago
- NSFW detection and annotator application for images. Detects and segments only nudity for now.☆10Feb 14, 2026Updated 2 weeks ago
- Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision☆12Jan 19, 2026Updated last month
- Includes the SVD-based approximation algorithms for compressing deep learning models and the FPGA accelerators exploiting such approximat…☆16Mar 3, 2023Updated 2 years ago
- TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models, optimized for edge deployment on Xi…☆26Mar 24, 2025Updated 11 months ago
- a student trainning project for HLS and transformer☆11Oct 19, 2022Updated 3 years ago
- ☆11Aug 4, 2020Updated 5 years ago
- Home page for Microsoft Phi-Ground tech-report☆23Sep 8, 2025Updated 5 months ago
- c++ version of ViT☆12Nov 13, 2022Updated 3 years ago
- ☆14Jun 22, 2022Updated 3 years ago
- A docker image for One Student One Chip's debug exam☆10Sep 22, 2023Updated 2 years ago
- FSA: Fusing FlashAttention within a Single Systolic Array☆89Aug 12, 2025Updated 6 months ago
- A simple API built to fetch chapters along with invocations from known book "Hisnul Muslim" (Fortress of the Muslim)☆11Sep 9, 2022Updated 3 years ago
- ☆11Sep 20, 2024Updated last year
- 哈尔滨工业大学(深圳)2021年球季学期深度学习体系结构实验☆17Oct 1, 2022Updated 3 years ago
- An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL…☆14Apr 15, 2024Updated last year
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning☆122Aug 27, 2024Updated last year
- Multi Layer Perceptron by Vivado HLS for Xilinx FPGA implementation☆12Dec 26, 2016Updated 9 years ago
- Example of Matrix Multiplication using Map Reduce paradigm in python☆10Oct 25, 2016Updated 9 years ago
- Chinese Guide for Alveo Getting Started☆12May 18, 2020Updated 5 years ago
- ☆10Oct 8, 2021Updated 4 years ago
- A beautiful terminal social client for Mastodon and Bluesky 🐦☆55Feb 21, 2026Updated last week