Bringing Language Models to the Most Resource Constrained Devices
☆56Dec 23, 2024Updated last year
Alternatives and similar repositories for TinyLLM
Users that are interested in TinyLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆12Sep 18, 2024Updated last year
- Swift Implementation of the Model Context Protocol (MCP) Spec☆11Mar 28, 2025Updated last year
- ☆22Jul 11, 2025Updated 10 months ago
- ☆10Oct 8, 2021Updated 4 years ago
- MATLAB/Octave generator of Hamming ECC coding. Output format is Verilog HDL.☆12Dec 27, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 哈尔滨工业大学(深圳)2021年球季学期深度学习体系结构实验☆17Oct 1, 2022Updated 3 years ago
- The 'missing header' for Chisel☆23Feb 5, 2026Updated 3 months ago
- a student trainning project for HLS and transformer☆11Oct 19, 2022Updated 3 years ago
- ☆10Dec 21, 2020Updated 5 years ago
- ☆14Jun 22, 2022Updated 3 years ago
- A cycle-accurate RISC-V CPU simulator + RTL modeling library in pure Python.☆18Aug 27, 2025Updated 8 months ago
- This is a mysic detection app that uses ShazamKit to detect music. Once it detects the music, it takes the information about that music t…☆11Dec 5, 2021Updated 4 years ago
- ☆10Jul 28, 2020Updated 5 years ago
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆36Nov 13, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- My WWDC17 scholarship winning playground☆13Feb 14, 2019Updated 7 years ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated 2 years ago
- FPGA Innovation Design Competition:RISC-V Processor-based Hardware and Software Design in PGL22G☆12Sep 1, 2023Updated 2 years ago
- Reimplementation of facebook's DinoV2 in JAX. Inference (with pretrained weights) only; training is unsupported.☆12Jun 25, 2024Updated last year
- OCR ACE tensorflow☆11Jul 5, 2019Updated 6 years ago
- https://nnsmith-asplos.rtfd.io Artifact of "NNSmith: Generating Diverse and Valid Test Cases for Deep Learning Compilers" ASPLOS'23☆11Mar 29, 2023Updated 3 years ago
- 此项目是我个人对MIT 6.5940 课程作业的答案,学习笔记和心得。☆15Mar 1, 2024Updated 2 years ago
- Verilog implementation of MC68851 Memory Management Unit☆13Feb 26, 2018Updated 8 years ago
- [EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing☆14Feb 10, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Computational Memory Neural Network Compiler☆11Aug 11, 2021Updated 4 years ago
- AI Research team evaluation repository☆34Dec 5, 2025Updated 5 months ago
- Official Code Implementation for the CCS 2022 Paper "On the Privacy Risks of Cell-Based NAS Architectures"☆11Nov 21, 2022Updated 3 years ago
- BIOBOT: A Fall Detection System (FDS) using Artificial Intelligence☆13Jan 19, 2019Updated 7 years ago
- QuardStar Tutorial is all you need !☆18Sep 11, 2024Updated last year
- Log keys pressed on macOS. Useful for screen recordings and presentations.☆13Dec 5, 2022Updated 3 years ago
- 基于3D卷积C3D,利用Faster-RCNN思路,使用时间段建议网络Temporal Proposal 检测视频行为☆13Jun 20, 2018Updated 7 years ago
- An Unoffical Implementation of PeleeNet by TensorFlow, Keras☆14Jun 17, 2019Updated 6 years ago
- ☆15Dec 7, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 给NEMU移植Linux Kernel!☆23Jun 1, 2025Updated 11 months ago
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers☆60Nov 22, 2023Updated 2 years ago
- Abstractions for iterating and mapping over struct fields☆17Jan 15, 2026Updated 4 months ago
- Sample Android Camera Application☆16Jul 28, 2013Updated 12 years ago
- Accelerate convolution neural network for face recognition using GPU☆14Nov 24, 2020Updated 5 years ago
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…☆24Apr 2, 2025Updated last year
- Official implementation of the paper “Endowing Vision-Language Models with System 2 Thinking for Fine-Grained Visual Recognition,” AAAI 2…☆37Jan 30, 2026Updated 3 months ago