Bringing Language Models to the Most Resource Constrained Devices
☆61Dec 23, 2024Updated last year
Alternatives and similar repositories for TinyLLM
Users that are interested in TinyLLM are comparing it to the libraries listed below.
- Code for Adaptive Deep Neural Network Inference Optimization with EENet☆13Mar 28, 2024Updated 2 years ago
- LaLaRAND: Flexible Layer-by-Layer CPU/GPU Scheduling for Real-Time DNN Tasks☆18Mar 25, 2022Updated 4 years ago
- Swift Implementation of the Model Context Protocol (MCP) Spec☆11Mar 28, 2025Updated last year
- ☆44Jun 27, 2025Updated 11 months ago
- A docker image for One Student One Chip's debug exam☆10Sep 22, 2023Updated 2 years ago
- The 'missing header' for Chisel☆24Feb 5, 2026Updated 4 months ago
- A student training project for HLS and transformer☆11Oct 19, 2022Updated 3 years ago
- ☆10Dec 21, 2020Updated 5 years ago
- ☆14Jun 22, 2022Updated 3 years ago
- A cycle-accurate RISC-V CPU simulator + RTL modeling library in pure Python.☆18Aug 27, 2025Updated 9 months ago
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆37Nov 13, 2025Updated 6 months ago
- ☆10Jul 28, 2020Updated 5 years ago
- NPUsim: Full-Model, Cycle-Level, and Value-Aware Simulator for DNN Accelerators☆55Jan 2, 2025Updated last year
- A Swift version of Marvis TTS, running locally on Apple Silicon using MLX Swift.☆23Jan 4, 2026Updated 5 months ago
- OpenTelemetry wrapper for Claude Code CLI that logs tool calls, token usage, costs, and execution traces to Logfire, Sentry, Honeycomb, o…☆24Oct 24, 2025Updated 7 months ago
- macOS Catalyst Video Speech Recognizer & Natural Language Recognition App demo☆18Oct 6, 2019Updated 6 years ago
- Manuscript of the book "AI Regulations, Ethics, and Social Impact"☆14Aug 28, 2021Updated 4 years ago
- FPGA Innovation Design Competition: RISC-V Processor-based Hardware and Software Design in PGL22G☆12Sep 1, 2023Updated 2 years ago
- LibreTranslate API client for Swift.☆17Jan 27, 2025Updated last year
- Joint Out-of-Distribution Detection and Uncertainty Estimation for Trajectory Prediction: Model, training and evaluation code.☆35Jul 12, 2025Updated 10 months ago
- A pre-trained model with multi-exit transformer architecture.☆56Dec 10, 2022Updated 3 years ago
- My personal solutions, study notes, and reflections for the MIT 6.5940 course assignments.☆15Mar 1, 2024Updated 2 years ago
- Verilog implementation of MC68851 Memory Management Unit☆14Feb 26, 2018Updated 8 years ago
- Teapotlabs BWLR1E: Wireless Environmental Sensor with Energy Harvesting from Solar Energy☆35Apr 29, 2025Updated last year
- Android server for Airplay2☆13Mar 30, 2023Updated 3 years ago
- Computational Memory Neural Network Compiler☆11Aug 11, 2021Updated 4 years ago
- Model summary of keras pre-trained neural networks.☆12Aug 1, 2019Updated 6 years ago
- Official Code Implementation for the CCS 2022 Paper "On the Privacy Risks of Cell-Based NAS Architectures"☆11Nov 21, 2022Updated 3 years ago
- An app to view and edit Open Food Fact products☆11Apr 25, 2024Updated 2 years ago
- QuardStar Tutorial is all you need!☆19Sep 11, 2024Updated last year
- ☆15Dec 7, 2021Updated 4 years ago
- Porting the Linux Kernel to NEMU!☆23Jun 1, 2025Updated last year
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers☆61Nov 22, 2023Updated 2 years ago
- Accelerate convolution neural network for face recognition using GPU☆15Nov 24, 2020Updated 5 years ago
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…☆25Apr 2, 2025Updated last year
- Code for "Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling" [ICML 2021]☆10Mar 14, 2022Updated 4 years ago
- Lab 5 project of MIT-6.5940, deploying LLaMA2-7B-chat on one's laptop with TinyChatEngine.☆18Dec 1, 2023Updated 2 years ago
- This library can be used for human thermal detection. There are examples to read temperature readings as quickly as possible and read the…☆17Jan 8, 2025Updated last year
- Fusing 2D Material World Knowledge on 3D Geometry☆55Mar 23, 2026Updated 2 months ago