High-efficiency LLM inference engine in C++/CUDA. Run Llama 70B on RTX 3090.
☆455Feb 22, 2026Updated last month
Alternatives and similar repositories for ntransformer
Users that are interested in ntransformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Jul 12, 2025Updated 9 months ago
- Rust implementation of the Zstandard Seekable Format☆260Mar 24, 2026Updated 3 weeks ago
- tiny torch, but close to metal☆128Dec 29, 2025Updated 3 months ago
- Experiments with the Mojo 🔥 programming language on macOS arm64 guided by tests☆13Jan 8, 2026Updated 3 months ago
- DeepDream for video with temporal consistency. Features RAFT optical flow estimation and occlusion masking to prevent ghosting. A PyTorch…☆63Jan 2, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Automatically exported from code.google.com/p/smhasher☆16Mar 10, 2021Updated 5 years ago
- a5eq.lv2 - Another 5-Band Equalizer☆34Mar 26, 2025Updated last year
- How to build apps for ChatGPT?☆24Oct 28, 2025Updated 5 months ago
- tui-use lets agents interact with programs that expect a human at the keyboard — REPLs, debuggers, TUI apps, and anything else bash can't…☆160Updated this week
- Server for Matching Long/Lat to Timezone☆47Feb 21, 2026Updated last month
- Tokenflood is a load testing framework for simulating arbitary loads on instruction-tuned LLMs☆45Mar 20, 2026Updated 3 weeks ago
- A comprehensive suite of tools, built to liberate science by making the creation, evaluation, and dissemination of research more transpar…☆246Aug 8, 2025Updated 8 months ago
- Wolfram Language / Mathematica reimplementation in Rust (Wolfram oxidized)☆575Updated this week
- DiscoGrad - automatically differentiate across conditional branches in C++ programs☆211Sep 12, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Mathematical Knots in WebGL☆37Oct 20, 2022Updated 3 years ago
- Zero-dependency C99 GPT-2 engine for edge AI. Sub-1M parameter models train on-device in seconds. Organelle Pipeline Architecture (OPA) c…☆88Updated this week
- Low latency lock free SPSC, SPMC, MPMC Queue and Stack. Fast SpinLock, SeqLock☆10Feb 5, 2024Updated 2 years ago
- Standalone GN with builtin configs.☆18Jan 22, 2025Updated last year
- vm-curator is a fast and friendly TUI to build and manage QEMU/KVM virtual machines for desktop use with working 3D acceleration (para-vi…☆215Mar 9, 2026Updated last month
- Stress test for parallel disk i/o using git and pnpm☆31Mar 5, 2026Updated last month
- Docker-based inference engine for AMD GPUs☆233Oct 7, 2024Updated last year
- The web API server that runs program codes in an isolated environment using Docker.☆18Jul 20, 2023Updated 2 years ago
- ☆17Apr 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 💭 Chat with AI via API☆33Oct 20, 2024Updated last year
- ☆46Jun 11, 2025Updated 10 months ago
- https://ansi.tools☆34Mar 12, 2026Updated last month
- A simple app for downloading YouTube Shorts transcripts. Built to self-host with Python and Streamlit. Free and open source.☆32Dec 4, 2024Updated last year
- Inspired by Midnight Commander, tailored to my taste.☆51Updated this week
- ☆252Mar 20, 2024Updated 2 years ago
- A library for building dynamic terminal apps, using bonsai☆140Apr 6, 2026Updated last week
- Filesystem 'at' implementations for Unix and Windows☆12Apr 4, 2026Updated last week
- Ideas, concepts, tools and examples of sketch programming☆23Dec 20, 2025Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Resource and domain modeling for quick APIs, CMSs, and applications.☆26Jan 4, 2023Updated 3 years ago
- Tempest game clone for Mac OS classic☆16Sep 13, 2018Updated 7 years ago
- GraphTerm: An aspirational DevOps and Container IDE Concept☆27Nov 20, 2017Updated 8 years ago
- Ruqe brings the convenient types and methods found in Rust into Dart, such as the Result, Option, pattern-matching, etc.☆13Sep 13, 2023Updated 2 years ago
- StickOS® BASIC -- an entirely MCU-resident patented interactive programming environment, used by Flea-Scope™☆36Apr 8, 2025Updated last year
- Parsers for CUDA binary files☆24Dec 29, 2023Updated 2 years ago
- ☆25Dec 20, 2021Updated 4 years ago