throwaway GPT inference
☆141Jun 1, 2024Updated last year
Alternatives and similar repositories for a1gpt
Users that are interested in a1gpt are comparing it to the libraries listed below
Sorting:
- Make triton easier☆50Jun 12, 2024Updated last year
- Explore Daily Updated Statistics for Repositories in the 'awesome-rust' List☆15Updated this week
- Agent based market simulation☆15Aug 10, 2024Updated last year
- Lemon is an LALR(1) parser generator for C or C++.☆17Jun 10, 2014Updated 11 years ago
- 🍓 A toy object-oriented programming language written by rust☆17Apr 10, 2024Updated last year
- The C4 Concurrent C Fuzzer☆14Nov 2, 2023Updated 2 years ago
- Use tensor core to calculate back-to-back HGEMM (half-precision general matrix multiplication) with MMA PTX instruction.☆13Nov 3, 2023Updated 2 years ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆200Sep 24, 2023Updated 2 years ago
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead)☆21Oct 2, 2024Updated last year
- Reversal Curse Experiment☆15Sep 24, 2023Updated 2 years ago
- An unofficial implementation of BitNet☆12Mar 22, 2024Updated last year
- Algebraic enhancements for GEMM & AI accelerators☆291Feb 28, 2025Updated last year
- 6502 Emulator written in C++☆13Feb 18, 2025Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Sep 6, 2023Updated 2 years ago
- Implementation of a simple matching engine and an order book for a stock exchange☆13Aug 28, 2017Updated 8 years ago
- Richard is gaining power☆200Jun 21, 2025Updated 8 months ago
- 2-layer and 4-layer FPGA development board with ZYNQ 7010/7020 400-pin BGA.☆20Jan 6, 2026Updated 2 months ago
- Inference Llama 2 in one file of pure C☆19,262Aug 6, 2024Updated last year
- Wrapper for the new GPT functions API☆15Jun 14, 2023Updated 2 years ago
- A source-to-source compiler for optimizing CUDA dynamic parallelism by aggregating launches☆15Jun 21, 2019Updated 6 years ago
- Seamless llvm-mca CMake integration☆28Mar 7, 2020Updated 6 years ago
- A CLI to manage install and configure llama inference implemenation in multiple languages☆65Jan 4, 2024Updated 2 years ago
- ☆252Mar 20, 2024Updated 2 years ago
- This repository contains a pure C++ ONNX implementation of multiple offline AI models, such as StableDiffusion (1.5 and XL), ControlNet, …☆633May 29, 2025Updated 9 months ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆158Aug 5, 2023Updated 2 years ago
- ☆19Sep 26, 2023Updated 2 years ago
- MLIR tools and dialect for GraphBLAS☆18Mar 30, 2022Updated 3 years ago
- A minimal viable programming language on top of liblgpp☆78Jan 5, 2021Updated 5 years ago
- DeMo: Decoupled Momentum Optimization☆198Dec 2, 2024Updated last year
- ☆181Dec 13, 2023Updated 2 years ago
- These classes calculate fast mathematical Sine/Cos for limited accuracy☆10Mar 7, 2021Updated 5 years ago
- ☆10Jun 2, 2023Updated 2 years ago
- Browser-based Voice Assistant☆43Mar 31, 2023Updated 2 years ago
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF☆34Mar 14, 2024Updated 2 years ago
- Hello World using 6 different methods in Assembly Language for Raspberry Pi☆21Jul 30, 2023Updated 2 years ago
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆379Apr 21, 2025Updated 10 months ago
- The closs platform shell of Nim, by Nim, for Nim☆11Dec 14, 2018Updated 7 years ago
- QuickJs based wrapper generator for WASM components in written in JavaScript☆18Updated this week
- minimal diffusion transformer in pytorch.☆17Oct 6, 2024Updated last year