new optimizer
☆20Aug 4, 2024Updated last year
Alternatives and similar repositories for grokadamw
Users that are interested in grokadamw are comparing it to the libraries listed below
Sorting:
- ☆138Aug 19, 2024Updated last year
- ☆16Feb 6, 2024Updated 2 years ago
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆34Oct 13, 2025Updated 4 months ago
- ☆20Mar 1, 2023Updated 3 years ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆21Jul 26, 2024Updated last year
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆24Oct 12, 2024Updated last year
- Quick ADC☆27May 31, 2019Updated 6 years ago
- Experimental GPU language with meta-programming☆26Sep 6, 2024Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Aug 14, 2024Updated last year
- ☆32Nov 11, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- ☆32Feb 3, 2026Updated last month
- Code for this paper "HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts via HyperNetwork"☆33Nov 29, 2023Updated 2 years ago
- ☆34May 14, 2025Updated 9 months ago
- ☆16Jan 13, 2022Updated 4 years ago
- ☆40Jul 26, 2024Updated last year
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Jan 15, 2024Updated 2 years ago
- Local LLM Testing & Benchmarking for Apple Silicon☆56Feb 26, 2026Updated last week
- A sample monorepo using turborepo, next.js, shadcn-ui, and tailwind to share components between multiple applications☆11Nov 30, 2023Updated 2 years ago
- Extensive time series analysis of chinese PM2.5 content, using models from ARMA and VAR to LSTMs and dynamic time warping clustering☆11Aug 17, 2019Updated 6 years ago
- an autonomous independent digital companion☆14Feb 12, 2026Updated 3 weeks ago
- Cookiecutter template for making a cog for Red.☆12Jun 18, 2024Updated last year
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- Discord Docsbot, Built on bgent☆11Jun 17, 2024Updated last year
- Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platform…☆11Apr 22, 2023Updated 2 years ago
- A set of scripts for using Oracle AI Database 26ai Free Container Image in Oracle Container Registry☆13Oct 15, 2025Updated 4 months ago
- 🧠🗺 because your mind doesn't have ugly boxes everywhere☆12Aug 17, 2022Updated 3 years ago
- Extract streaming data from text using prefix completion.☆10Oct 6, 2024Updated last year
- Ask AI to test your website with a specific goal☆15Dec 22, 2023Updated 2 years ago
- ☆113Jul 23, 2025Updated 7 months ago
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology☆22Jul 17, 2025Updated 7 months ago
- Seam carving implemented in rust☆12Apr 19, 2020Updated 5 years ago
- A minimal re-implementation of orthogonal fine-tuning (OFT) for LLMs. Based on nanoGPT and minLoRA.☆13Nov 17, 2023Updated 2 years ago
- Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral)☆52Apr 23, 2025Updated 10 months ago
- ☆17Updated this week
- An implementation of a quantum neural network built using pyquil.☆11Jun 7, 2019Updated 6 years ago