new optimizer
☆20Aug 4, 2024Updated last year
Alternatives and similar repositories for grokadamw
Users that are interested in grokadamw are comparing it to the libraries listed below
Sorting:
- ☆16Feb 6, 2024Updated 2 years ago
- CUDA implementation of Wavelet KAN.☆16Jun 8, 2024Updated last year
- A basic pure pytorch implementation of flash attention☆16Oct 28, 2024Updated last year
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆34Oct 13, 2025Updated 4 months ago
- This repository contains code for the MicroAdam paper.☆21Dec 14, 2024Updated last year
- ☆20Mar 1, 2023Updated 3 years ago
- Fast, High-Fidelity LLM Decoding with Regex Constraints☆21Jul 26, 2024Updated last year
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- [CoLM 24] Official Repository of MambaByte: Token-free Selective State Space Model☆24Oct 12, 2024Updated last year
- Quick ADC☆27May 31, 2019Updated 6 years ago
- Experimental GPU language with meta-programming☆26Sep 6, 2024Updated last year
- My Implementation of Q-Sparse: All Large Language Models can be Fully Sparsely-Activated☆33Aug 14, 2024Updated last year
- ☆32Nov 11, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- A universal messaging library for cross-platform applications (Chrome extension, Web, Mobile, Iframe,...)☆15Oct 10, 2025Updated 4 months ago
- ☆32Feb 3, 2026Updated last month
- The implementation of the paper: "Visualization-of-Thought Elicits Spatial Reasoning in Large Language Models"☆34Apr 11, 2024Updated last year
- [WIP] Better (FP8) attention for Hopper☆32Feb 24, 2025Updated last year
- React window title bar for your electron applications☆12Mar 7, 2017Updated 9 years ago
- ☆16Jan 13, 2022Updated 4 years ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Jan 15, 2024Updated 2 years ago
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- Extract streaming data from text using prefix completion.☆10Oct 6, 2024Updated last year
- A set of scripts for using Oracle AI Database 26ai Free Container Image in Oracle Container Registry☆13Oct 15, 2025Updated 4 months ago
- an autonomous independent digital companion☆14Feb 12, 2026Updated 3 weeks ago
- ☆16Jun 1, 2025Updated 9 months ago
- A web interface for creating learning objectives based on Bloom's Taxonomy☆10Oct 27, 2020Updated 5 years ago
- Extensive time series analysis of chinese PM2.5 content, using models from ARMA and VAR to LSTMs and dynamic time warping clustering☆11Aug 17, 2019Updated 6 years ago
- A sample monorepo using turborepo, next.js, shadcn-ui, and tailwind to share components between multiple applications☆11Nov 30, 2023Updated 2 years ago
- Local LLM Testing & Benchmarking for Apple Silicon☆56Feb 26, 2026Updated last week
- Chrome Extension to capture captions of ongoing meetings by using webkitspeechrecognition api for all the web video conferencing platform…☆11Apr 22, 2023Updated 2 years ago
- Discord Docsbot, Built on bgent☆11Jun 17, 2024Updated last year
- ☆113Jul 23, 2025Updated 7 months ago
- ☆11Dec 22, 2024Updated last year
- ☆11Dec 9, 2025Updated 3 months ago
- Exploring how optimizations for GEMMs work☆28Feb 28, 2026Updated last week
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- A Robust Authentication and Authorization Server for FHIR Servers 🔥☆20May 12, 2025Updated 9 months ago
- A tokenizer for French☆14Apr 18, 2013Updated 12 years ago