QuixiAI/grokadamw

☆137

Alternatives and similar repositories for grokadamw

Users who are interested in grokadamw are comparing it to the repositories listed below.

nisten / grokadamw
new optimizer
☆20 · Aug 4, 2024 · Updated last year

ironjr / grokfast
Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"
☆583 · Jun 28, 2024 · Updated 2 years ago

QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆240 · May 26, 2024 · Updated 2 years ago

pharaouk / dharma
☆13 · Apr 25, 2024 · Updated 2 years ago

arcee-ai / DAM
☆56 · Nov 6, 2024 · Updated last year

kjslag / spacebyte
A byte-level decoder architecture that matches the performance of tokenized Transformers.
☆67 · Apr 24, 2024 · Updated 2 years ago

QuixiAI / SystemChat
☆31 · Jul 5, 2024 · Updated 2 years ago

bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆202 · Dec 2, 2024 · Updated last year

QuixiAI / kraken
☆69 · May 26, 2024 · Updated 2 years ago

QuixiAI / spectrum
☆145 · Aug 20, 2025 · Updated 11 months ago

arcee-ai / EvolKit
EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…
☆257 · Oct 30, 2024 · Updated last year

catid / lllm
Latent Large Language Models
☆19 · Aug 24, 2024 · Updated last year

bdambrosio / AllTheWorldAPlay
All the world is a play, we are but actors in it.
☆51 · Jul 21, 2025 · Updated last year

SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆148 · Oct 31, 2024 · Updated last year

wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks (EMNLP'24)
☆145 · Sep 20, 2024 · Updated last year

zyushun / Adam-mini
Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793
☆457 · May 13, 2025 · Updated last year

maximzubkov / fft-scan
Efficient PScan implementation in PyTorch
☆17 · Jan 2, 2024 · Updated 2 years ago

fblgit / model-similarity
Simple Model Similarities Analysis
☆21 · Feb 3, 2024 · Updated 2 years ago

fishiatee / Tumera
Yet another frontend for LLM, written using .NET and WinUI 3
☆11 · Sep 14, 2025 · Updated 10 months ago

Sanster / padding_free_llm_train
☆16 · Feb 6, 2024 · Updated 2 years ago

QuixiAI / extract-expert
Extract a single expert from a Mixture Of Experts model using slerp interpolation.
☆19 · May 26, 2024 · Updated 2 years ago

emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21 · Jul 29, 2024 · Updated last year

microsoft / GRIN-MoE
GRadient-INformed MoE
☆264 · Sep 25, 2024 · Updated last year

Clybius / Personalized-Optimizers
A collection of niche / personally useful PyTorch optimizers with modified code.
☆28 · Apr 14, 2026 · Updated 3 months ago

Doriandarko / mlx-local-server
A tiny server to run local inference on MLX model in the style of OpenAI
☆13 · Jan 31, 2024 · Updated 2 years ago

allenai / hyper-task-descriptions
Learning adapter weights from task descriptions
☆20 · Nov 12, 2023 · Updated 2 years ago

apple / ml-cross-entropy
☆612 · Sep 23, 2025 · Updated 10 months ago

bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆145 · Jan 14, 2025 · Updated last year

VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆206 · Jul 17, 2024 · Updated 2 years ago

migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆221 · Apr 29, 2024 · Updated 2 years ago

arcee-ai / DistillKit
An Open Source Toolkit For LLM Distillation
☆992 · May 12, 2026 · Updated 2 months ago

swairshah / Intensify
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12 · Oct 11, 2024 · Updated last year

astramind-ai / BitMat
An efficient implementation of the method proposed in "The Era of 1-bit LLMs"
☆155 · Oct 15, 2024 · Updated last year

pau-mensa / xtr-warp-rs
High performance implementation of the WARP (SIGIR'25) retrieval engine.
☆36 · May 21, 2026 · Updated 2 months ago

d0rc / deepdive
Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…
☆45 · Aug 27, 2024 · Updated last year

catid / spectral_ssm
Implementation of Spectral State Space Models
☆16 · Feb 23, 2024 · Updated 2 years ago

planned-diffusion / planned-diffusion
☆20 · Nov 14, 2025 · Updated 8 months ago

shangshang-wang / Tora
Tora: Torchtune-LoRA for RL
☆87 · Dec 2, 2025 · Updated 7 months ago

cuhk-mobitec / HiQ-Robust-and-Fast-Decoding-of-High-Capacity-Color-QR-Codes
☆11 · May 10, 2019 · Updated 7 years ago