ClashLuke/SOAP

☆22

Alternatives and similar repositories for SOAP

Users who are interested in SOAP are comparing it to the repositories listed below.

UCL-COMP0233-2022-2023 / RSE-Classwork
☆11 · Oct 13, 2023 · Updated 2 years ago
RobertCsordas / moe_layer
sigma-MoE layer
☆21 · Jan 5, 2024 · Updated 2 years ago
sustcsonglin / mamba-triton
☆52 · Jan 28, 2024 · Updated 2 years ago
BlinkDL / SmallInitEmb
LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence
☆61 · Feb 21, 2022 · Updated 4 years ago
PASSIONLab / distributed_sddmm
Distributed SDDMM Kernel
☆12 · Jul 8, 2022 · Updated 4 years ago
alif-munim / minOFT
A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.
☆14 · Nov 17, 2023 · Updated 2 years ago
benfoxall / scrub
Video scrubbing with WebCodecs
☆15 · Nov 4, 2025 · Updated 8 months ago
renll / SparseLT
[EMNLP 2022] Language Model Pre-Training with Sparse Latent Typing
☆14 · Feb 10, 2023 · Updated 3 years ago
google-research-datasets / LLAMA1-Test-Set
We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…
☆23 · Mar 14, 2024 · Updated 2 years ago
HomebrewML / HeavyBall
Efficient optimizers
☆335 · Jul 11, 2026 · Updated last week
tripplyons / sd-ia3
(IA)^3 for Stable Diffusion
☆34 · Apr 2, 2023 · Updated 3 years ago
rimads / avey-dpa
Code for the paper Don't Pay Attention
☆59 · Sep 25, 2025 · Updated 9 months ago
rwightman / gemma4_pytorch_claude
Standalone Gemma 4 PyTorch Model using Claude Code
☆15 · Apr 13, 2026 · Updated 3 months ago
OliverRichter / normalized-attention
Code publication to the paper "Normalized Attention Without Probability Cage"
☆17 · Nov 9, 2021 · Updated 4 years ago
iPieter / llmq
A Scheduler for Batched LLM Inference
☆19 · Oct 5, 2025 · Updated 9 months ago
sleepingcat4 / Sophia
replacement of AdamW and Lion optimizer for LLMs
☆13 · May 28, 2023 · Updated 3 years ago
evanwashere / opus
fast opus bindings for node and browsers
☆15 · Feb 11, 2024 · Updated 2 years ago
sekstini / gpupoor
☆18 · Dec 2, 2024 · Updated last year
neso613 / ASR_TFLite
Collection of ASR models for English TFLite models for faster inference.
☆14 · Feb 21, 2022 · Updated 4 years ago
hyama5 / vae_align
Alignment examples for Interspeech 2024
☆28 · Jul 5, 2024 · Updated 2 years ago
facebookresearch / optimizers
For optimization algorithm research and development.
☆578 · Updated this week
socialfoundations / benchbench
BenchBench is a Python package to evaluate multi-task benchmarks.
☆23 · Oct 12, 2025 · Updated 9 months ago
shreyashankar / spade-experiments
Experiments to assess SPADE on different LLM pipelines.
☆17 · Apr 7, 2024 · Updated 2 years ago
Snowflake-Labs / vllm
☆16 · Nov 24, 2025 · Updated 7 months ago
flowersteam / vivarium
Multi-agent simulator in Jax for research and teaching in AI & ALife
☆31 · Apr 11, 2026 · Updated 3 months ago
solemnwarning / kexec-loader
☆14 · Jan 10, 2026 · Updated 6 months ago
ethansmith2000 / fsdp_optimizers
supporting pytorch FSDP for optimizers
☆84 · Dec 8, 2024 · Updated last year
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆93 · Oct 30, 2024 · Updated last year
kyegomez / FastFF
Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"
☆16 · Nov 11, 2024 · Updated last year
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆34 · Oct 8, 2025 · Updated 9 months ago
oxalica / ghoti-shell
☆15 · Apr 8, 2025 · Updated last year
yeus / syntexmex
Syntexmex plugin for blender
☆16 · Mar 28, 2020 · Updated 6 years ago
eholk / rust-stl
Stereo lithography file support for Rust.
☆12 · Jul 29, 2023 · Updated 2 years ago
emalach / LinearLM
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21 · Jul 29, 2024 · Updated last year
HazyResearch / based
Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"
☆256 · Jun 6, 2025 · Updated last year
nikhilvyas / SOAP
☆273 · Dec 2, 2024 · Updated last year
Umikaze-job / All-In-LoRA
☆14 · Jan 27, 2024 · Updated 2 years ago
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆198 · Jan 7, 2026 · Updated 6 months ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆49 · Jan 27, 2022 · Updated 4 years ago