☆22Nov 9, 2024Updated last year
Alternatives and similar repositories for SOAP
Users that are interested in SOAP are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- code for training and using chess embeddings models☆14Jun 9, 2024Updated 2 years ago
- Efficient optimizers☆334Jun 21, 2026Updated last week
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆61Feb 21, 2022Updated 4 years ago
- ☆11Oct 13, 2023Updated 2 years ago
- ☆52Jan 28, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.☆14Nov 17, 2023Updated 2 years ago
- ☆16Jul 8, 2024Updated last year
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated 2 years ago
- (IA)^3 for Stable Diffusion☆35Apr 2, 2023Updated 3 years ago
- Fluid Language Model Benchmarking☆30Sep 16, 2025Updated 9 months ago
- Official Code for "Relative Entropy Pathwise Policy Optimization"☆57May 6, 2026Updated last month
- This repository includes various baseline techniques for label-free model evaluation task for the VDU2023 competition.☆19Mar 8, 2023Updated 3 years ago
- Code publication to the paper "Normalized Attention Without Probability Cage"☆17Nov 9, 2021Updated 4 years ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆18Dec 2, 2024Updated last year
- About Code release for "FlashBias: Fast Computation of Attention with Bias" (NeurIPS 2025), https://arxiv.org/abs/2505.12044☆29Nov 17, 2025Updated 7 months ago
- fast opus bindings for node and browsers☆15Feb 11, 2024Updated 2 years ago
- Collection of ASR models for English TFLite models for faster inference.☆14Feb 21, 2022Updated 4 years ago
- Alignment examples for Interspeech 2024☆27Jul 5, 2024Updated last year
- A frontend for your PDS☆25Oct 20, 2025Updated 8 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆30Jul 24, 2025Updated 11 months ago
- For optimization algorithm research and development.☆577May 6, 2026Updated last month
- Experiments to assess SPADE on different LLM pipelines.☆17Apr 7, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Parser-combinator in Go☆10Jul 19, 2020Updated 5 years ago
- A package for accurately computing running (online) mean, variance, and standard deviation in golang☆11Oct 16, 2022Updated 3 years ago
- Notebooks of experiences with the fastai library☆11Oct 13, 2018Updated 7 years ago
- A flexible tool for the multi-resolution localization of causal variants across the genome☆11Feb 19, 2021Updated 5 years ago
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆92Oct 30, 2024Updated last year
- Import Nix projects regardless of how they are exposed.☆34Nov 9, 2025Updated 7 months ago
- ☆14Jan 10, 2026Updated 5 months ago
- ☆16Nov 24, 2025Updated 7 months ago
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- tabix file access with golang using biogo machinery☆10Jul 1, 2025Updated last year
- Training HuggingFace models using fastai☆11Jul 22, 2021Updated 4 years ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Nov 11, 2024Updated last year
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆34Oct 8, 2025Updated 8 months ago
- A spatial terminal multiplexer for macOS. Terminals live on an infinite canvas that you can pan, zoom, and arrange freely.☆42Apr 2, 2026Updated 2 months ago
- Statistical modeling in Go☆14May 19, 2021Updated 5 years ago
- ☆15Apr 8, 2025Updated last year