☆22Nov 9, 2024Updated last year
Alternatives and similar repositories for SOAP
Users that are interested in SOAP are comparing it to the libraries listed below
Sorting:
- Efficient optimizers☆285Dec 20, 2025Updated 2 months ago
- sigma-MoE layer☆21Jan 5, 2024Updated 2 years ago
- Fluid Language Model Benchmarking☆26Sep 16, 2025Updated 5 months ago
- Experiments to assess SPADE on different LLM pipelines.☆17Apr 7, 2024Updated last year
- LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence☆61Feb 21, 2022Updated 4 years ago
- This repository includes various baseline techniques for label-free model evaluation task for the VDU2023 competition.☆19Mar 8, 2023Updated 2 years ago
- We introduce the LLAMA1 Test Set, a comprehensive open-domain world knowledge QA dataset for evaluating question-answering systems. We pr…☆23Mar 14, 2024Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 7 months ago
- ☆11Oct 13, 2023Updated 2 years ago
- This repo is built to facilitate the training and analysis of autoregressive transformers on maze-solving tasks.☆34Oct 28, 2025Updated 4 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Oct 8, 2025Updated 4 months ago
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- ☆252Dec 2, 2024Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆91Oct 30, 2024Updated last year
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆36Oct 31, 2024Updated last year
- Neural Networks for JAX☆84Sep 24, 2024Updated last year
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax"☆87Sep 12, 2025Updated 5 months ago
- A mod that enables AI to play the game TowerFall Ascension.☆14Aug 22, 2023Updated 2 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- This repository contains the Parasol processor, which enables next-generation privacy preserving applications. Users can run arbitrary co…☆11Updated this week
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆56Feb 20, 2026Updated last week
- Code for the paper Don't Pay Attention☆52Sep 25, 2025Updated 5 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆94Nov 17, 2024Updated last year
- Rocket League pretraining from replay files☆36Apr 22, 2025Updated 10 months ago
- A virtual musical instrument built using Google MediaPipe.☆12Oct 10, 2022Updated 3 years ago
- GPUCorrel : A GPU accelerated Digital Image Correlation Software written in Python. To cite this Original Software Publication: https://w…☆11Sep 9, 2021Updated 4 years ago
- Training Neural Network with Particle Swarm Optimization☆14Jan 10, 2019Updated 7 years ago
- ☆16Jun 10, 2024Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 4 months ago
- Real-Time RTUs☆11Jan 2, 2025Updated last year
- Python platform for parallel Surrogate-Based Optimization☆12Nov 27, 2024Updated last year
- ☆15Sep 7, 2025Updated 5 months ago
- Machine Learning for Mathematical Formalization☆11Jul 20, 2024Updated last year
- A PyTorch Implementation of DF-GAN☆10Mar 26, 2022Updated 3 years ago
- Direct transcription of an optimal control problem and resolution☆12Updated this week
- ☆10Jun 21, 2024Updated last year
- For optimization algorithm research and development.☆558Updated this week
- imgsys backend☆49Jun 23, 2024Updated last year