☆20Jul 19, 2024Updated last year
Alternatives and similar repositories for optiacts
Users that are interested in optiacts are comparing it to the libraries listed below
- A neural network training framework within a task-based parallel programming paradigm☆55Mar 12, 2026Updated last week
- Models and code for the ICLR 2020 workshop paper "Towards Understanding Normalization in Neural ODEs"☆16Apr 27, 2020Updated 5 years ago
- Fully customizable Classifier-Free Guidance for ComfyUI☆15Jul 14, 2024Updated last year
- Clustered Compositional Embeddings☆11Oct 25, 2023Updated 2 years ago
- Accompanying code for the paper Performance of Hyperbolic Geometry Models on Top-N Recommendation Tasks, accepted at ACM RecSys 2020.☆36Aug 20, 2020Updated 5 years ago
- Project website☆19Aug 25, 2024Updated last year
- XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning (ICLR 2025)☆82Feb 13, 2025Updated last year
- stochastic bfloat16 based optimizer library☆21Dec 4, 2024Updated last year
- Compression schema for gradients of activations in backward pass☆45Jul 26, 2023Updated 2 years ago
- Catalyst.RL: A Distributed Framework for Reproducible RL Research☆39Mar 17, 2019Updated 7 years ago
- Code for MSID, a Multi-Scale Intrinsic Distance for comparing generative models, studying neural networks, and more!☆52May 29, 2019Updated 6 years ago
- ☆31Sep 23, 2024Updated last year
- Presentations of the advanced topics in optimization☆11Oct 30, 2019Updated 6 years ago
- ☆10Aug 5, 2020Updated 5 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- Bunch of notebooks for pre-training custom Saiga-like LLM☆12Feb 9, 2024Updated 2 years ago
- Deep Learning Course, Skoltech, 2024☆16Jun 12, 2024Updated last year
- A language model project for morphemic analysis, segmentation, and tokenization of Russian words.☆17Jan 10, 2025Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Nov 1, 2024Updated last year
- ☆21Jun 4, 2024Updated last year
- 📄Small Batch Size Training for Language Models☆80Oct 4, 2025Updated 5 months ago
- ☆12Nov 28, 2015Updated 10 years ago
- Vintix: Action Model via In-Context Reinforcement Learning (ICML 2025)☆45May 23, 2025Updated 9 months ago
- ComfyUI node to apply the ResAdapter Unet patch for SD1.5 models☆31Feb 27, 2025Updated last year
- Faster and Lighter LoRA Implementations☆13Nov 21, 2024Updated last year
- ☆71Aug 27, 2024Updated last year
- ☆15Mar 24, 2019Updated 6 years ago
- Automating the Design of Multigrid Methods with Evolutionary Program Synthesis☆13Feb 11, 2025Updated last year
- ☆14Oct 3, 2018Updated 7 years ago
- Neural Potential Field for Obstacle-Aware Local Motion Planning☆22Jun 2, 2024Updated last year
- AI-generated text boundary detection with RoFT☆25Sep 9, 2024Updated last year
- Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...☆14May 28, 2025Updated 9 months ago
- FastAPI wrapper for LLM, a fork of oobabooga/text-generation-webui☆10Jun 1, 2023Updated 2 years ago
- MUSCO: MUlti-Stage COmpression of neural networks☆72Feb 16, 2021Updated 5 years ago
- Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)☆32Jun 14, 2025Updated 9 months ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 10 months ago
- Deep learning lectures I am holding for the MSc on Data Science and Scientific Computing☆15Jul 2, 2022Updated 3 years ago
- Spectral Neural Operator☆80Dec 20, 2023Updated 2 years ago
- Public repository for managing Grid Platform documentation synced with gitbook on docs.grid.ai☆20Aug 4, 2022Updated 3 years ago