zz1358m / SofT-GRPO-masterView external linksLinks
Code for the SofT-GRPO algorithm on the LLM soft-thinking reasoning pattern.
☆38Jan 2, 2026Updated last month
Alternatives and similar repositories for SofT-GRPO-master
Users that are interested in SofT-GRPO-master are comparing it to the libraries listed below
Sorting:
- Recursive Neural Tensor Networks☆11Feb 3, 2014Updated 12 years ago
- Easy Setup, File-based, Offline Capable Federated Learning and Computations☆21Updated this week
- STM32746G-DISCOVERY platform - GCC Makefile project templates and experiments☆11Dec 23, 2015Updated 10 years ago
- A Sega Saturn SCU DSP assembler for Linux, Windows, and macOS☆11Aug 10, 2025Updated 6 months ago
- Lupa for Torch☆10Sep 16, 2015Updated 10 years ago
- Basics of Embedded Audio Programming Tutorials☆14Apr 10, 2021Updated 4 years ago
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- LV2 port for the TAP (Tom's Audio Processing) plugins☆17Feb 18, 2024Updated last year
- A binaural audio unit.☆10Dec 8, 2014Updated 11 years ago
- ☆26Updated this week
- Audio tools for iOS and OS X - (ノಥ益ಥ)ノ ┻━┻☆10Oct 13, 2016Updated 9 years ago
- My collection of dotfiles☆15Updated this week
- Bindings to FFTW3☆10Feb 29, 2016Updated 9 years ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated 11 months ago
- ☆10Aug 31, 2021Updated 4 years ago
- ☆39Jan 27, 2026Updated 2 weeks ago
- Simple rules based grapheme to phoneme in Python☆11Sep 2, 2017Updated 8 years ago
- A simple sample app illustrating how to use opensl_stream.☆24Jul 16, 2013Updated 12 years ago
- Material for git workshop☆11Mar 13, 2018Updated 7 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 4 months ago
- A tool to view the total transactions, received, sent, and current balance of Bitcoin wallets 👁☆17Aug 19, 2025Updated 5 months ago
- ☆18Mar 17, 2025Updated 10 months ago
- A page describing how to ship torch binaries without sharing the source code of your scripts.☆17Nov 2, 2015Updated 10 years ago
- Code for Sufficient Input Subsets Paper☆14Mar 8, 2019Updated 6 years ago
- Reachy2 Unity package to mirror a real or fake robot's state☆19Jul 18, 2025Updated 6 months ago
- Convolutional REpresenations for Music Analysis☆12Jul 5, 2016Updated 9 years ago
- Reverb built in JUCE☆12May 5, 2017Updated 8 years ago
- Impulse Measurement Tools for Julia☆12Apr 3, 2020Updated 5 years ago
- PyTorch implementation of the paper Learning Multi-Level Representations for Hierarchical Music Structure Analysis presented at ISMIR 202…☆14Jan 2, 2023Updated 3 years ago
- Reinforcement Learning Project☆12Jan 16, 2017Updated 9 years ago
- Reinforcing General Reasoning without Verifiers☆97Jun 24, 2025Updated 7 months ago
- Mac setup and configuration via Ansible.☆13Feb 24, 2025Updated 11 months ago
- ☆42Sep 15, 2025Updated 5 months ago
- Hello Deep Learning☆16Apr 20, 2024Updated last year
- nanoGPT using Equinox☆15Mar 3, 2023Updated 2 years ago
- ☆19May 9, 2019Updated 6 years ago
- CUDA implementation of Wavelet KAN.☆16Jun 8, 2024Updated last year
- Fast Fourier Transform Frontend☆13Dec 11, 2013Updated 12 years ago
- PAVOQUE Corpus of Expressive Speech☆12Aug 2, 2016Updated 9 years ago