Code for the SofT-GRPO algorithm on the LLM soft-thinking reasoning pattern.
☆48Jan 2, 2026Updated 3 months ago
Alternatives and similar repositories for SofT-GRPO-master
Users that are interested in SofT-GRPO-master are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- My collection of dotfiles☆14Apr 22, 2026Updated last week
- Enemies for your LLM☆35Jan 20, 2026Updated 3 months ago
- ☆19Jan 29, 2026Updated 3 months ago
- Modern utility library and typescript typings for building JSON Schema documents☆14Nov 28, 2025Updated 5 months ago
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Hello Deep Learning☆16Apr 20, 2024Updated 2 years ago
- Efficient Long-context Language Model Training by Core Attention Disaggregation☆98Apr 7, 2026Updated 3 weeks ago
- A DSPy Adapter for exact-fidelity prompt templates with full control over messages.☆45Feb 23, 2026Updated 2 months ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆36Oct 3, 2025Updated 6 months ago
- OpenROAD Agent. This repository contain the model to train and testing the model using EDA Corpus dataset.☆28Jul 24, 2025Updated 9 months ago
- Recursive Neural Tensor Networks☆11Feb 3, 2014Updated 12 years ago
- A collection of MATLAB coding rules and guidelines optimized for use with AI coding assistants like Cursor, Windsurf, Claude Code, and Gi…☆32Jan 19, 2026Updated 3 months ago
- ☆21Jan 2, 2026Updated 3 months ago
- Lupa for Torch☆10Sep 16, 2015Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Example repo showcasing model training and deployment with distil claude cli skill☆56Jan 19, 2026Updated 3 months ago
- ☆17Sep 4, 2023Updated 2 years ago
- ⚔️ [ICLR 2026] Official code of "Search Arena: Analyzing Search-Augmented LLMs".☆56Feb 23, 2026Updated 2 months ago
- Simple rules based grapheme to phoneme in Python☆11Sep 2, 2017Updated 8 years ago
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu…☆29Jul 27, 2025Updated 9 months ago
- Ops files for https//github.com/meta-llama/llama-stack☆17Jun 28, 2025Updated 10 months ago
- An ontology of imaging and related techniques and technologies, image processing and analysis, image data and formats, within bio- and ot…☆12Oct 26, 2025Updated 6 months ago
- Links to recourses for the Lean Theorem Prover☆12Dec 3, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆14Apr 8, 2026Updated 3 weeks ago
- Sister project to OpenLLMetry, but in Ruby. Open-source observability for your LLM application, based on OpenTelemetry☆14Apr 6, 2026Updated 3 weeks ago
- Bindings to FFTW3☆10Feb 29, 2016Updated 10 years ago
- A page describing how to ship torch binaries without sharing the source code of your scripts.☆17Nov 2, 2015Updated 10 years ago
- RCS Business Messaging upgrades SMS with branding, rich media, interactivity, and analytics. With RCS, businesses can bring branded, inte…☆13Apr 23, 2026Updated last week
- A Sega Saturn SCU DSP assembler for Linux, Windows, and macOS☆11Aug 10, 2025Updated 8 months ago
- Fast IRLS code for solving p-norm regression problems☆14Feb 2, 2020Updated 6 years ago
- ☆28Apr 10, 2026Updated 2 weeks ago
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆34Oct 13, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Audio tools for iOS and OS X - (ノಥ益ಥ)ノ ┻━┻☆10Oct 13, 2016Updated 9 years ago
- A simple sample app illustrating how to use opensl_stream.☆24Jul 16, 2013Updated 12 years ago
- Repository containing necessary files to run a server able to run Webots simulation☆12Apr 15, 2026Updated 2 weeks ago
- The MMFT ISO Designer is a tool that validates and generates microfluidic chip designs conforming to the ISO 22916 standard.☆15Mar 26, 2026Updated last month
- Code for testing DCT plus Sparse (DCTpS) networks☆14Jun 15, 2021Updated 4 years ago
- Convolutional REpresenations for Music Analysis☆12Jul 5, 2016Updated 9 years ago
- OpenTelemetry Browser SDK and instrumentation☆33Apr 23, 2026Updated last week