Shekswess/tiny-reasoning-language-model

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Shekswess/tiny-reasoning-language-model)

Shekswess / tiny-reasoning-language-model

Code repository dedicated to experimenting and research with tiny reasoning language model

☆52

Alternatives and similar repositories for tiny-reasoning-language-model

Users that are interested in tiny-reasoning-language-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Shekswess / ai-project-template
View on GitHub
This is the code repository for the AI project template. The idea of this template is to have a code framework prepared for any AI/ML/MLO…
☆44Jan 26, 2026Updated 6 months ago
Zinxira / tlvmc-parkinsons-fog-prediction-4th-place-solution
View on GitHub
☆11Aug 3, 2023Updated 2 years ago
shangshang-wang / Tora
View on GitHub
Tora: Torchtune-LoRA for RL
☆87Dec 2, 2025Updated 7 months ago
tengxiao1 / MR-Search
View on GitHub
Meta-Reinforcement Learning with Self-Reflection
☆33Mar 26, 2026Updated 4 months ago
dynamiq-ai / search-gpt
View on GitHub
Open-source Search GPT engine
☆21Nov 4, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
stockeh / mlx-trm
View on GitHub
MLX Implementation of Recursive Reasoning with Tiny Networks
☆79Oct 11, 2025Updated 9 months ago
syjmelody / RankE
View on GitHub
Implementation of RankE: End-to-End Discrete Text-to-Image Post-Training via Rank-Consistent Alignment
☆22May 27, 2026Updated 2 months ago
Lossfunk / KernelBench-v2
View on GitHub
KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems
☆24Jul 4, 2025Updated last year
Shekswess / small-language-model-rags-is-all-you-need
View on GitHub
A code repository for the project called small-language-model-rags-is-all-you-need
☆17Sep 28, 2024Updated last year
brendanhogan / 2025_advent_of_small_ml
View on GitHub
☆22Dec 24, 2025Updated 7 months ago
vincentamato / mlx-esm-2
View on GitHub
An MLX implementation of Meta AI's ESM-2 protein language model
☆16Aug 16, 2025Updated 11 months ago
thu-coai / CROPI
View on GitHub
[ACL'26] Official Repository for for paper "Data-Efficient RLVR via Off-Policy Influence Guidance"
☆25Updated this week
bcml-labs / rosa-plus
View on GitHub
ROSA+: RWKV's ROSA implementation with fallback statistical predictor
☆36Oct 13, 2025Updated 9 months ago
lodeguns / Solids-classification-3D-CNN-3D-GradCam
View on GitHub
Here we introduce the problem of 3D solids classification with a CNN (spheres and octahedra). We implemented a 3D GradCam model, in orde…
☆11Nov 25, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
kurakurai / Luth
View on GitHub
Luth is a state-of-the-art series of fine-tuned LLMs for French
☆46Oct 12, 2025Updated 9 months ago
sfeucht / footprints
View on GitHub
https://footprints.baulab.info
☆17Oct 4, 2024Updated last year
jannatul-fredaues / CropLens-AI
View on GitHub
Deep learning-based flower classification system using CNN architectures for image recognition and classification. Trained and evaluated …
☆19Jul 18, 2026Updated last week
kethan-1818 / 5G-channel-modulation-using-RL
View on GitHub
I have developed a custom environment using OpenAI Gym in Python for simulating a 5G wireless communication channel as part of a reinforc…
☆14Mar 27, 2024Updated 2 years ago
BaohaoLiao / ApiQ
View on GitHub
[EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs
☆15Jul 18, 2024Updated 2 years ago
jinhangzhan / RL_Heals_SFT
View on GitHub
☆21Mar 22, 2026Updated 4 months ago
mcleish7 / retrofitting-recurrence
View on GitHub
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
☆68Nov 11, 2025Updated 8 months ago
Westlake-AI / AutoMix
View on GitHub
[ECCV 2022 Oral] AutoMix: Unveiling the Power of Mixup for Stronger Classifiers
☆18Apr 25, 2023Updated 3 years ago
ifding / seq2seq-pytorch
View on GitHub
Sequence to Sequence Models with PyTorch
☆27May 12, 2018Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
jsikyoon / V-MPO_torch
View on GitHub
V-MPO torch version with DMLab30 and GTrXL
☆13Mar 1, 2021Updated 5 years ago
lm-playpen / playpen
View on GitHub
All you need to get started with the LM Playpen Environment for Learning in Interaction.
☆16Jun 22, 2026Updated last month
KhawajaAbaid / micrograd_c
View on GitHub
Andrej Kapathy's micrograd implemented in c
☆30Aug 7, 2024Updated last year
Sivalavida / Text-based-Industry-Classification
View on GitHub
Using NLP techniques to classify companies according to their descriptions
☆13Nov 12, 2020Updated 5 years ago
JinjieNi / dlms-are-super-data-learners
View on GitHub
The official github repo for "Diffusion Language Models are Super Data Learners".
☆227Nov 6, 2025Updated 8 months ago
seanys / Use-Modified-Penetration-Depth-and-Guided-Search-to-Solve-Nesting-Problem
View on GitHub
☆15Aug 14, 2020Updated 5 years ago
joey00072 / microjax
View on GitHub
Jax like function transformation engine but micro, microjax
☆34Oct 25, 2024Updated last year
tmabraham / fastai_tpu
View on GitHub
TPU support for the fastai library
☆14Apr 15, 2021Updated 5 years ago
amazon-science / Self-Aligned-Reward-Towards_Effective_and_Efficient_Reasoners
View on GitHub
☆21Apr 21, 2026Updated 3 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AMD-AGI / Magpie
View on GitHub
A lightweight, general-purpose framework for evaluating GPU kernel and benchmark.
☆57Updated this week
Improbable-AI / orso
View on GitHub
☆18Feb 22, 2025Updated last year
sidmohan0 / tesserack
View on GitHub
Compiling strategy guides into reward functions for reinforcement learning. Uses Claude Vision to extract unit tests from game guides, …
☆37Jan 30, 2026Updated 5 months ago
mkurman / synthlabs
View on GitHub
Create synthetic datasets from scratch using AI-powered generation. Define topics, customize prompts, and generate high-quality reasoning…
☆32Updated this week
Indemos / Distribution
View on GitHub
General purpose virtual actor framework for peer-to-peer microservices or in-process communication within the same app with possible exte…
☆14Jun 11, 2025Updated last year
ericyuegu / hal
View on GitHub
Training AI for Super Smash Bros. Melee
☆37Jul 17, 2026Updated last week
huggingface / finephrase
View on GitHub
Synthetic pretraining data by rephrasing the web
☆25Jun 5, 2026Updated last month