HarleyCoops / smolThinker-.5B
A Qwen .5B reasoning model trained on OpenR1-Math-220k
☆14Oct 11, 2025Updated 4 months ago
Alternatives and similar repositories for smolThinker-.5B
Users who are interested in smolThinker-.5B compare it to the repositories listed below.
- ☆37Aug 4, 2025Updated 6 months ago
- ☆38Aug 1, 2025Updated 6 months ago
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Dec 1, 2023Updated 2 years ago
- Source code and dataset for the paper 'Saamayik: A Benchmark and Dataset for English-Sanskrit Translation'☆15Oct 11, 2025Updated 4 months ago
- Numbeo Unofficial API☆15Oct 16, 2022Updated 3 years ago
- Primus-SaFE (Stability and Fault Endurance)☆50Updated this week
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆97Jan 25, 2026Updated 3 weeks ago
- Synthetic Data Generation with Execution-Based Verification and Grounding for LLM Training.☆19Feb 7, 2025Updated last year
- A user-friendly interface built on top of Thinking Machines Tinker API that lets you fine-tune LLMs, chat with your trained model, and de…☆26Jan 31, 2026Updated 2 weeks ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- Official pytorch implementation of "Tool-R1: Sample-Efficient Reinforcement Learning for Agentic Tool Use"☆20Sep 16, 2025Updated 5 months ago
- Reusable components for AI coding agents: skills, subagents, MCP servers, and extensions.☆26Feb 6, 2026Updated last week
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- ☆11Jul 21, 2024Updated last year
- 🎙️🤖 Create, customize, and talk to your AI character/companion in real time (all in one codebase!). Have a natural seamless conversation wi…☆12Apr 5, 2024Updated last year
- ☆14Dec 12, 2024Updated last year
- A collection of quantum machine learning basics, algorithms, study materials, projects, and other resources. Here you can get all the quantum machine learning basics, algorithms, study materials, projects and the descri…☆11Jan 4, 2018Updated 8 years ago
- Universal LLM security auditor with automated jailbreak testing, DSPy optimization, and OWASP 2025-aligned attack patterns☆21Oct 23, 2025Updated 3 months ago
- ☆10Updated this week
- Synthetic graph generator☆12Nov 7, 2023Updated 2 years ago
- ML from scratch in Jax☆12Aug 20, 2025Updated 5 months ago
- finetune script for SDXL adapted from waifu-diffusion trainer☆11Aug 21, 2023Updated 2 years ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated this week
- Config files for my GitHub profile.☆12Jul 18, 2024Updated last year
- Ultra-fast token & cost tracker for LLM Token Usage (e.g. Claude Code)☆34Updated this week
- A chaos engineering library for Elixir inspired by Netflix's Chaos Monkey☆20Feb 7, 2026Updated last week
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- ⚙️ Lightweight & smart Bun & Browser configuration loader.☆15Updated this week
- ☆11Jan 19, 2024Updated 2 years ago
- Developing a legal research tool leveraging ChatGPT / GPT-4☆14Mar 10, 2024Updated last year
- Official implementation: Large Language Models are Interpretable Learners - Google☆13Jun 29, 2024Updated last year
- Official Implementation of Knowledge Flow Prompting☆35Oct 20, 2025Updated 3 months ago
- Code for our ICLR Trustworthy ML 2020 workshop paper "Improved Image Wasserstein Attacks and Defenses"☆14Apr 28, 2020Updated 5 years ago
- ☆12Updated this week
- Auto start/attach tmux session with consistent session names☆10Oct 29, 2021Updated 4 years ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Sep 19, 2023Updated 2 years ago
- Using fourier interpolation to merge large language models☆11Jan 6, 2026Updated last month
- Generates captions for user-supplied images using prompt engineering and generative AI☆10Aug 24, 2024Updated last year