This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest research, methodologies, and techniques for fine-tuning language models.
☆132Jul 28, 2025Updated 7 months ago
Alternatives and similar repositories for reasoning_models_how_to
Users that are interested in reasoning_models_how_to are comparing it to the libraries listed below
Sorting:
- Enterprise-grade AI Detection and Response platform with real-time monitoring and configuration management for AI Agents and Large Langua…☆33Dec 14, 2025Updated 2 months ago
- Computational Neuroscience stuff☆13Aug 12, 2019Updated 6 years ago
- An MCP server providing intelligent transcript processing capabilities, featuring natural formatting, contextual repair, and smart summar…☆19Mar 14, 2025Updated 11 months ago
- story based implementation for sequential thinking☆15Dec 15, 2025Updated 2 months ago
- A Python implementation of the Sequential Thinking MCP server using the official Model Context Protocol (MCP) Python SDK. This server fac…☆24Jun 1, 2025Updated 9 months ago
- ☆18Apr 18, 2025Updated 10 months ago
- A TypeScript Model Context Protocol (MCP) server to allow LLMs to programmatically construct mind maps to explore an idea space, with enf…☆26Mar 23, 2025Updated 11 months ago
- Deploy and scale Large Language Models (LLMs) in production.☆39Jul 20, 2024Updated last year
- ☆11Updated this week
- Simple ideas to compare Agentic Coding Tools☆38Jun 29, 2025Updated 8 months ago
- MCP DeepResearch Server: 基于 LangGraph + Ollama + Tavily 的深度研究服务器,支持异步运行、超时控制与进度推送☆31Jun 16, 2025Updated 8 months ago
- Automatically split long webpage screenshots into chunks for input into models with shorter contexts. 自动将长网页截图进行区块分割,用于输入上下文较短的模型☆27Oct 25, 2025Updated 4 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Dec 12, 2025Updated 2 months ago
- Exercises Galois theory D. Cox☆12Jun 29, 2023Updated 2 years ago
- Your self-hosted AI assistant. Interactive Linux Shell, Files and Folders analysis. Powered by Ollama.☆37Updated this week
- Embed your LLM into a python function☆22Jan 9, 2025Updated last year
- A comprehensive React Native starter template built with Expo. It includes reusable UI components, Poppins font setup, NativeWind, Fireba…☆23Updated this week
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆34Apr 5, 2025Updated 11 months ago
- examples and guides to using Nomic Atlas☆37Apr 18, 2025Updated 10 months ago
- AuraMatrix is personality analysis web which using llm to do evaluation. I have made this for Gyanotsav-2025 to show different ways to ut…☆11Dec 22, 2025Updated 2 months ago
- Structured TRIZ prompt engineering for LLMs in an open, portable XML format – MIT licensed.☆16Nov 11, 2025Updated 3 months ago
- Build internal agents with just backend code.☆39Aug 25, 2025Updated 6 months ago
- ☆26Feb 28, 2026Updated last week
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- ☆32Feb 2, 2025Updated last year
- any4any是一个企业级多模态AI平台,提供完整的智能交互解决方案。集成了大语言模型对话、数字人系统、智能SQL查询、语音处理、知识库系统等核心功能,支持OpenAI兼容API接口,可无缝集成到各类AI应用中。☆61Nov 10, 2025Updated 3 months ago
- ☆42Jan 6, 2025Updated last year
- Python library providing a Polars DataFrame interface for easy and intuitive access to the Bloomberg API☆18Jan 9, 2026Updated 2 months ago
- ☆28Dec 4, 2025Updated 3 months ago
- Curated list of awesome tools, demos, docs for ChatGPT and GPT-3☆12Mar 28, 2023Updated 2 years ago
- VibEx (vx) is a developer-friendly CLI tool that streamlines the process of working with AI coding assistants. It helps developers prepar…☆29May 17, 2025Updated 9 months ago
- Glitch Gremlin AI☆15Apr 5, 2025Updated 11 months ago
- an auto coder which automatically fixes errors and improves the code from simple user prompt☆37Dec 27, 2024Updated last year
- CoachLint is your AI coding coach. It guides you through errors instead of just solving them for you.☆23Nov 20, 2025Updated 3 months ago
- MAGELLAN: Metacognitive predictions of learning progress guide autotelic LLM agents in large goal spaces☆10Mar 24, 2025Updated 11 months ago
- ☆11Aug 29, 2025Updated 6 months ago
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated 2 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆28Feb 13, 2026Updated 3 weeks ago