rkinas / reasoning_models_how_toLinks

This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest research, methodologies, and techniques for fine-tuning language models.

☆99

Alternatives and similar repositories for reasoning_models_how_to

Users that are interested in reasoning_models_how_to are comparing it to the libraries listed below

Sorting:

TrelisResearch / one-click-llms
One click templates for inferencing Language Models
☆195Updated last month
argilla-io / synthetic-data-generator
Build datasets using natural language
☆500Updated 2 months ago
abhishekkrthakur / chat-ext
chrome & firefox extension to chat with webpages: local llms
☆119Updated 6 months ago
microsoft / GRIN-MoE
GRadient-INformed MoE
☆263Updated 9 months ago
ibm-granite / granite-3.0-language-models
☆259Updated 3 weeks ago
huggingface / yourbench
🤗 Benchmark Large Language Models Reliably On Your Data
☆359Updated last week
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆242Updated 2 weeks ago
gabrielchua / daily-ai-papers
All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…
☆188Updated last week
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆232Updated 8 months ago
argilla-io / argilla-cookbook
Simple examples using Argilla tools to build AI
☆53Updated 8 months ago
Vaibhavs10 / gpu-poor-llm-notebooks
☆74Updated 9 months ago
lamini-ai / Lamini-Memory-Tuning
Banishing LLM Hallucinations Requires Rethinking Generalization
☆276Updated last year
cognitivecomputations / grokadamw
☆134Updated 11 months ago
menloresearch / ReZero
☆156Updated 3 months ago
huggingface / ai-blueprint
A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …
☆54Updated 5 months ago
AK391 / dailypapersHN
☆86Updated 9 months ago
AlexBodner / How_Much_VRAM
☆101Updated 10 months ago
aryagxr / cuda
coding CUDA everyday!
☆36Updated 2 months ago
tonywu71 / colpali-cookbooks
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆315Updated last month
osanseviero / hackerllama
My personal site
☆77Updated 11 months ago
QuixiAI / agi-memory
☆148Updated 3 weeks ago
YuvrajSingh-mist / SmolLlama
So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…
☆15Updated 3 months ago
teknium1 / ShareGPT-Builder
☆115Updated 7 months ago
AI4Privacy / LLM_STATUS_CODES
☆58Updated last year
deep-diver / llamaduo
[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
☆311Updated this week
huggingface / Google-Cloud-Containers
Hugging Face Deep Learning Containers (DLCs) for Google Cloud
☆150Updated 2 months ago
Pints-AI / 1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
☆318Updated 3 months ago
shivendrra / SmallLanguageModel
a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model
☆147Updated last year
gradio-app / trackio
A lightweight, local-first, and free experiment tracking Python library built on top of 🤗 Datasets and Spaces.
☆227Updated last week
ai8hyf / OpenResearchAssistant
An automated tool for discovering insights from research papaer corpora
☆138Updated last year