rkinas / reasoning_models_how_to
This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest research, methodologies, and techniques for fine-tuning language models.
☆127 Updated 6 months ago
Alternatives and similar repositories for reasoning_models_how_to
Users who are interested in reasoning_models_how_to are comparing it to the repositories listed below.
- One click templates for inferencing Language Models ☆227 Updated 2 months ago
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model ☆167 Updated last year
- chrome & firefox extension to chat with webpages: local llms ☆131 Updated last year
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https… ☆211 Updated 3 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆279 Updated 6 months ago
- ☆269 Updated 7 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆425 Updated last month
- Utils for Unsloth https://github.com/unslothai/unsloth ☆187 Updated last week
- Testing LLM reasoning abilities with family relationship quizzes. ☆63 Updated last year
- A compact LLM pretrained in 9 days by using high quality data ☆340 Updated 9 months ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset… ☆16 Updated 10 months ago
- ☆242 Updated 4 months ago
- ☆158 Updated 9 months ago
- ☆109 Updated 7 months ago
- ☆182 Updated 2 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆294 Updated 10 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆496 Updated 5 months ago
- ☆121 Updated 3 weeks ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆85 Updated 5 months ago
- Various installation guides for Large Language Models ☆77 Updated 9 months ago
- Simple examples using Argilla tools to build AI ☆57 Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated last year
- GRadient-INformed MoE ☆264 Updated last year
- ☆75 Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs ☆314 Updated 6 months ago
- Exploring Applications of GRPO ☆251 Updated 5 months ago
- minimal GRPO implementation from scratch ☆102 Updated 10 months ago
- Train LLM on Hugging Face infra ☆67 Updated 2 months ago
- Build datasets using natural language ☆559 Updated 4 months ago
- ☆140 Updated 5 months ago