rkinas / reasoning_models_how_to
This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest research, methodologies, and techniques for fine-tuning language models.
☆114 Updated 2 months ago
Alternatives and similar repositories for reasoning_models_how_to
Users who are interested in reasoning_models_how_to are comparing it to the repositories listed below.
- One click templates for inferencing Language Models ☆213 Updated 2 months ago
- ☆264 Updated 3 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆404 Updated 2 weeks ago
- ☆136 Updated last year
- ☆86 Updated last year
- GRadient-INformed MoE ☆264 Updated last year
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆271 Updated 3 months ago
- ☆22 Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆459 Updated last month
- ☆157 Updated 6 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆275 Updated last year
- chrome & firefox extension to chat with webpages: local llms ☆127 Updated 10 months ago
- ☆75 Updated last year
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https… ☆201 Updated this week
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs … ☆60 Updated 8 months ago
- ☆160 Updated 3 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆172 Updated 9 months ago
- ☆46 Updated 6 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated last month
- ☆177 Updated 2 months ago
- Simple examples using Argilla tools to build AI ☆56 Updated 11 months ago
- Utils for Unsloth https://github.com/unslothai/unsloth ☆155 Updated 2 weeks ago
- Collection of resources for RL and Reasoning ☆26 Updated 8 months ago
- Build datasets using natural language ☆532 Updated last month
- ☆136 Updated last month
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆48 Updated last year
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati… ☆46 Updated 5 months ago
- Exploring Applications of GRPO ☆248 Updated last month
- ☆45 Updated 4 months ago