rkinas / reasoning_models_how_to
This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest research, methodologies, and techniques for fine-tuning language models.
☆119 Updated 4 months ago
Alternatives and similar repositories for reasoning_models_how_to
Users who are interested in reasoning_models_how_to are comparing it to the libraries listed below.
- ☆266 Updated 5 months ago
- ☆692 Updated 7 months ago
- One click templates for inferencing Language Models ☆220 Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of models ☆274 Updated 4 months ago
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https… ☆209 Updated last month
- GRadient-INformed MoE ☆264 Updated last year
- Complete implementation of Llama2 with/without KV cache & inference 🚀 ☆48 Updated last year
- Utils for Unsloth https://github.com/unslothai/unsloth ☆173 Updated this week
- Simple & Scalable Pretraining for Neural Architecture Research ☆302 Updated last month
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆412 Updated this week
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆479 Updated 3 months ago
- Various installation guides for Large Language Models ☆77 Updated 7 months ago
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- ☆158 Updated 7 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳 ☆343 Updated 5 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆84 Updated 3 months ago
- Let's build better datasets, together! ☆265 Updated 11 months ago
- ☆46 Updated 8 months ago
- ☆182 Updated 3 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆275 Updated last year
- Verifiers for LLM Reinforcement Learning ☆78 Updated 2 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆451 Updated last year
- Exploring Applications of GRPO ☆249 Updated 3 months ago
- Testing LLM reasoning abilities with family relationship quizzes. ☆63 Updated 10 months ago
- Build your own visual reasoning model ☆414 Updated last week
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model ☆164 Updated last year
- ☆86 Updated last year
- ☆207 Updated last year
- A compact LLM pretrained in 9 days by using high quality data ☆334 Updated 7 months ago
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!" ☆141 Updated last week