rkinas / reasoning_models_how_toLinks
This repository serves as a collection of research notes and resources on training large language models (LLMs) and Reinforcement Learning from Human Feedback (RLHF). It focuses on the latest research, methodologies, and techniques for fine-tuning language models.
☆99Updated 3 weeks ago
Alternatives and similar repositories for reasoning_models_how_to
Users that are interested in reasoning_models_how_to are comparing it to the libraries listed below
Sorting:
- One click templates for inferencing Language Models☆195Updated last month
- Build datasets using natural language☆500Updated 2 months ago
- chrome & firefox extension to chat with webpages: local llms☆119Updated 6 months ago
- GRadient-INformed MoE☆263Updated 9 months ago
- ☆259Updated 3 weeks ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆359Updated last week
- Inference, Fine Tuning and many more recipes with Gemma family of models☆242Updated 2 weeks ago
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https…☆188Updated last week
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆232Updated 8 months ago
- Simple examples using Argilla tools to build AI☆53Updated 8 months ago
- ☆74Updated 9 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- ☆134Updated 11 months ago
- ☆156Updated 3 months ago
- A blueprint for AI development, focusing on applied examples of RAG, information extraction, analysis and fine-tuning in the age of LLMs …☆54Updated 5 months ago
- ☆86Updated 9 months ago
- ☆101Updated 10 months ago
- coding CUDA everyday!☆36Updated 2 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆315Updated last month
- My personal site☆77Updated 11 months ago
- ☆148Updated 3 weeks ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆15Updated 3 months ago
- ☆115Updated 7 months ago
- ☆58Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆311Updated this week
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆150Updated 2 months ago
- A compact LLM pretrained in 9 days by using high quality data☆318Updated 3 months ago
- a LLM cookbook, for building your own from scratch, all the way from gathering data to training a model☆147Updated last year
- A lightweight, local-first, and free experiment tracking Python library built on top of 🤗 Datasets and Spaces.☆227Updated last week
- An automated tool for discovering insights from research papaer corpora☆138Updated last year