huggingface/trl-tuto

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huggingface/trl-tuto)

huggingface / trl-tuto

☆52

Alternatives and similar repositories for trl-tuto

Users that are interested in trl-tuto are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hanneshapke / TensorFlow-World-Adv-Introduction-TF-Serving
View on GitHub
This repository contains all code examples for my TensorFlow World talk about "Advanced model deployments with TensorFlow Serving"
☆17Dec 8, 2022Updated 3 years ago
OpenPipe / deductive-reasoning
View on GitHub
Train your own SOTA deductive reasoning model
☆111Mar 6, 2025Updated last year
amazon-science / graph-lm-ensemble
View on GitHub
☆15Jun 2, 2025Updated last year
agramfort / DS3_practical_optim_for_ml
View on GitHub
Notebooks from DS3 course on practical optimization
☆15Jan 5, 2021Updated 5 years ago
peerdavid / layerwise-batch-entropy
View on GitHub
Layerwise Batch Entropy Regularization
☆24Aug 3, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
gabrielolympie / PromptServer
View on GitHub
A FastAPI server that turns markdown prompt files into API endpoints with minimal configuration.
☆15Sep 5, 2025Updated 10 months ago
solegalli / Python-Feature-Engineering-Cookbook-Second-Edition
View on GitHub
☆12May 27, 2024Updated 2 years ago
numanai / Visual-Question-Answering-for-Medical-domain
View on GitHub
☆12Mar 18, 2024Updated 2 years ago
microsoft / post-training-toolkit
View on GitHub
☆25Jan 28, 2026Updated 6 months ago
tatsu432 / BDCM
View on GitHub
☆17Mar 24, 2026Updated 4 months ago
jina-ai / correlations
View on GitHub
Simple UI for debugging correlations of text embeddings
☆315May 28, 2025Updated last year
ashishpatel26 / MLflow_End_to_End_Example
View on GitHub
MLflow is Open source platform for the machine learning lifecycle so here you can learn MLflow End to End Example with Prediction.
☆13Jun 14, 2022Updated 4 years ago
Human-Centric-Machine-Learning / counterfactual-llms
View on GitHub
Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.
☆34Nov 7, 2024Updated last year
somewheresystems / llama2mlx
View on GitHub
Karpathy's llama2.c transpiled to MLX for Apple Silicon
☆14Dec 28, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
huggingface / hf_benchmarks
View on GitHub
A starter kit for evaluating benchmarks on the 🤗 Hub
☆18Apr 8, 2026Updated 3 months ago
ZootoPi / advanced_data_science_ibm
View on GitHub
Advanced Data Science with IBM Specialization
☆12Aug 9, 2021Updated 4 years ago
garrisonhess / llama2.c
View on GitHub
Inference Llama 2 in one file of pure C
☆14Jul 24, 2023Updated 3 years ago
parlance-labs / ftcourse
View on GitHub
☆170Jun 3, 2024Updated 2 years ago
smeznar / SNoRe
View on GitHub
SNoRe: Scalable Unsupervised Learning of Symbolic Node Representations
☆11Sep 26, 2023Updated 2 years ago
yangliuy / Intent-Aware-Ranking-Transformers
View on GitHub
Code on IART: Intent-aware Response Ranking with Transformers in Information-seeking Conversation Systems (WWW 2020)
☆11Apr 18, 2021Updated 5 years ago
whitead / emoji-math
View on GitHub
A complete waste of time
☆15Dec 11, 2022Updated 3 years ago
tboulet / Alan-Code-agent
View on GitHub
Open-source Python implementation of a Claude-Code-like agent (Gemini CLI, Codex, Copilot...). Usable in CLI, GUI, or as a python library…
☆24Updated this week
LuisEstebanAcevedoBringas / FCOS_torch
View on GitHub
☆10Jul 18, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
danielemalitesta / Multimodal-DL-4-RecSys
View on GitHub
Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School
☆12Oct 12, 2024Updated last year
shisa-ai / shaberi
View on GitHub
Lightblue LLM Eval Framework: tengu, elyza100, ja-mtbench, rakuda
☆19Apr 29, 2026Updated 3 months ago
alif-munim / minOFT
View on GitHub
A minimal re-implementation of orthogonal fine-tuning (OFT), a diffusion method, for LLMs. Based on nanoGPT and minLoRA.
☆14Nov 17, 2023Updated 2 years ago
tatsuokun / sentence_compression
View on GitHub
Simple model for sentence compression (a.k.a Baseline in Klerke et al., NAACL 2016)
☆10Dec 16, 2018Updated 7 years ago
hallerite / ludic
View on GitHub
Ludic – an LLM-RL library for the era of experience
☆67Jan 9, 2026Updated 6 months ago
huggingface / trl-jobs
View on GitHub
Train LLM on Hugging Face infra
☆72May 26, 2026Updated 2 months ago
TAMU-AML / DSWE-Package
View on GitHub
An R implementation of some of the data science methods for wind energy (DSWE) applications.
☆11Feb 6, 2024Updated 2 years ago
liamlio / MolGAN
View on GitHub
AI for a cure, a combination of Latent-GAN and VAE-JTNN to create 100% valid drug like molecules
☆10Mar 16, 2020Updated 6 years ago
AI-Maker-Space / Fine-tuning-LLM-Resources
View on GitHub
A collection of fine-tuning notebooks!
☆32Oct 5, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Pleias / Pleias-RAG-Library
View on GitHub
Python library to use Pleias-RAG models
☆72Jul 1, 2026Updated 3 weeks ago
interstellarninja / MeeseeksAI
View on GitHub
A framework for orchestrating AI agents using a mermaid graph
☆76May 16, 2024Updated 2 years ago
BY571 / DistRL-LLM
View on GitHub
Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization
☆22Mar 12, 2025Updated last year
TimotheeMickus / codwoe
View on GitHub
The CODWOE shared task invites you to compare two types of semantic descriptions: dictionary glosses and word embedding representations. …
☆12Jul 13, 2022Updated 4 years ago
jbossdemocentral / rhdm7-qlb-loan-demo
View on GitHub
☆13Jul 19, 2021Updated 5 years ago
fsndzomga / open_source_lrm
View on GitHub
☆10Oct 24, 2024Updated last year
anpaure / cp_eval
View on GitHub
Tiny evaluation of leading LLMs on competitive programming problems
☆14Apr 10, 2026Updated 3 months ago