chavicoski / Deep-RL-course
Exercises of the reinforcement learning course from Hugging Face
☆12Updated 2 years ago
Alternatives and similar repositories for Deep-RL-course:
Users that are interested in Deep-RL-course are comparing it to the libraries listed below
- GBRL-based Actor-Critic algorithms implemented in stable-baselines3☆34Updated 3 weeks ago
- ML/DL Math and Method notes☆60Updated last year
- Important Note fastrl version 2 is being developed at fastrl. Note the link in the readme☆39Updated 4 years ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆32Updated 5 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated last year
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Updated 6 months ago
- This course aims to teach from the basics of RL to advanced algorithms such as PPO.☆17Updated last week
- Implementation of Soft Actor Critic and some of its improvements in Pytorch☆56Updated 2 months ago
- This repo is for members of the rl-implementation channel on MLC Discord to play with RL algorithms and learn.☆10Updated 3 years ago
- Repository of notes, code and notebooks in Python for the book "Reinforcement Learning: An Introduction" by Richard S. Sutton and Andrew …☆30Updated 2 weeks ago
- Highly commented implementations of Transformers in PyTorch☆136Updated last year
- Official Implementation of the 'When XGBoost Outperforms GPT-4 on Text Classification: A Case Study' NAACL-W 2024 paper☆15Updated 4 months ago
- The fastai book, 2nd edition (in progress)☆51Updated 9 months ago
- Exploring machine learning engineering and operations. ❚☆39Updated last week
- Some helpers and examples for creating an LLM fine-tuning dataset☆70Updated last year
- ☆17Updated 2 months ago
- Technical documents on a variety of topics, created for the purpose of learning☆33Updated last week
- Get packages onto your conda channel faster☆25Updated 4 months ago
- Study the temporal performance degradation of machine learning models.☆16Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆198Updated 11 months ago
- ☆18Updated 7 months ago
- TPU use in single line in colab using tf2 package.☆11Updated 3 years ago
- PyTorch Guide☆28Updated 3 years ago
- Hub for researchers exploring VLMs and Multimodal Learning:)☆25Updated last week
- Automatic Machine Learning (AutoML) for Wave Apps☆32Updated last year
- A minimal PyTorch re-implementation of GPT (Generative Pretrained Transformer) language model training☆14Updated last year
- A set of NLP tools created during my medium NLP Explanation series.☆31Updated 11 months ago
- PyTorch code for DeepTime: Deep Time-Index Meta-Learning for Non-Stationary Time-Series Forecasting☆11Updated 2 years ago
- Collecting news articles for all the companies in the R1000, for a pre-defined set of news outlets, using Diffbot's Knowledge Graph☆11Updated 2 years ago
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last week