HiddenBeginner/Deep-Reinforcement-Learnings

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HiddenBeginner/Deep-Reinforcement-Learnings)

HiddenBeginner / Deep-Reinforcement-Learnings

심층강화학습 책 https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings

☆11

Alternatives and similar repositories for Deep-Reinforcement-Learnings

Users that are interested in Deep-Reinforcement-Learnings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

postech-minds / postech-minds
View on GitHub
Archiving everything we have studied
☆10Jan 23, 2022Updated 4 years ago
daje0601 / CookBook
View on GitHub
LLM의 다양한 튜닝 방법과 데이터 전처리 코드를 정리해놓았습니다.
☆14May 18, 2026Updated 2 months ago
gyunggyung / OpenMLLM
View on GitHub
Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?
☆19Jan 31, 2025Updated last year
ASzot / rl-toolkit
View on GitHub
☆12Apr 26, 2022Updated 4 years ago
marcharper / pyed
View on GitHub
Computes trajectories for evolutionary dynamics.
☆15Oct 6, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CS285-KOR / CS285_21_KOR_SUB
View on GitHub
Korean Sub for CS285 2021 fall lecture
☆14Apr 2, 2022Updated 4 years ago
goddoe / RLYX
View on GitHub
A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.
☆38Aug 27, 2025Updated 10 months ago
microsoft / lightATAC
View on GitHub
A lightweight reimplementation of Adversarially Trained Actor Critic
☆19Mar 19, 2026Updated 4 months ago
liner-engineering / liner-pdf-chat-tutorial
View on GitHub
LINER PDF Chat Tutorial with ChatGPT & Pinecone
☆49May 30, 2023Updated 3 years ago
david-lindner / safe-grid-gym
View on GitHub
A gym interface for AI safety gridworlds created in pycolab.
☆18May 12, 2022Updated 4 years ago
kazukiosawa / ngd_in_wide_nn
View on GitHub
simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-spac…
☆16Oct 21, 2020Updated 5 years ago
kekmodel / gym-tictactoe-zero
View on GitHub
Tic Tac Toe with Alpha Zero method - My first work
☆18Aug 23, 2018Updated 7 years ago
kakumarabhishek / MCC-Loss
View on GitHub
Matthews Correlation Coefficient Loss implementation for image segmentation.
☆12Mar 19, 2026Updated 4 months ago
SafeRL-Lab / Uncertainty-in-RL
View on GitHub
The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.
☆23Jun 16, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
chez-tozhang / LoGo-SSL
View on GitHub
☆17Jun 10, 2024Updated 2 years ago
jeongukjae / korean-wikipedia-corpus
View on GitHub
문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.
☆24Sep 6, 2023Updated 2 years ago
nerdimite / maml
View on GitHub
Model-Agnostic Meta-Learning in PyTorch
☆12Jul 31, 2020Updated 5 years ago
wisepaip / paip2020
View on GitHub
☆14Aug 26, 2020Updated 5 years ago
sadimanna / simclr_pytorch
View on GitHub
Implementation of SimCLR in PyTorch
☆13Jul 8, 2021Updated 5 years ago
sooftware / Naver-AI-Hackathon-Speech
View on GitHub
2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib
☆22Jun 11, 2020Updated 6 years ago
daekeun-ml / KoSimCSE-SageMaker
View on GitHub
This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…
☆22Oct 6, 2023Updated 2 years ago
minkyujeon / MAML_Pytorch
View on GitHub
MAML implementation with pytorch
☆11Sep 23, 2020Updated 5 years ago
romanpogodin / towards-bio-plausible-conv
View on GitHub
☆14Jul 22, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
juice500ml / dysarthria-mtl
View on GitHub
Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…
☆12Feb 14, 2024Updated 2 years ago
lheadjh / FaceTypeDetector
View on GitHub
Face Type Detector Using Active Shape Models with Stasm
☆12Jan 25, 2016Updated 10 years ago
51616 / marl-lipo
View on GitHub
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
☆19May 10, 2024Updated 2 years ago
AprilYapingZhang / awesome-ocr
View on GitHub
☆18Apr 11, 2023Updated 3 years ago
MuMiN-dataset / mumin-baseline
View on GitHub
Baseline implementations on the MuMiN dataset
☆10Oct 18, 2023Updated 2 years ago
tristandeleu / jax-comln
View on GitHub
Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)
☆25Mar 4, 2022Updated 4 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
View on GitHub
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12Mar 23, 2021Updated 5 years ago
dojeon-ai / PLASTIC
View on GitHub
Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)
☆23Dec 8, 2023Updated 2 years ago
PerceptionComputingLab / PARSE2022
View on GitHub
Official repository of MICCAI 2022 Parse challenge
☆15Jul 22, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
uncharted-technologies / risk-and-uncertainty
View on GitHub
Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"
☆31Nov 22, 2022Updated 3 years ago
SeoulTechPSE / EngMath
View on GitHub
공학수학 강의노트
☆19Feb 27, 2024Updated 2 years ago
Atipico1 / Kor-IR
View on GitHub
Kor-IR: Korean Information Retrieval Benchmark
☆87Jul 3, 2024Updated 2 years ago
fmelihh / circuit-breaker-pattern-fastapi
View on GitHub
Python FastApi "Circuit Breaker" implementation
☆13Mar 14, 2025Updated last year
raymondchua / simple_successor_features
View on GitHub
Simple Successor Features
☆20Jul 15, 2025Updated last year
samlobel / CFN
View on GitHub
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
☆25Dec 29, 2023Updated 2 years ago
jaehyeongAN / KoELECTRA-finetuned-sentiment-analysis
View on GitHub
Generalized Sentiment Classifier finetuned by KoELECTRA
☆11Nov 28, 2024Updated last year