Deep reinforcement learning book https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings
☆11May 10, 2024Updated last year
Alternatives and similar repositories for Deep-Reinforcement-Learnings
Users that are interested in Deep-Reinforcement-Learnings are comparing it to the libraries listed below.
- A collection of various LLM tuning methods and data preprocessing code.☆14Feb 23, 2026Updated last month
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- ☆12Apr 26, 2022Updated 3 years ago
- Computes trajectories for evolutionary dynamics.☆15Oct 6, 2020Updated 5 years ago
- Korean subtitles for the CS285 Fall 2021 lectures☆13Apr 2, 2022Updated 3 years ago
- ☆34Mar 14, 2026Updated last week
- A lightweight reimplementation of Adversarially Trained Actor Critic☆19Updated this week
- LINER PDF Chat Tutorial with ChatGPT & Pinecone☆49May 30, 2023Updated 2 years ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆22Dec 8, 2023Updated 2 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- A hackable, simple, and research-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 6 months ago
- Korean Wikipedia corpus segmented into sentences. Download it from Releases or use it via tfds-korean.☆24Sep 6, 2023Updated 2 years ago
- simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-spac…☆16Oct 21, 2020Updated 5 years ago
- Tic Tac Toe with Alpha Zero method - My first work☆18Aug 23, 2018Updated 7 years ago
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 2 years ago
- Matthews Correlation Coefficient Loss implementation for image segmentation.☆12Updated this week
- ☆17Jun 10, 2024Updated last year
- EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets☆10Dec 12, 2023Updated 2 years ago
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervised SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- Model-Agnostic Meta-Learning in PyTorch☆11Jul 31, 2020Updated 5 years ago
- ☆14Aug 26, 2020Updated 5 years ago
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib☆22Jun 11, 2020Updated 5 years ago
- Implementation of SimCLR in PyTorch☆13Jul 8, 2021Updated 4 years ago
- MAML implementation with PyTorch☆11Sep 23, 2020Updated 5 years ago
- ☆14Jul 22, 2021Updated 4 years ago
- ☆19Nov 21, 2023Updated 2 years ago
- Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…☆11Feb 14, 2024Updated 2 years ago
- ☆18Apr 11, 2023Updated 2 years ago
- Face Type Detector Using Active Shape Models with Stasm☆12Jan 25, 2016Updated 10 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year
- Official Implementation of ICLR2025 Paper: Songyuan Zhang, Oswin So, Mitchell Black, Chuchu Fan: "Discrete GCBF Proximal Policy Optimizat…☆25May 14, 2025Updated 10 months ago
- ☆39Feb 25, 2026Updated 3 weeks ago
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆25Mar 4, 2022Updated 4 years ago
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- Python FastApi "Circuit Breaker" implementation☆13Mar 14, 2025Updated last year
- Engineering mathematics lecture notes☆19Feb 27, 2024Updated 2 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- Official repository of MICCAI 2022 Parse challenge☆15Jul 22, 2023Updated 2 years ago