심층강화학습 책 https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings
☆11May 10, 2024Updated last year
Alternatives and similar repositories for Deep-Reinforcement-Learnings
Users that are interested in Deep-Reinforcement-Learnings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM의 다양한 튜닝 방법과 데이터 전처리 코드를 정리해놓았습니다.☆14Feb 23, 2026Updated last month
- Open Source + Multilingual MLLM + Fine-tuning + Distillation + More efficient models and learning + ?☆18Jan 31, 2025Updated last year
- ☆12Apr 26, 2022Updated 3 years ago
- Computes trajectories for evolutionary dynamics.☆15Oct 6, 2020Updated 5 years ago
- Korean Sub for CS285 2021 fall lecture☆13Apr 2, 2022Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A lightweight reimplementation of Adversarially Trained Actor Critic☆20Mar 19, 2026Updated 3 weeks ago
- ☆47Mar 20, 2026Updated 3 weeks ago
- LINER PDF Chat Tutorial with ChatGPT & Pinecone☆49May 30, 2023Updated 2 years ago
- Code for the paper "PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning" (NeurIPS 2023)☆22Dec 8, 2023Updated 2 years ago
- A gym interface for AI safety gridworlds created in pycolab.☆18May 12, 2022Updated 3 years ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.☆37Aug 27, 2025Updated 7 months ago
- simple JAX-/NumPy-based implementations of NGD with exact/approximate Fisher Information Matrix both in parameter-space and function-spac…☆16Oct 21, 2020Updated 5 years ago
- 문장단위로 분절된 한국어 위키피디아 코퍼스. Releases에서 다운로드 받거나 tfds-korean으로 사용해주세요.☆24Sep 6, 2023Updated 2 years ago
- Tic Tac Toe with Alpha Zero method - My first work☆18Aug 23, 2018Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The repository is for Reinforcement-Learning Uncertainty research, in which we investigate various uncertain factors in RL.☆23Jun 16, 2023Updated 2 years ago
- Matthews Correlation Coefficient Loss implementation for image segmentation.☆12Mar 19, 2026Updated 3 weeks ago
- ☆17Jun 10, 2024Updated last year
- This is a hands-on for ML beginners to perform SimCSE step-by-step. Implemented both supervised SimCSE and unsupervisied SimCSE, and dist…☆22Oct 6, 2023Updated 2 years ago
- Model-Agnostic Meta-Learning in PyTorch☆11Jul 31, 2020Updated 5 years ago
- ☆14Aug 26, 2020Updated 5 years ago
- 2019 Clova AI Hackathon : Speech - Rank 12 / Team Kai.Lib☆22Jun 11, 2020Updated 5 years ago
- Implementation of SimCLR in PyTorch☆13Jul 8, 2021Updated 4 years ago
- MAML implementation with pytorch☆11Sep 23, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆14Jul 22, 2021Updated 4 years ago
- ☆19Nov 21, 2023Updated 2 years ago
- Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…☆11Feb 14, 2024Updated 2 years ago
- ☆18Apr 11, 2023Updated 3 years ago
- Face Type Detector Using Active Shape Models with Stasm☆12Jan 25, 2016Updated 10 years ago
- Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)☆19May 10, 2024Updated last year
- Official Implementation of ICLR2025 Paper: Songyuan Zhang, Oswin So, Mitchell Black, Chuchu Fan: "Discrete GCBF Proximal Policy Optimizat…☆25May 14, 2025Updated 10 months ago
- ☆43Feb 25, 2026Updated last month
- Code for "Continuous-Time Meta-Learning with Forward Mode Differentiation" (ICLR 2022)☆25Mar 4, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Kor-IR: Korean Information Retrieval Benchmark☆87Jul 3, 2024Updated last year
- 공학수학 강의노트☆19Feb 27, 2024Updated 2 years ago
- Python FastApi "Circuit Breaker" implementation☆13Mar 14, 2025Updated last year
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning"☆31Nov 22, 2022Updated 3 years ago
- Official repository of MICCAI 2022 Parse challenge☆15Jul 22, 2023Updated 2 years ago
- This project automates promotional posts across multiple social media platforms.☆33Mar 30, 2026Updated last week