An educational resource to help anyone learn deep reinforcement learning, with support for PyTorch
☆17Oct 19, 2023Updated 2 years ago
Alternatives and similar repositories for spinningup_pytorch
Users that are interested in spinningup_pytorch are comparing it to the libraries listed below.
- yet another anki app☆14Sep 9, 2024Updated last year
- Create shortcuts to Homebrew formula app bundles☆16May 6, 2024Updated 2 years ago
- Standardizing environment infrastructure with Strands Agents — step, observe, reward.☆47May 13, 2026Updated last week
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆18Dec 1, 2023Updated 2 years ago
- This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.☆32Jun 5, 2019Updated 6 years ago
- ☆11Jan 8, 2022Updated 4 years ago
- ✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks☆18Aug 16, 2024Updated last year
- Source code for Jordan Boyd-Graber's academic webpage.☆12Updated this week
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆28Oct 14, 2025Updated 7 months ago
- An example of DyNet autobatching for the NIPS "how to code a paper" workshop☆12Dec 9, 2017Updated 8 years ago
- Replication package for ISSTA2023 paper - Towards Efficient Fine-tuning of Pre-trained Code Models: An Experimental Study and Beyond☆23Apr 9, 2023Updated 3 years ago
- Mirror of 0.1.1 release of clausie from http://www.mpi-inf.mpg.de/departments/databases-and-information-systems/software/clausie/☆14Jan 4, 2015Updated 11 years ago
- Codebase for Context-aware Meta-learned Loss Scaling (CaMeLS). https://arxiv.org/abs/2305.15076.☆26Jan 23, 2024Updated 2 years ago
- List of papers that applied graph network to NLP☆13Feb 26, 2019Updated 7 years ago
- (Python3- TensorFlow 1.5) Code for the ACL 2017 paper "Get To The Point: Summarization with Pointer-Generator Networks"☆13Mar 23, 2018Updated 8 years ago
- A collection of papers on reinforcement learning applied to NLP☆14Sep 7, 2018Updated 7 years ago
- bert-pli applied to LeCaRD☆18Nov 14, 2021Updated 4 years ago
- Official implementation of MATPO: Multi-Agent Tool-Integrated Policy Optimization.☆81Oct 31, 2025Updated 6 months ago
- Code and data for Colors in Context and Generating Bilingual Pragmatic Color References☆12Mar 13, 2018Updated 8 years ago
- ☆11Aug 8, 2018Updated 7 years ago
- ☆13Oct 17, 2020Updated 5 years ago
- Show Azure Openai demos for partners☆24Jun 12, 2023Updated 2 years ago
- ☆26Jul 25, 2025Updated 9 months ago
- Legal document similarity - Code, data, and models for the ICAIL 2021 paper "Evaluating Document Representations for Content-based Legal …☆32Apr 29, 2021Updated 5 years ago
- Turn your 2-D wife(anime image) to 3-D wife(cosplay image) or opposite using DCGAN!☆15Sep 6, 2018Updated 7 years ago
- Library to compare and evaluate reward functions☆69Oct 23, 2023Updated 2 years ago
- Implementing keyword extraction algorithm using tf-idf weighting, see☆16Feb 2, 2017Updated 9 years ago
- ☆18Jun 12, 2017Updated 8 years ago
- A python package to design and debug RL agents.☆33Apr 2, 2026Updated last month
- Multimodal Neurons in Artificial Neural Networks☆16Oct 18, 2021Updated 4 years ago
- ☆41Jun 19, 2024Updated last year
- ☆26Dec 29, 2019Updated 6 years ago
- EfficientNet-Keras-GradCam-Visualization☆18Aug 31, 2019Updated 6 years ago
- MT/IE: Cross-lingual Open Information Extraction with Neural Sequence-to-Sequence Models☆23Jul 15, 2018Updated 7 years ago
- [ACL 2023 Findings] What In-Context Learning “Learns” In-Context: Disentangling Task Recognition and Task Learning☆21Jul 9, 2023Updated 2 years ago
- [ICML 2026 Spotlight] Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback☆66May 4, 2026Updated 2 weeks ago
- ☆47Mar 27, 2022Updated 4 years ago
- ☆47Dec 12, 2024Updated last year
- ROUGE summarization evaluation metric, enhanced with use of Word Embeddings☆23Oct 8, 2018Updated 7 years ago