jianhuaixie / enforcelearningView external linksLinks
enforcement learning demo and note
☆22Sep 26, 2017Updated 8 years ago
Alternatives and similar repositories for enforcelearning
Users that are interested in enforcelearning are comparing it to the libraries listed below
Sorting:
- SentiStorm - Real-time Twitter Sentiment Classification based on Apache Storm☆10May 22, 2018Updated 7 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- Official code for the LoG2022 paper -- MSGNN: A Spectral Graph Neural Network Based on a Novel Magnetic Signed Laplacian.☆13Feb 8, 2025Updated last year
- ☆11Jan 17, 2026Updated 3 weeks ago
- My solutions toward CS294 homework: Deep Reinforcement Learning☆11Nov 14, 2018Updated 7 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- ☆11Sep 22, 2019Updated 6 years ago
- ☆10Oct 3, 2023Updated 2 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- ☆11Mar 15, 2019Updated 6 years ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- ***Warning*** Old Apache Flink Graph API: This repository is not in use anymore.☆15Jan 18, 2016Updated 10 years ago
- ☆11Oct 19, 2020Updated 5 years ago
- ☆10Jun 28, 2015Updated 10 years ago
- older version of Kinect simulator and original opcodemesh code☆14Apr 6, 2015Updated 10 years ago
- RiddleSense: Reasoning about Riddle Questions Featuring Linguistic Creativity and Commonsense Knowledge☆14Oct 20, 2021Updated 4 years ago
- Online machine learning algorithms based on Spark streaming☆12Nov 30, 2015Updated 10 years ago
- Let there be clock in the beach - WACV 2022☆15Nov 15, 2021Updated 4 years ago
- The MOPED framework: Object recognition and pose estimation for manipulation☆14Aug 1, 2016Updated 9 years ago
- Comprehensive Implementation of Proximal Policy Optimization☆12Aug 3, 2021Updated 4 years ago
- PyTorch implementation of "The Option Keyboard: Combining Skills in Reinforcement Learning" (NeurIPS 2019)☆12Jul 2, 2020Updated 5 years ago
- Official Code Repository for [AutoScale📈: Scale-Aware Data Mixing for Pre-Training LLMs] Published as a conference paper at **COLM 2025*…☆13Aug 8, 2025Updated 6 months ago
- ☆15Oct 11, 2022Updated 3 years ago
- Reinforcement Learning with Convex Constraints☆14Apr 6, 2022Updated 3 years ago
- Catch game example is translated by TensorFlow☆16May 8, 2017Updated 8 years ago
- A compiler for Pig Latin to Spark and Flink.☆23Nov 21, 2019Updated 6 years ago
- ☆17Feb 15, 2021Updated 4 years ago
- ☆15Nov 9, 2017Updated 8 years ago
- Pytorch implementation for NeurIPS-23:"GNNEvaluator: Evaluating GNN Performance On Unseen Graphs Without Labels"☆19Mar 21, 2024Updated last year
- Representation Learning in RL☆13Jun 1, 2022Updated 3 years ago
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆18Jun 6, 2024Updated last year
- Incremental Learning Event Definitions☆15Jul 21, 2015Updated 10 years ago
- ☆17Oct 15, 2023Updated 2 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 2 years ago
- ADMM Logistic Regression implemented in Spark☆32Jan 20, 2014Updated 12 years ago
- ☆16Jun 29, 2022Updated 3 years ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- The agent tying together the components of Project Happy Meal☆21Apr 21, 2019Updated 6 years ago