liziniu / KnapsackRLView external linksLinks
☆17Oct 30, 2025Updated 3 months ago
Alternatives and similar repositories for KnapsackRL
Users that are interested in KnapsackRL are comparing it to the libraries listed below
Sorting:
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆24Apr 14, 2022Updated 3 years ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆33Jul 14, 2025Updated 7 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Dec 19, 2023Updated 2 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆51May 12, 2025Updated 9 months ago
- Face Recognition on NVIDIA TX2☆10Sep 5, 2018Updated 7 years ago
- sc14 matlab application☆14Nov 24, 2014Updated 11 years ago
- ☆10Jul 13, 2024Updated last year
- ☆41Mar 22, 2024Updated last year
- ☆13May 30, 2019Updated 6 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- ☆13Apr 3, 2019Updated 6 years ago
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15Jan 4, 2026Updated last month
- This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.☆10May 30, 2018Updated 7 years ago
- TD-VAE in PyTorch☆10May 28, 2019Updated 6 years ago
- ☆11Jan 12, 2021Updated 5 years ago
- Enumerate all Nash equilibria of a bimatrix game (i.e. 2-player strategic-form game)☆15Apr 26, 2024Updated last year
- 2023ICS_riscv32☆19Sep 10, 2024Updated last year
- PyTorch Implementation of Visual GAIL in Atari Games☆14Dec 7, 2022Updated 3 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Mar 11, 2025Updated 11 months ago
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- A fun review of spectral clustering with MATLAB demos I made for the NU machine learning meetiup in 2014☆12Mar 4, 2016Updated 9 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- 16824 homework: weakly supervised object detection with PyTorch☆13Sep 5, 2018Updated 7 years ago
- Levin tree search guided by both a policy and a heuristic function☆19Jul 13, 2023Updated 2 years ago
- JOYTOU is a BootStrap blog template developed by Joytou Wu.☆10Feb 5, 2020Updated 6 years ago
- Actor critic reinforcement learning + motion and task planning under LTL tasks + wireless sensor network routing☆14Mar 6, 2021Updated 4 years ago
- Large matrix multiplication in CUDA☆17Oct 20, 2023Updated 2 years ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- USTC 自动每日打卡/自动跨校区报备/自动生成行程码☆16Feb 26, 2023Updated 2 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆23Apr 26, 2025Updated 9 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆25Dec 12, 2023Updated 2 years ago
- ☆22Feb 4, 2025Updated last year
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆27Apr 17, 2025Updated 9 months ago
- Reinforcement learning tutorials using the rlberry library.☆17Jan 9, 2023Updated 3 years ago
- ☆16Jun 12, 2018Updated 7 years ago
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 2 years ago
- Implementation of POMDP algorithms on the tiger example, as described in Littman, Cassandra and Kaelbling (1994).☆17Aug 8, 2017Updated 8 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago