☆18Oct 30, 2025Updated 4 months ago
Alternatives and similar repositories for KnapsackRL
Users that are interested in KnapsackRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for ICLR 2022 Paper (HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning)☆12Nov 28, 2023Updated 2 years ago
- PyTorch Implementation of Visual GAIL in Atari Games☆14Dec 7, 2022Updated 3 years ago
- This repo contains the code for the paper "Understanding and Mitigating Hallucinations in Large Vision-Language Models via Modular Attrib…☆35Jul 14, 2025Updated 8 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- USTC 自动每日打卡/自动跨校区报备/自动生成行程码☆16Feb 26, 2023Updated 3 years ago
- Source for the sample efficient tabular RL submission to the 2019 NIPS workshop on Biological and Artificial RL☆24Apr 14, 2022Updated 3 years ago
- 2023ICS_riscv32☆19Sep 10, 2024Updated last year
- Irene is a python package that aims to be a toolkit for global optimization problems that can be realized algebraically. It generalizes L…☆15Updated this week
- ☆41Mar 22, 2024Updated 2 years ago
- Code for Paper (Preserving Diversity in Supervised Fine-tuning of Large Language Models)☆52May 12, 2025Updated 10 months ago
- ☆10Jul 13, 2024Updated last year
- Face Recognition on NVIDIA TX2☆10Sep 5, 2018Updated 7 years ago
- ☆13May 30, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Mar 11, 2025Updated last year
- ☆22Feb 4, 2025Updated last year
- Official implementation of the paper `Augmenting GAIL with BC for sample efficient imitation learning` in PyTorch☆35Jan 3, 2021Updated 5 years ago
- Welcome to the land of C Neuro.☆48Mar 13, 2026Updated last week
- This project applies Monte Carlo Tree Search (MCTS) to a simple grid world.☆10May 30, 2018Updated 7 years ago
- Implementation of Action Matching for the Schrödinger equation☆25Jun 18, 2023Updated 2 years ago
- sc14 matlab application☆14Nov 24, 2014Updated 11 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- ☆13Apr 3, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- TD-VAE in PyTorch☆10May 28, 2019Updated 6 years ago
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆29Apr 17, 2025Updated 11 months ago
- Ant Gather and Ant Maze envs, separated from RLLab☆11Aug 2, 2018Updated 7 years ago
- 16824 homework: weakly supervised object detection with PyTorch☆13Sep 5, 2018Updated 7 years ago
- Reproduction of the complete process of DeepSeek-R1 on small-scale models, including Pre-training, SFT, and RL.☆29Mar 11, 2025Updated last year
- ☆11Jan 12, 2021Updated 5 years ago
- An evaluation suite for Retrieval-Augmented Generation (RAG).☆23Apr 26, 2025Updated 11 months ago
- JOYTOU is a BootStrap blog template developed by Joytou Wu.☆10Feb 5, 2020Updated 6 years ago
- Levin tree search guided by both a policy and a heuristic function☆19Jul 13, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A fun review of spectral clustering with MATLAB demos I made for the NU machine learning meetiup in 2014☆12Mar 4, 2016Updated 10 years ago
- python algorithms to solve sparse linear programming problems☆34Jul 6, 2023Updated 2 years ago
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆453May 13, 2025Updated 10 months ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year
- 我们是第一个完全可商用的角色大模型。☆40Aug 11, 2024Updated last year
- Code for paper "Successor Uncertainties: Exploration and Uncertainty in Temporal Difference Learning" by David Janz*, Jiri Hron*, Przemys…☆21Feb 24, 2023Updated 3 years ago
- Large matrix multiplication in CUDA☆17Oct 20, 2023Updated 2 years ago