This is a project using neural-network reinforcement learning to solve the 8 puzzle problem (or even N puzzle)
☆11Mar 24, 2018Updated 8 years ago
Alternatives and similar repositories for Reinforcement-Learning-Q-learning-8puzzle-Pytorch
Users that are interested in Reinforcement-Learning-Q-learning-8puzzle-Pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Highly configurable simulation made using ns3 to compare two of the oldest TCP variants, Tahoe and Reno.☆11Feb 15, 2023Updated 3 years ago
- [SIGIR'25] Code of "Generative Recommender with End-to-End Learnable Item Tokenization".☆24Apr 17, 2025Updated 11 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- ☆12Feb 9, 2022Updated 4 years ago
- Geth + Elastic☆17Feb 7, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PyTorch implementation of R2D2 (Recurrent Reply Distributed DQN)☆12Nov 14, 2019Updated 6 years ago
- The implementation of paper "Strategy-aware Bundle Recommender System", SIGIR'23.☆15Sep 4, 2023Updated 2 years ago
- ☆13Mar 4, 2023Updated 3 years ago
- Implementation of EAutoDet☆12Oct 24, 2022Updated 3 years ago
- Implementation of CVPR2016 for Heart-Rate estimation.☆15Jun 11, 2017Updated 8 years ago
- A Python framework that uses machine learning algorithms to implement the metadata recovery attack against obfuscated programs.☆11Jul 25, 2016Updated 9 years ago
- zkSnark circuit compiler☆13Feb 19, 2026Updated last month
- A Formal Verification of Algorithm W☆17Mar 10, 2021Updated 5 years ago
- Fast subset and superset queries based on tries.☆11Jun 21, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Large Language Models as Evaluators for Recommendation Explanations (RecSys 2024 Reproducibility)☆20Aug 13, 2025Updated 7 months ago
- 자주 쓰이는 technique example code 모음☆22Nov 30, 2018Updated 7 years ago
- ☆16Mar 15, 2021Updated 5 years ago
- the GHOST protocol implementation on solidity.☆22Jun 4, 2019Updated 6 years ago
- Meme serving with NLP☆35May 20, 2023Updated 2 years ago
- Artemis Academy capstone project☆10Sep 10, 2022Updated 3 years ago
- ☆12May 7, 2023Updated 2 years ago
- Udacity Deep Reinforecment Learning - Implementation of Proximal Policy Optimization (PPO)☆14Nov 1, 2018Updated 7 years ago
- Library built from scratch to implement zk-protocols☆13Dec 13, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A template for creating new SBTs inheriting from the Masa SBT smart contracts, using ZKP.☆11Nov 25, 2024Updated last year
- A Lean 4 package for heavy numerical computations☆20Jan 16, 2022Updated 4 years ago
- [EMNLP 2024] The official GitHub repo for the paper "Course-Correction: Safety Alignment Using Synthetic Preferences"☆20Oct 2, 2024Updated last year
- coq-tutorial☆17Nov 11, 2019Updated 6 years ago
- オリジナルの漢字テストを作成するWebアプリ☆12Mar 5, 2024Updated 2 years ago
- PATRIOTIC - Pervasive Anti-Tampering and Anti-Repackaging for IoT for Integrated C-based Firmware☆10Jan 26, 2023Updated 3 years ago
- CIKM'20, Generate Neural Template Explanations for Recommendation☆23Jan 24, 2025Updated last year
- Repository containing the PhD Thesis "Formal Verification of Deep Reinforcement Learning Agents"☆11Aug 29, 2022Updated 3 years ago
- The Valida execution engine, prover, and verifier☆28Oct 6, 2025Updated 5 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Underconstrained symbolic execution for cryptography verification☆19Mar 26, 2021Updated 5 years ago
- Code and data for NAACL 2025 paper "IHEval: Evaluating Language Models on Following the Instruction Hierarchy"☆16Feb 25, 2025Updated last year
- Compile circom code to llvm partially☆12Feb 14, 2023Updated 3 years ago
- ☆15Sep 14, 2022Updated 3 years ago
- The Zero Knowledge Whitelist Tool is a powerful utility for managing an address whitelist using Zero-Knowledge (ZK) proofs.☆11Oct 3, 2025Updated 5 months ago
- Solution to Kaggle's Google Research Football Competition☆14Dec 2, 2020Updated 5 years ago
- A tool to search for gadgets, operations, and ROP chains using a backtracking algorithm in a tree-like structure☆19Jun 13, 2023Updated 2 years ago