Elktrn/Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python

Solves a simple 4×4 Gridworld, similar to OpenAI Gym's FrozenLake, using Q-learning, a temporal-difference reinforcement learning method.

☆14

Alternatives and similar repositories for Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python

Users that are interested in Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python are comparing it to the libraries listed below.

Elktrn / nQueens-problem-solved-with-evolutionary-strategy-in-python
Find an arrangement of n queens on an n×n chessboard using genetic algorithms
☆14 · Feb 22, 2025 · Updated last year
mshobeyri / DemonPresentationBoard
Cross-platform presentation software
☆16 · Dec 17, 2019 · Updated 6 years ago
MrJs6781 / neshan_map
☆15 · Aug 4, 2024 · Updated last year
mohammadoftadeh / simple-react-socket-io
A simple example of a React app with Node.js, Express.js, and Socket.IO, created to answer a question on Stack Overflow.
☆11 · Dec 25, 2021 · Updated 4 years ago
nicholas-leonard / dp
A deep learning library for streamlining research and development using the Torch7 distribution.
☆339 · Sep 1, 2016 · Updated 9 years ago
ucinlp / autoprompt
AutoPrompt: Automatic Prompt Construction for Masked Language Models.
☆640 · Aug 24, 2024 · Updated last year
gram-ai / capsule-networks
A PyTorch implementation of the NIPS 2017 paper "Dynamic Routing Between Capsules".
☆1,752 · Nov 9, 2018 · Updated 7 years ago
eric-mitchell / direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
☆2,868 · Aug 11, 2024 · Updated last year
pezzolabs / pezzo
🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboratio…
☆3,213 · Jun 28, 2025 · Updated 8 months ago
Agenta-AI / agenta
The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.
☆3,947 · Updated this week
langroid / langroid
Harness LLMs with Multi-Agent Programming
☆3,940 · Updated this week
dennybritz / deeplearning-papernotes
Summaries and notes on Deep Learning research papers
☆4,418 · Feb 13, 2018 · Updated 8 years ago
rougier / scientific-visualization-book
An open access book on scientific visualization using python and matplotlib
☆11,217 · Jan 4, 2026 · Updated 2 months ago
HKUDS / LightRAG
[EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"
☆30,264 · Updated this week
emcie-co / parlant
The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling cus…
☆17,826 · Updated this week
agent0ai / agent-zero
Agent Zero AI framework
☆16,290 · Updated this week
pydantic / pydantic-ai
GenAI Agent Framework, the Pydantic way
☆15,571 · Updated this week
mml-book / mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"
☆15,213 · Mar 13, 2025 · Updated last year
modelcontextprotocol / python-sdk
The official Python SDK for Model Context Protocol servers and clients
☆22,245 · Updated this week
langchain-ai / langgraph
Build resilient language agents as graphs.
☆27,302 · Updated this week
jax-ml / jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
☆35,190 · Updated this week
stanfordnlp / dspy
DSPy: The framework for programming—not prompting—language models
☆33,038 · Updated this week
unslothai / unsloth
Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.
☆57,673 · Updated this week
numpy / numpy
The fundamental package for scientific computing with Python.
☆31,621 · Updated this week
dbeaver / dbeaver
Free universal database tool and SQL client
☆49,222 · Updated this week
Pythagora-io / gpt-pilot
The first real AI developer
☆33,808 · Nov 10, 2025 · Updated 4 months ago
microsoft / autogen
A programming framework for agentic AI
☆55,908 · Updated this week
tesseract-ocr / tesseract
Tesseract Open Source OCR Engine (main repository)
☆72,962 · Mar 16, 2026 · Updated last week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆74,135 · Updated this week
astral-sh / uv
An extremely fast Python package and project manager, written in Rust.
☆81,647 · Updated this week
CompVis / stable-diffusion
A latent text-to-image diffusion model
☆72,709 · Jun 18, 2024 · Updated last year
ocornut / imgui
Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies
☆72,142 · Updated this week
ggml-org / llama.cpp
LLM inference in C/C++
☆98,911 · Updated this week
ollama / ollama
Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆165,557 · Updated this week
jacobrosenthal / nrf52-kicad
nordic's qfaa-dcdc exported with altium2kicad and cleaned up
☆12 · Mar 17, 2016 · Updated 10 years ago
tfausak / rampart
Determine how intervals relate to each other.
☆79 · Mar 3, 2026 · Updated 3 weeks ago
thunlp / OpenPrompt
An Open-Source Framework for Prompt-Learning.
☆4,843 · Jul 16, 2024 · Updated last year
fhstp / UnityImport3D
An asynchronous runtime 3D-model importer for Unity
☆16 · Mar 19, 2024 · Updated 2 years ago
leylabmpi / pyTecanFluent
Python interface to TECAN Fluent liquid handling robot
☆21 · Apr 18, 2024 · Updated last year