Elktrn / Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-pythonView on GitHub
solving a simple 4*4 Gridworld almost similar to openAI gym FrozenLake using Qlearning Temporal difference method Reinforcement Learning
☆14Feb 22, 2025Updated last year
Alternatives and similar repositories for Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python
Users that are interested in Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- find arrangement for n Queens in n*n board of chees using Genetic algorithms☆14Feb 22, 2025Updated last year
- cross platform presentation softwate☆16Dec 17, 2019Updated 6 years ago
- ☆15Aug 4, 2024Updated last year
- A simple example of react app with node.js, express.js, socket.io, created to answer a question in stack overflow.☆11Dec 25, 2021Updated 4 years ago
- A deep learning library for streamlining research and development using the Torch7 distribution.☆339Sep 1, 2016Updated 9 years ago
- AutoPrompt: Automatic Prompt Construction for Masked Language Models.☆640Aug 24, 2024Updated last year
- A PyTorch implementation of the NIPS 2017 paper "Dynamic Routing Between Capsules".☆1,752Nov 9, 2018Updated 7 years ago
- Reference implementation for DPO (Direct Preference Optimization)☆2,868Aug 11, 2024Updated last year
- 🕹️ Open-source, developer-first LLMOps platform designed to streamline prompt design, version management, instant delivery, collaboratio…☆3,213Jun 28, 2025Updated 8 months ago
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.☆3,947Updated this week
- Harness LLMs with Multi-Agent Programming☆3,940Updated this week
- Summaries and notes on Deep Learning research papers☆4,418Feb 13, 2018Updated 8 years ago
- An open access book on scientific visualization using python and matplotlib☆11,217Jan 4, 2026Updated 2 months ago
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆30,264Updated this week
- The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling cus…☆17,826Updated this week
- Agent Zero AI framework☆16,290Updated this week
- GenAI Agent Framework, the Pydantic way☆15,571Updated this week
- Companion webpage to the book "Mathematics For Machine Learning"☆15,213Mar 13, 2025Updated last year
- The official Python SDK for Model Context Protocol servers and clients☆22,245Updated this week
- Build resilient language agents as graphs.☆27,302Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more☆35,190Updated this week
- DSPy: The framework for programming—not prompting—language models☆33,038Updated this week
- Unsloth Studio is a web UI for training and running open models like Qwen, DeepSeek, gpt-oss and Gemma locally.☆57,673Updated this week
- The fundamental package for scientific computing with Python.☆31,621Updated this week
- Free universal database tool and SQL client☆49,222Updated this week
- The first real AI developer☆33,808Nov 10, 2025Updated 4 months ago
- A programming framework for agentic AI☆55,908Updated this week
- Tesseract Open Source OCR Engine (main repository)☆72,962Mar 16, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆74,135Updated this week
- An extremely fast Python package and project manager, written in Rust.☆81,647Updated this week
- A latent text-to-image diffusion model☆72,709Jun 18, 2024Updated last year
- Dear ImGui: Bloat-free Graphical User interface for C++ with minimal dependencies☆72,142Updated this week
- LLM inference in C/C++☆98,911Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.☆165,557Updated this week
- nordic's qfaa-dcdc exported with altium2kicad and cleaned up☆12Mar 17, 2016Updated 10 years ago
- Determine how intervals relate to each other.☆79Mar 3, 2026Updated 3 weeks ago
- An Open-Source Framework for Prompt-Learning.☆4,843Jul 16, 2024Updated last year
- An asynchronous runtime 3D-model importer for Unity☆16Mar 19, 2024Updated 2 years ago
- Python interface to TECAN Fluent liquid handling robot☆21Apr 18, 2024Updated last year