Elktrn / Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python
Solving a simple 4x4 Gridworld, closely modeled on OpenAI Gym's FrozenLake, using Q-learning, a temporal-difference reinforcement learning method.
☆14 · Feb 22, 2025 · Updated last year
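For readers unfamiliar with the technique named in the description, here is a minimal sketch of tabular Q-learning on a 4x4 grid. It is not the repository's code. The hole layout (borrowed from FrozenLake's default 4x4 map), the reward values, the deterministic moves, the hyperparameters, and the names `step`, `train`, and `greedy_path` are all assumptions made for illustration.

```python
import random

N = 4                                          # grid side length
START, GOAL = (0, 0), (3, 3)                   # assumed start and goal cells
HOLES = {(1, 1), (1, 3), (2, 3), (3, 0)}       # assumed layout (FrozenLake-like)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]   # up, down, left, right


def step(state, action):
    """Deterministic move; bumping into a wall leaves the agent in place."""
    r, c = state
    dr, dc = ACTIONS[action]
    nxt = (min(max(r + dr, 0), N - 1), min(max(c + dc, 0), N - 1))
    if nxt == GOAL:
        return nxt, 1.0, True      # assumed reward for reaching the goal
    if nxt in HOLES:
        return nxt, -1.0, True     # assumed penalty for falling into a hole
    return nxt, 0.0, False


def train(episodes=2000, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Epsilon-greedy tabular Q-learning; returns the learned Q-table."""
    rng = random.Random(seed)
    q = {(r, c): [0.0] * 4 for r in range(N) for c in range(N)}
    for _ in range(episodes):
        state = START
        for _ in range(100):       # cap the episode length
            if rng.random() < epsilon:
                action = rng.randrange(4)          # explore
            else:
                best = max(q[state])               # exploit, random tie-break
                action = rng.choice(
                    [a for a in range(4) if q[state][a] == best])
            nxt, reward, done = step(state, action)
            # Temporal-difference target: r + gamma * max_a' Q(s', a')
            target = reward + (0.0 if done else gamma * max(q[nxt]))
            q[state][action] += alpha * (target - q[state][action])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=16):
    """Follow the greedy policy from START and return the visited cells."""
    state, path = START, [START]
    for _ in range(max_steps):
        action = max(range(4), key=lambda a: q[state][a])
        state, _, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

The update inside `train` is the standard Q-learning rule: each visited state-action value is nudged toward the observed reward plus the discounted best value of the next state. FrozenLake itself defaults to slippery, stochastic moves, which this sketch leaves out to keep the example short.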
Alternatives and similar repositories for Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python
Users who are interested in Reinforcement-Learning-solving-a-simple-4by4-Gridworld-using-Qlearning-in-python are comparing it to the repositories listed below.
- OpenCV bindings for Torch. ☆208 · Sep 3, 2018 · Updated 7 years ago
- AutoPrompt: Automatic Prompt Construction for Masked Language Models. ☆641 · Aug 24, 2024 · Updated last year
- The fastest way to build robust AI agents ☆2,110 · Jul 11, 2025 · Updated 7 months ago
- The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place. ☆3,887 · Updated this week
- An awesome & curated list of best LLMOps tools for developers ☆5,645 · Feb 3, 2026 · Updated last month
- Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc. ☆13,451 · Feb 16, 2026 · Updated 2 weeks ago
- 🔠 Foreign language reading and translation assistant based on copy and translate. ☆17,545 · Feb 23, 2026 · Updated last week
- Build resilient language agents as graphs. ☆25,083 · Feb 25, 2026 · Updated last week
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation" ☆28,682 · Feb 23, 2026 · Updated last week
- Distribute and run LLMs with a single file. ☆23,755 · Updated this week
- DSPy: The framework for programming—not prompting—language models ☆32,381 · Feb 24, 2026 · Updated last week
- The first real AI developer ☆33,798 · Nov 10, 2025 · Updated 3 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆53,029 · Updated this week
- Free universal database tool and SQL client ☆48,878 · Updated this week
- A programming framework for agentic AI ☆54,956 · Jan 22, 2026 · Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆71,234 · Updated this week
- An extremely fast Python package and project manager, written in Rust. ☆80,084 · Updated this week
- Tesseract Open Source OCR Engine (main repository) ☆72,562 · Feb 21, 2026 · Updated last week
- A latent text-to-image diffusion model ☆72,575 · Jun 18, 2024 · Updated last year
- LLM inference in C/C++ ☆96,322 · Updated this week
- ☆101,745 · Aug 28, 2025 · Updated 6 months ago
- Tensors and Dynamic neural networks in Python with strong GPU acceleration ☆97,870 · Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆125,513 · Updated this week
- 🦜🔗 The platform for reliable agents. ☆127,809 · Updated this week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆163,632 · Updated this week
- Repo for counting stars and contributing. Press F to pay respect to glorious developers. ☆275,567 · Aug 22, 2025 · Updated 6 months ago
- ☆16 · Apr 8, 2018 · Updated 7 years ago
- Explainer for black box models that predict molecule properties ☆347 · May 8, 2025 · Updated 9 months ago
- Processing 3.x Template for KidzLabs/4M/Toysmith Animation Praxinoscope ☆20 · Jan 26, 2018 · Updated 8 years ago
- ☆21 · Dec 15, 2023 · Updated 2 years ago
- Generate a markdown or JSON list of contributors for a project using the GitHub API. ☆17 · Jan 4, 2017 · Updated 9 years ago
- Use as a sub-generator or plugin in your generator to create a package.json for a project. Or install globally and run with Generate's CL… ☆18 · May 30, 2021 · Updated 4 years ago
- Terraform enables you to safely and predictably create, change, and improve infrastructure. It is a source-available tool that codifies A… ☆47,830 · Updated this week
- Jenkins automation server ☆25,057 · Updated this week
- Beyond Pure Geometry: An Uncertainty-Driven Perspective on Long-Term LiDAR Localization ☆60 · Mar 14, 2025 · Updated 11 months ago
- A Brackets extension to insert form elements quickly, based on QuickFormTool ☆12 · Mar 18, 2023 · Updated 2 years ago
- Uses KNN to learn & break captcha images containing numbers (uses OpenCV2.1) ☆19 · Aug 13, 2017 · Updated 8 years ago
- Reddit is dead, long live Lemmy ☆14 · Jun 30, 2023 · Updated 2 years ago
- State estimation for UAV, variational Bayesian adaptive Kalman filter, state augmentation. ☆12 · May 7, 2022 · Updated 3 years ago