学习强化学习过程中的笔记和代码
☆12Jul 27, 2020Updated 5 years ago
Alternatives and similar repositories for RL_notes_and_codes
Users that are interested in RL_notes_and_codes are comparing it to the libraries listed below
Sorting:
- Deep neural network codes for ctr/cvr prediction task in ranking process implemented by Tensorflow (1.14/2.4.1 version), using tf.estimat…☆11Apr 21, 2021Updated 4 years ago
- deploy machine learning model in tensorflow sering and docker☆10Dec 5, 2018Updated 7 years ago
- ☆10Jul 7, 2022Updated 3 years ago
- A Spark application, written in Python, to figure out strongly connected components with Bi-directional Label Propagation algorithm. This…☆11Jun 25, 2019Updated 6 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- A underwater vehicle simulation test-bed with SAUVC swimming pool environment with 6-vectored thruster configuration vehicle operating in…☆10Mar 12, 2022Updated 3 years ago
- Casually implementation a classic metric about clustering☆12Mar 14, 2023Updated 2 years ago
- Optimal and Full Coverage Path Planning for Agricultural Sector☆14May 19, 2021Updated 4 years ago
- ☆11Jan 21, 2022Updated 4 years ago
- 李宏毅老师强化学习笔记☆10May 25, 2021Updated 4 years ago
- 🤖 基于多智能体的A股交易分析系统 | AI-Powered Trading Analysis with Multi-Agent Architecture | 支持技术分析、基本面分析、风险评估☆18Jul 13, 2025Updated 7 months ago
- InternLM-7B微调, SFT/LoRA, instruction finetune☆13May 17, 2024Updated last year
- 深度学习OCR REST api (Flask+Redis+Keras)☆12Jul 23, 2018Updated 7 years ago
- Scrapy Spider for 中国发展改革委员会☆13Nov 17, 2014Updated 11 years ago
- ROS node to convert the amigobot sonar scan depthcloud data to laserscan data for use by amcl☆12Apr 23, 2014Updated 11 years ago
- 基于qwen3的医疗大模型研发全流程 0.分词训练 1.增量预训练 2.微调 3.强化 4.量化 5.蒸馏 6.评估 7.lora模型合并 8.服务 9.部署☆27Jan 3, 2026Updated 2 months ago
- ☆15Dec 22, 2021Updated 4 years ago
- 在A股(股票)市场上训练强化学习交易智能体☆335Mar 27, 2024Updated last year
- 智能家居(Intelligent Furniture)☆12Dec 2, 2019Updated 6 years ago
- WebApp to bring together Text Summarization and Sentiment Analysis of the stock related news to better understand the stock price trends.☆17Apr 7, 2025Updated 10 months ago
- ☆13Nov 19, 2022Updated 3 years ago
- Easy implementations of GCN on Elliptic Datasets☆13Dec 19, 2020Updated 5 years ago
- knowledge graph recommendation☆14Jun 13, 2019Updated 6 years ago
- User behavior prediction from event data.☆16Jun 26, 2023Updated 2 years ago
- 2019语言与智能技术竞赛第5名方案☆14Dec 2, 2019Updated 6 years ago
- standalone node and matlab wrapper for teb-planner package☆13Jul 1, 2021Updated 4 years ago
- A package for multiple ultrasonic sensor into ROS☆12Nov 1, 2017Updated 8 years ago
- Fork of Python's pickle module to work with ZODB☆18Nov 29, 2025Updated 3 months ago
- ☆12Jan 3, 2022Updated 4 years ago
- A simple automatic parking system for car based on fuzzy logics by matlab.☆14Apr 26, 2017Updated 8 years ago
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- Reinforcement Learning for Uplift Modeling☆13Mar 13, 2021Updated 4 years ago
- Official repository for the paper BECLR: Batch Enhanced Contrastive Unsupervised Few-Shot Learning☆16Mar 17, 2024Updated last year
- Stock movement prediction using BERT and GPT-2 based on tweets related to the stocks☆13Jun 24, 2022Updated 3 years ago
- personalized recommendation☆12Mar 26, 2020Updated 5 years ago
- 基于Matlab实现纯跟踪(Pure Pursuit)算法☆16Oct 12, 2022Updated 3 years ago
- "4D TRAJECTORY GENERATION FOR GUIDANCE MODULE OF A UAV FOR A GATE TO GATE FLIGHT IN PRESENCE OF TURBULENCE", International Journal of A…☆16Jul 29, 2018Updated 7 years ago
- Dynamic visualization training service in Jupyter Notebook for Keras, tf.keras and others.☆15Mar 22, 2022Updated 3 years ago
- This directory simulates UUV dynamics and control purely in the Matlab programming language.☆19Nov 10, 2017Updated 8 years ago