libo-huang / Awesome-Causal-Reinforcement-LearningLinks
[TNNLS-2024, arXiv-2023.2.10] Official repository of "A Survey on Causal Reinforcement Learning"
☆211Updated last month
Alternatives and similar repositories for Awesome-Causal-Reinforcement-Learning
Users that are interested in Awesome-Causal-Reinforcement-Learning are comparing it to the libraries listed below
Sorting:
- energy paper with LLM☆52Updated 6 months ago
- ☆27Updated 2 weeks ago
- Resource collection of medical agent for clinical dialogue and health☆230Updated last week
- Official code for NeurIPS2025 "Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers"☆227Updated 2 weeks ago
- LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework☆112Updated this week
- ☆360Updated last month
- Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning☆129Updated 7 months ago
- Updating curated list of research advancements on item identification in generative recommender systems.☆50Updated this week
- (附数据集)基于 PyTorch 实现 MobileNetV2 轻量 CNN 模型,完成 ImageNet 子集 20 类图像分类任务,包含模型训练、损失曲线绘制、卷 积核 / 中间层特征图可视化全流程,附训练权重文件。 (With Dataset)PyTorch impl…☆28Updated this week
- [ACL 2025] The code repository for "Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning" i…☆142Updated 8 months ago
- A high-performance LLM inference engine with PagedAttention | 基于PagedAttention的高性能大模型推理引擎☆34Updated last month
- VPSCI(Vehicle–Pedestrian Safety-Critical Interaction)Dataset☆348Updated last month
- DASFAA2023 FedGR Code Repository. Federated learning for double unbalance settings (sample quantities imbalance for different classes in …☆40Updated 2 years ago
- Ond ESG Intelligence Platform is a cloud-native solution that ingests ESG data, processes it with Azure Data Factory & Databricks, and ap…☆131Updated 2 months ago
- Advanced Multi-Agent Optimization System featuring intelligent routing strategies, semantic memory optimization, distributed coordination…☆15Updated 5 months ago
- 基于 Rust 的隐私优先数据聚合平台,支持网络搜索、RSS 聚合、天气,股票,热榜等多模态搜索能力,私人部署的云端数据获取中心。☆63Updated this week
- 🧊 A High-Perf Quantitative Trading Framework for Crypto☆134Updated last month
- AI Agent Development Platform - Supports multiple models (OpenAI/DeepSeek/Wenxin/Tongyi), knowledge base management, workflow automation,…☆525Updated last month
- 解构rocketmq☆40Updated last month
- A minimalist multi-agent framework for rubost automation of scientific analysis workflows, such as gene expression analysis.☆130Updated 3 months ago
- ☆45Updated last month
- 星瀚外卖是一个后端基于Springboot、前端基于vue的小程序的单体外卖系统,是在黑马程序员的“苍穹外卖”基础上复现和改进☆39Updated this week
- Financial News AI Analysis Notification Service☆208Updated 2 weeks ago
- ☆441Updated this week
- YiShape-Math is a Java math library that provides NumPy-like functionalities including vector & matrix operations, data visualization, st…☆201Updated last month
- ☆318Updated last month
- ☆37Updated 10 months ago
- BizSpring Java微服务开源商城,电商平台,多用户商城,分布式商城,SpringCloud☆57Updated 2 months ago
- 一个简约的web白板,更适合老师体质 | Just a board.☆246Updated this week
- This repository contains experimental reports and training results for my research☆103Updated last month