通过动画学强化学习笔记
☆67Apr 23, 2026Updated last month
Alternatives and similar repositories for Reinforcement-Learning-Comic-Notes
Users that are interested in Reinforcement-Learning-Comic-Notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ECMLPKDD22] MepoGNN: Metapopulation Epidemic Forecasting with Graph Neural Networks☆32Feb 24, 2026Updated 3 months ago
- [NLPCC 2024] Shared Task 10: Regulating Large Language Models☆15Jun 12, 2024Updated 2 years ago
- ☆31Jul 24, 2025Updated 10 months ago
- ☆15Jul 8, 2023Updated 2 years ago
- NLP/ML面试各类资料链接 汇总(主要Github收集)☆11Mar 3, 2020Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 中文版-A playbook for systematically maximizing the performance of deep learning models.☆23Jan 26, 2023Updated 3 years ago
- An Adaptive Multi-Agent Framework for Dynamic Fact-Checking Evaluation of Large Language Models☆18Feb 27, 2025Updated last year
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Sep 21, 2022Updated 3 years ago
- Siamese Network and Triplet Loss for face recognition in real time☆14Jul 17, 2019Updated 6 years ago
- [AAAI 2025] Code for the paper: "Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition"☆39Apr 15, 2025Updated last year
- ☆19Nov 17, 2019Updated 6 years ago
- [MM'2024] Official release of RFUND introduced in the MM'2024 paper "PEneo: Unifying Line Extraction, Line Grouping, and Entity Linking f…☆21Dec 4, 2024Updated last year
- CMD: a framework for Context-aware Model self-Detoxification (EMNLP2024 Long Paper)☆17Feb 10, 2025Updated last year
- Code and model for AAAI 2024: UMIE: Unified Multimodal Information Extraction with Instruction Tuning☆47Jun 5, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Yet another dependency parser, integrated with tokenizer, tagger and visualization tool.☆11Mar 18, 2018Updated 8 years ago
- Open source code for ICASSP 2021 Paper “Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understandin…☆29Apr 2, 2021Updated 5 years ago
- Learning Evasion Strategy in Pursuit-Evasion by Deep Q-Network, ICPR2018.☆13Dec 22, 2018Updated 7 years ago
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆53Jul 25, 2024Updated last year
- Udacity Full Stack Nanodegree Project 3☆11May 27, 2017Updated 9 years ago
- 基于ROS的多无人机协同控制☆12May 8, 2021Updated 5 years ago
- ☆10Aug 25, 2018Updated 7 years ago
- Over 60 figures and diagrams of LLMs, quantization, low-rank adapters (LoRA), and chat templates FREE TO USE in your blog posts, slides, …☆24Feb 18, 2025Updated last year
- 🔍 OpenSearch-VL provides a fully open recipe for training strong multimodal deep search agents through high-quality data curation, diver…☆212May 19, 2026Updated 3 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Sequence Labeling Parsing by Learning Across Representations☆13Oct 3, 2019Updated 6 years ago
- The repo of ASGMVLP☆19Jan 16, 2026Updated 4 months ago
- A TensorFlow implement for "A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding".☆10Jan 22, 2021Updated 5 years ago
- ☆42Jun 18, 2025Updated 11 months ago
- Closed-loop simulator of complex behavior and learning based on reinforcement learning and deep neural networks☆15Mar 20, 2026Updated 2 months ago
- Learned User Representations in Online Social Networks (Twitter) using Temporal Dynamics of Information Diffusion.☆10Oct 15, 2018Updated 7 years ago
- My lecture notes on the RL series provided by Stanford.☆15Aug 31, 2022Updated 3 years ago
- ☆14Dec 27, 2016Updated 9 years ago
- code space of paper "Safety Layers in Aligned Large Language Models: The Key to LLM Security" (ICLR 2025)☆24Apr 26, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Source code for ACL 2020 paper "Learning Spoken Language Representations with Neural Lattice Language Modeling"☆17Feb 11, 2023Updated 3 years ago
- 使用词性模板抽取中文语料中的名词短语☆18Aug 4, 2025Updated 10 months ago
- A framework for evolving and testing question-answering datasets with various models.☆26Feb 28, 2024Updated 2 years ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated 2 years ago
- 介绍docker、docker compose的使用。☆21Sep 4, 2024Updated last year
- 使用biaffine的中文命名实体识别☆10Jan 12, 2023Updated 3 years ago
- Attention based dialog embedding for dialog breakdown detection (in DSTC6 task 3)☆13Feb 11, 2018Updated 8 years ago