rl-explainer
☆193Mar 9, 2026Updated 3 months ago
Alternatives and similar repositories for rl-explainer
Users who are interested in rl-explainer are comparing it to the libraries listed below.
- ☆14Jan 8, 2025Updated last year
- Code release for "MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning"☆11Oct 11, 2024Updated last year
- Bayesian scaling laws for in-context learning.☆16Mar 12, 2025Updated last year
- LoRA supervised fine-tuning, RLHF (PPO) and RAG with llama-3-8B on the TLDR summarization dataset☆14Feb 2, 2025Updated last year
- Creates a persona dataset from English Reddit comments in the movie category☆11Aug 6, 2021Updated 4 years ago
- Language Models as Semantic Indexers (ICML 2024)☆42May 2, 2024Updated 2 years ago
- PyTorch-based text classification for imbalanced data☆12Dec 26, 2021Updated 4 years ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo…☆12Dec 1, 2021Updated 4 years ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- [SIGIR 2025] Benchmarking Recommendation, Classification, and Tracing Based on Hugging Face Knowledge Graph☆16Jun 6, 2025Updated last year
- Stitches together the per-frame BEV vector maps (internship project)☆12Aug 31, 2023Updated 2 years ago
- Machine learning project: fake news detection implemented in Python☆16Mar 15, 2023Updated 3 years ago
- Metadata browser of TREC☆10May 19, 2026Updated last month
- Environments, tools, and benchmarks for general computer agents☆17Dec 3, 2024Updated last year
- A collection of various LLM pruning implementations, training code for GPUs & TPUs, and evaluation scripts.☆69Apr 20, 2026Updated 2 months ago
- Official repo for PIWM: Enhancing Physical Consistency in Lightweight World Models☆25Nov 26, 2025Updated 7 months ago
- Tools for working with the S800 corpus☆12Sep 17, 2020Updated 5 years ago
- Code for paper: Unified Text-to-Image Generation and Retrieval☆16Jul 6, 2024Updated last year
- Official PyTorch implementation of CVPR2022 paper “Learning to Imagine: Diversify Memory for Incremental Learning using Unlabeled Data”☆13Jul 25, 2022Updated 3 years ago
- Implementation of KDR-Agent, the AAAI 2025 accepted paper, focusing on knowledge-driven reasoning for autonomous agents.☆21Nov 24, 2025Updated 7 months ago
- Reimplementation of graph neural network based generation model, HDMapGen☆21Jul 23, 2024Updated last year
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World☆26Jun 17, 2025Updated last year
- ☆11Feb 19, 2021Updated 5 years ago
- Perhaps everything a machine intelligence student at Tongji University's School of Software Engineering needs for their studies☆20Aug 5, 2024Updated last year
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 11 months ago
- This is the official implementation of our paper Untargeted Backdoor Attack against Object Detection.☆27Mar 6, 2023Updated 3 years ago
- ☆10Mar 11, 2024Updated 2 years ago
- 🎓 Automatically updates LLM inference systems papers daily using GitHub Actions (updates every 12 hours)☆12Jun 22, 2026Updated last week
- (EMNLP 2023 Findings) Text2Tree: Aligning Text Representation to the Label Tree Hierarchy for Imbalanced Medical Classification.☆16Feb 27, 2024Updated 2 years ago
- ☆28Apr 19, 2026Updated 2 months ago
- Implementation of 12 AI agents evaluation techniques☆43Jul 31, 2025Updated 11 months ago
- SIGIR 2021: Proactive Retrieval-based Chatbots based on Relevant Knowledge and Goals☆11Jul 30, 2021Updated 4 years ago
- The Internet Memes Knowledge Graph☆18Oct 18, 2024Updated last year
- ☆11Feb 21, 2024Updated 2 years ago
- ☆16May 11, 2025Updated last year
- YOLOv5 template for leaderboard submissions on the 极市 (Extreme Mart) platform☆17Sep 27, 2023Updated 2 years ago
- 2022 Software Testing course project, Tongji University.☆17Mar 12, 2025Updated last year
- Easy CMake Doxygen template for C++ project.☆12Apr 21, 2017Updated 9 years ago
- ☆11Sep 29, 2023Updated 2 years ago