erxiong0 / ReinforcementLearning-R.S.Links
To reproduce the experiments in Sutton's book
☆14Updated 10 months ago
Alternatives and similar repositories for ReinforcementLearning-R.S.
Users that are interested in ReinforcementLearning-R.S. are comparing it to the libraries listed below
Sorting:
- Build Jekyll site with GitBook style!☆14Updated 8 months ago
- A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.☆2,360Updated last week
- ☆1,856Updated last year
- 整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX | Organize the currently open-source optimal table recognition models, improve pre-processing and post-…☆920Updated 6 months ago
- [CVPR 2025] A Comprehensive Benchmark for Document Parsing and Evaluation☆1,446Updated last month
- The Open-Source Data Annotation Platform☆1,176Updated 11 months ago
- 中文对话0.2B小模型(ChatLM-Chinese-0.2B),开源所有数据集来源、数据清洗、tokenizer训练、模型预训练、SFT指令微调、RLHF优化等流程的全部代码。支持下游任务sft微调,给出三元组信息抽取微调示例。☆1,670Updated last year
- 本项目旨在收集开源的表格智能任务数据集(比如表格问答、表格-文本生成等),将原始数据整理为指令微调格式的数据并微调LLM,进而增强LLM对于表格数据的理解,最终构建出专门面向表格智能任务的大型语言模型。☆637Updated last year
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,530Updated last week
- Netease Youdao's open-source embedding and reranker models for RAG products.☆1,861Updated 4 months ago
- ☆545Updated last year
- PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation☆2,362Updated 4 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆623Updated 3 weeks ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,470Updated last week
- Reproduce R1 Zero on Logic Puzzle☆2,430Updated 10 months ago
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,815Updated 9 months ago
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆7,576Updated this week
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,896Updated last year
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,368Updated 8 months ago
- Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案☆617Updated last year
- GraphRAG的应用实例,项目特点在于提供了替换OpenAI模型的方法,并通过修改原有提示和切分文档的方法,提高了GraphRAG处理中文内容的能力。☆185Updated last year
- A Next-Generation Training Engine Built for Ultra-Large MoE Models☆5,082Updated this week
- datasets resource☆129Updated 7 months ago
- FinGLM: 致力于构建一个开放的、公益的、持久的金融大模型项目,利用开源开放来促进「AI+金融」。☆2,158Updated last year
- Community maintained hardware plugin for vLLM on Ascend☆1,618Updated last week
- 企业级RAG系统从入门到精通☆623Updated 7 months ago
- Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷☆5,840Updated this week
- A pre-built agent for TableGPT2.☆631Updated last month
- the resources about the application based on LLM with RAG pattern☆1,621Updated last month
- ⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with …☆3,538Updated this week