erxiong0 / ReinforcementLearning-R.S.
To reproduce the experiments in Sutton's book
☆13Updated last month
Alternatives and similar repositories for ReinforcementLearning-R.S.
Users that are interested in ReinforcementLearning-R.S. are comparing it to the libraries listed below
- Build Jekyll site with GitBook style!☆12Updated last week
- ☆935Updated 3 months ago
- Data annotation toolbox supports image, audio and video data.☆1,190Updated 2 weeks ago
- The Open-Source Data Annotation Platform☆811Updated 2 months ago
- ☆494Updated 9 months ago
- Qwen (Tongyi Qianwen) VLLM inference and deployment demo☆577Updated last year
- Collects the best currently open-source table recognition models, improves pre- and post-processing, and converts the models to ONNX☆676Updated last month
- Yuan 2.0 Large Language Model☆683Updated 10 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆961Updated this week
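- A RAG (retrieval-augmented generation) system suited to learning, everyday use, and independent extension. Can go online to perform AI search☆484Updated 8 months ago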
- A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team…☆1,709Updated last month
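- Enterprise-grade RAG systems, from beginner to mastery☆462Updated 2 months ago
- FinQwen: a project dedicated to building an open, stable, high-quality financial large model, building an intelligent question-answering system for financial scenarios on top of large models, and using open source to promote "AI + finance".☆379Updated 11 months ago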
- [ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.☆1,830Updated 4 months ago
- [Chinese legal large language model] DISC-LawLLM: an intelligent legal system powered by large language models (LLMs) to provide a wide range of legal services.☆703Updated 4 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆224Updated last month
- ☆691Updated last month
- [ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models☆347Updated last year
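- Run through the ChatGPT technical roadmap from scratch.☆233Updated 8 months ago
- WanJuan 1.0 multimodal corpus☆560Updated last year
- This project aims to collect open-source datasets for table intelligence tasks (such as table question answering and table-to-text generation), convert the raw data into instruction-tuning format, and fine-tune LLMs on it, thereby strengthening LLMs' understanding of tabular data and ultimately building a large language model dedicated to table intelligence tasks.☆580Updated last year
- Llama Chinese community: the best Chinese Llama large model, fully open source and available for commercial use☆12Updated last year
- Adds a 🚀 streaming web server to GraphRAG, compatible with the OpenAI SDK, with support for accessible entity links 🔗 and suggested questions, compatibility with local embedding models, and many bug fixes.☆251Updated last month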
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.☆6,335Updated this week
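- An open-source educational chat model from ICALK, East China Normal University. An open-source Chinese-English educational dialogue large model. (General base model, GPU deployment, data cleaning) Tribute to: LLaMA, MOSS, BELLE, Z…☆792Updated last week
- A simple and fast tool for word segmentation and named entity recognition☆582Updated last month
- A curated collection of open-source SFT datasets, updated continually☆513Updated last year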
- Distributed RL System for LLM Reasoning☆1,248Updated 2 weeks ago
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆2,355Updated this week
- Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for connecting langchain to load a local knowledge base for retrieval-augmented generation (RAG).☆551Updated 10 months ago