opendilab / LLMRiddles
Open-Source Reproduction/Demo of the LLM Riddles Game
☆544Updated last year
Alternatives and similar repositories for LLMRiddles
Users who are interested in LLMRiddles are comparing it to the repositories listed below
- PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements. (e.g. MBTI Measurement Agent)☆178Updated this week
- A collection of recent papers on building autonomous agents. Two topics included: RL-based / LLM-based agents.☆714Updated 7 months ago
- A Game Demo Powered by ChatGPT Agents☆269Updated last year
- AgentSims is an easy-to-use infrastructure for researchers from all disciplines to test the specific capacities they are interested in.☆887Updated last year
- A curated list of awesome UI agent resources, encompassing Web, App, OS, and beyond (continually updated)☆222Updated 4 months ago
- 羊了个羊 (3 Tiles Game) + Deep Reinforcement Learning☆462Updated 5 months ago
- GPT-Fathom is an open-source and reproducible LLM evaluation suite, benchmarking 10+ leading open-source and closed-source LLMs as well a…☆350Updated last year
- ☆718Updated 2 years ago
- Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memo…☆631Updated 2 years ago
- ☆120Updated last year
- GAOKAO-Bench is an evaluation framework that utilizes GAOKAO questions as a dataset to evaluate large language models.☆675Updated 7 months ago
- Crowdfunding open source projects: use OpenReview's high-quality review data to fine-tune a professional review and response LLM. …☆202Updated 2 years ago
- AgentTuning: Enabling Generalized Agent Abilities for LLMs☆1,457Updated last year
- Enhance LLM agents with rich tool APIs☆396Updated 10 months ago
- Official PyTorch Implementation for MathGLM☆325Updated last year
- Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs…☆557Updated 9 months ago
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆240Updated 5 months ago
- ☆905Updated 2 years ago
- A multi-dimensional Chinese alignment evaluation benchmark for large language models (ACL 2024)☆402Updated 11 months ago
- [EMNLP'24] CharacterGLM: Customizing Chinese Conversational AI Characters with Large Language Models☆469Updated 7 months ago
- Awesome LLM Benchmarks to evaluate the LLMs across text, code, image, audio, video and more.☆145Updated last year
- Scenario-based Evaluation dataset for LLM (beta)☆135Updated last year
- This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…☆366Updated last year
- ☆112Updated 7 months ago
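- A role-playing multi-LLM chat room fine-tuned from InternLM2, built on the original text of Journey to the West, its vernacular Chinese rendition, and ChatGPT-generated data. This project covers everything about role-playing LLMs, from data acquisition and data processing, to fine-tuning with XTuner and deploying to OpenXLab, to deploying with LMDeploy, …☆103Updated last year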
- website☆443Updated 5 months ago
- A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI☆767Updated last year
- 🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VL…☆727Updated last week
- TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles.☆153Updated 9 months ago
- LLM Group Chat Framework: chat with multiple LLMs at the same time.☆313Updated last month