OpenLMLab / GAOKAO-Bench-2023
GAOKAO-Bench-2023 is a supplement to GAOKAO-Bench, a dataset for evaluating large language models.
☆17Updated 9 months ago
Related projects:
- Feeling confused about superalignment? Here is a reading list☆42Updated 8 months ago
- ☆71Updated 8 months ago
- Chinese large language model evaluation, round three☆23Updated 3 months ago
- ☆13Updated 2 months ago
- Code for the paper "RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement"☆28Updated 8 months ago
- ☆27Updated last month
- ☆34Updated 2 weeks ago
- MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation☆17Updated 5 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆56Updated 7 months ago
- AI Alignment: A Comprehensive Survey☆123Updated 10 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆62Updated 7 months ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"…☆65Updated 11 months ago
- Official completion of “Training on the Benchmark Is Not All You Need”.☆18Updated last week
- Awesome papers for role-playing with language models☆88Updated last month
- Achieving Efficient Alignment through Learned Correction☆103Updated 3 months ago
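- This project covers dataset synthesis, model training, and evaluation for the mathematical problem-solving ability of large models, along with notes on related articles.☆41Updated this week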
- ☆82Updated 5 months ago
- ☆75Updated 5 months ago
- This is a personal reimplementation of Google's Infini-transformer, utilizing a small 2b model. The project includes both model and train…☆51Updated 5 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆63Updated 5 months ago
- Our code will be public soon.☆26Updated last year
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆62Updated 11 months ago
- Reference implementation for Token-level Direct Preference Optimization (TDPO)☆89Updated 2 months ago
- The official GitHub page for the survey paper "A Survey on Data Augmentation in Large Model Era"☆101Updated 2 months ago
- SuperCLUE-Math6: exploring a new-generation, native-Chinese, multi-turn, multi-step mathematical reasoning dataset☆38Updated 7 months ago
- ☆87Updated 4 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆45Updated 6 months ago
- Pretrain, decay, and SFT a CodeLLM from scratch 🧙‍♂️☆30Updated 4 months ago
- 1.4B sLLM for Chinese and English - HammerLLM🔨☆43Updated 5 months ago
- Controllable Text Generation for Large Language Models: A Survey☆89Updated 3 weeks ago