cubenlp / BIBench
BIBench:数据分析领域LLM评测基准
☆13Updated 6 months ago
Related projects: ⓘ
- ☆124Updated 2 months ago
- An open-source conversational language model developed by the Knowledge Works Research Laboratory at Fudan University.☆62Updated 11 months ago
- 中文大语言模型评测第二期☆68Updated 10 months ago
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDING☆86Updated 5 months ago
- ☆90Updated 6 months ago
- 中文大语言模型评测第三期☆23Updated 3 months ago
- 使用qlora对中文大语言模型进行微调,包含ChatGLM、Chinese-LLaMA-Alpaca、BELLE☆86Updated last year
- YuLan-IR: Information Retrieval Boosted LMs☆211Updated 6 months ago
- A Toolkit for Table-based Question Answering☆94Updated 11 months ago
- The code and data for "StructGPT: A general framework for Large Language Model to Reason on Structured Data"☆95Updated 6 months ago
- The LLM of NL2GQL with NebulaGraph or Neo4j☆83Updated 9 months ago
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated 9 months ago
- CodeGPT: A Code-Related Dialogue Dataset Generated by GPT and for GPT☆106Updated last year
- ☆89Updated 9 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆63Updated 5 months ago
- 怎么训练一个LLM分词器☆123Updated last year
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆25Updated last month
- 中文原生检索增强生成测评基准☆92Updated 5 months ago
- ☆42Updated 9 months ago
- An open-source and powerful Information Extraction toolkit based on GPT (GPT for Information Extraction; GPT4IE for short)。Note: we set a…☆167Updated last year
- ☆126Updated last year
- A large-scale language model for scientific domain, trained on redpajama arXiv split☆120Updated 6 months ago
- moss chat finetuning☆50Updated 4 months ago
- 🐋 An unofficial implementation of Self-Alignment with Instruction Backtranslation.☆128Updated 2 months ago
- FinEval是一个中文金融领域高质量多项选择与文本问答题的集合。☆154Updated 3 months ago
- SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准☆78Updated 10 months ago
- ☆75Updated 5 months ago
- 基于 LoRA 和 P-Tuning v2 的 ChatGLM-6B 高效参数微调☆54Updated last year
- Implementation of "Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation"☆74Updated last year
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆37Updated 6 months ago