HAMNET-AI / PDFTriage
Reproduction paper --- PDFTriage : Question Answering over Long, Structured Documents
☆40Updated 8 months ago
Related projects: ⓘ
- A Toolkit for Table-based Question Answering☆94Updated 11 months ago
- [ACL 2024] IEPile: A Large-Scale Information Extraction Corpus☆154Updated 2 months ago
- 1st Solution For Conversational Multi-Doc QA Workshop & International Challenge @ WSDM'24 - Xiaohongshu.Inc☆151Updated 6 months ago
- Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation☆61Updated 4 months ago
- TianGong-AI-Unstructure☆48Updated this week
- ☆109Updated 5 months ago
- ☆89Updated 9 months ago
- LAiW: A Chinese Legal Large Language Models Benchmark☆62Updated 2 months ago
- 中文原生检索增强生成测评基准☆92Updated 5 months ago
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"☆95Updated last month
- A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.☆101Updated 7 months ago
- ☆90Updated 5 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"☆105Updated 3 months ago
- ☆111Updated 6 months ago
- ☆124Updated 2 months ago
- LLaMA Factory Document☆61Updated 3 weeks ago
- ChatGLM2-6B微调, SFT/LoRA, instruction finetune☆107Updated last year
- 大语言模型指令调优工具(支持 FlashAttention)☆162Updated 8 months ago
- Evaluating LLMs' multi-round chatting capability via assessing conversations generated by two LLM instances.☆132Updated 10 months ago
- ☆156Updated last year
- [EMNLP 2023 Demo] CLEVA: Chinese Language Models EVAluation Platform☆55Updated 9 months ago
- 中文大语言模型评 测第二期☆68Updated 10 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other mo…☆281Updated last week
- 文本去重☆65Updated 3 months ago
- Imitate OpenAI with Local Models☆83Updated 3 weeks ago
- A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark☆96Updated last year