RUC-GSAI / YuLan-IR
YuLan-IR: Information Retrieval Boosted LMs
β218Updated 11 months ago
Alternatives and similar repositories for YuLan-IR:
Users that are interested in YuLan-IR are comparing it to the libraries listed below
- MEASURING MASSIVE MULTITASK CHINESE UNDERSTANDINGβ88Updated 10 months ago
- T2Ranking: A large-scale Chinese benchmark for passage ranking.β153Updated last year
- π An unofficial implementation of Self-Alignment with Instruction Backtranslation.β136Updated 7 months ago
- An Open-Source Package for Information Retrievalβ160Updated 2 weeks ago
- β271Updated last year
- Repository for Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions, ACL23β189Updated 8 months ago
- β160Updated 11 months ago
- β139Updated 7 months ago
- A large-scale complex question answering evaluation of ChatGPT and similar large-language modelsβ38Updated 9 months ago
- β159Updated last year
- δΈζ倧θ―θ¨ζ¨‘εθ―ζ΅η¬¬δΊζβ70Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuningβ240Updated last year
- Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"β159Updated 6 months ago
- Codebase for RetroMAE and beyond.β249Updated 8 months ago
- β130Updated 10 months ago
- Generative Judge for Evaluating Alignmentβ228Updated last year
- The repository for the survey paper <<Survey on Large Language Models Factuality: Knowledge, Retrieval and Domain-Specificity>>β332Updated 9 months ago
- ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Modelsβ175Updated 4 months ago
- [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other moβ¦β337Updated 5 months ago
- β96Updated 11 months ago
- β137Updated last year
- [NeurIPS 2023] Codebase for the paper: "Guiding Large Language Models with Directional Stimulus Prompting"β105Updated last year
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenariosβ65Updated 2 months ago
- Dataset and evaluation script for "Evaluating Hallucinations in Chinese Large Language Models"β118Updated 8 months ago
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions