Azure / synthetic-qa-generation
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be step-by-step for developers and data scientists, as well as those in the field, to try it out with a little help.
☆44Updated 3 months ago
Alternatives and similar repositories for synthetic-qa-generation:
Users that are interested in synthetic-qa-generation are comparing it to the libraries listed below
- ☆19Updated 6 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆84Updated 3 months ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆31Updated 2 weeks ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆116Updated last year
- evolve llm training instruction, from english instruction to any language.☆115Updated last year
- ☆17Updated 9 months ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆42Updated 2 months ago
- Code for KaLM-Embedding models☆71Updated last month
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆89Updated last year
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆125Updated 7 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"☆147Updated 2 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆12Updated 11 months ago
- Comprehensive benchmark for RAG☆114Updated 3 months ago
- Benchmarking library for RAG☆166Updated last week
- An extended project of the LLM Compiler paper, focusing on developing LLM-based Autonomous Agents.☆23Updated 3 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆125Updated 2 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper.☆74Updated 5 months ago
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24☆134Updated 8 months ago
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel☆24Updated last year
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables".☆123Updated 9 months ago
- 1-Click is all you need.☆59Updated 9 months ago
- Code and Data Repo for [ACL 2023] Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"☆53Updated last year
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆23Updated 4 months ago
- ☆33Updated 3 weeks ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆72Updated 2 weeks ago
- ☆29Updated last month
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.☆129Updated 4 months ago
- A framework for few-shot evaluation of language models.☆21Updated last week
- ☆42Updated 8 months ago