Azure / synthetic-qa-generation
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be step-by-step for developers and data scientists, as well as those in the field, to try it out with a little help.
â45Updated 5 months ago
Alternatives and similar repositories for synthetic-qa-generation:
Users that are interested in synthetic-qa-generation are comparing it to the libraries listed below
- Testing DeepSpeed integration in đ¤ Accelerateâ11Updated 2 years ago
- â19Updated 9 months ago
- Code for KaLM-Embedding modelsâ75Updated last month
- An extended project of the LLM Compiler paper, focusing on developing LLM-based Autonomous Agents.â23Updated 6 months ago
- Github repository for "RAGTruth: A Hallucination Corpus for Developing Trustworthy Retrieval-Augmented Language Models"â171Updated 4 months ago
- BERT score for text generationâ11Updated 3 months ago
- evolve llm training instruction, from english instruction to any language.â115Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messagesâ45Updated 4 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPOâ117Updated last year
- Performs benchmarking on two Korean datasets with minimal time and effort.â38Updated last week
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuningâ90Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervisionâ87Updated 5 months ago
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".â106Updated 6 months ago
- â17Updated 11 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.â12Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Modelsâ25Updated 8 months ago
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"â53Updated last year
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmarkâ140Updated 4 months ago
- Benchmarking library for RAGâ193Updated this week
- Lightweight demos for finetuning LLMs. Powered by đ¤ transformers and open-source datasets.â76Updated 6 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"â106Updated 7 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [FâŚâ62Updated 11 months ago
- A curated list of awesome papers about utilizing large language models for ranking.â15Updated 5 months ago
- â43Updated 3 months ago
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey.â151Updated last week
- â17Updated 10 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"â83Updated 8 months ago
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paperâ135Updated 9 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"â33Updated 4 months ago
- â40Updated 8 months ago