Azure / synthetic-qa-generation
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured data, assuming a real-world scenario. The sample aims to be step-by-step for developers and data scientists, as well as those in the field, to try it out with a little help.
☆32Updated last week
Related projects ⓘ
Alternatives and complementary repositories for synthetic-qa-generation
- ☆18Updated 3 months ago
- Performs benchmarking on two Korean datasets with minimal time and effort.☆26Updated 2 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models☆25Updated 2 months ago
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators"☆14Updated 2 months ago
- Official code and dataset repository of KoBBQ (TACL 2024)☆14Updated 5 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision☆78Updated last week
- evolve llm training instruction, from english instruction to any language.☆113Updated last year
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"☆20Updated last month
- ☆32Updated last year
- ☆33Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆13Updated 7 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆86Updated last year
- Benchmarking library for RAG☆113Updated this week
- Reward Model을 이용하여 언어모델의 답변을 평가하기☆27Updated 8 months ago
- StrategyQA 데이터 세트 번역☆20Updated 7 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆115Updated 10 months ago
- 1-Click is all you need.☆59Updated 6 months ago
- Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New …☆31Updated 5 months ago
- 언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.☆19Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆65Updated 8 months ago
- ☆19Updated 2 years ago
- ☆9Updated 2 months ago
- Difference-based Contrastive Learning for Korean Sentence Embeddings☆24Updated last year
- This hands-on walks you through fine-tuning an open source LLM on Azure and serving the fine-tuned model on Azure. It is intended for Dat…☆12Updated 4 months ago
- Data processing system for polyglot☆90Updated last year
- A framework for few-shot evaluation of language models.☆17Updated last week
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆36Updated last month
- Calculating Expected Time for training LLM.☆38Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆49Updated 3 months ago
- CareCall for Seniors: Role Specified Open-Domain Dialogue dataset generated by leveraging LLMs (NAACL 2022).☆59Updated 2 years ago