pacman100 / accelerate-deepspeed-test
Testing DeepSpeed integration in 🤗 Accelerate
☆11 · Updated 3 years ago
Alternatives and similar repositories for accelerate-deepspeed-test
Users who are interested in accelerate-deepspeed-test are comparing it to the libraries listed below
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured … ☆52 · Updated 2 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆90 · Updated 8 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 · Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS) implementation and experiment details. ☆12 · Updated last year
- Evolve LLM training instructions, from English instructions to any language. ☆118 · Updated last year
- Alpaca-lora for Hugging Face implementation using DeepSpeed and FullyShardedDataParallel ☆24 · Updated 2 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis ☆11 · Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆94 · Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆48 · Updated 7 months ago
- ☆31 · Updated 2 years ago
- ☆20 · Updated 11 months ago
- Code for the arXiv preprint: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts ☆21 · Updated last year
- Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints ☆38 · Updated 4 years ago
- ACL 2023 short: Balancing Lexical and Semantic Quality in Abstractive Summarization ☆16 · Updated last year
- Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Table ☆24 · Updated 2 years ago
- ☆10 · Updated 10 months ago
- ☆17 · Updated last year
- The git repository of Modular Prompted Chatbot paper ☆34 · Updated 2 years ago
- Benchmarking library for RAG ☆213 · Updated last month
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search" ☆23 · Updated 9 months ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators" ☆33 · Updated 7 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆70 · Updated last year
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators" ☆23 · Updated 10 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆43 · Updated 2 years ago
- Code for "Democratizing Reasoning Ability: Tailored Learning from Large Language Model", EMNLP 2023 ☆35 · Updated last year
- ☆124 · Updated 9 months ago
- ☆23 · Updated last year
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models ☆25 · Updated 10 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆163 · Updated last year
- Finetune mistral-7b-instruct for sentence embeddings ☆85 · Updated last year