pacman100 / accelerate-deepspeed-test
Testing DeepSpeed integration in 🤗 Accelerate
☆11 · Updated 3 years ago
Alternatives and similar repositories for accelerate-deepspeed-test
Users who are interested in accelerate-deepspeed-test are comparing it to the libraries listed below.
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured … ☆61 · Updated 7 months ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS (iDUS) implementation and experiment details. ☆13 · Updated last year
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision ☆95 · Updated last year
- Evolve LLM training instructions, from English instructions to any language. ☆119 · Updated 2 years ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆94 · Updated 2 years ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆116 · Updated last year
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search" ☆23 · Updated last year
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallel ☆24 · Updated 2 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis ☆11 · Updated 2 years ago
- ☆16 · Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆45 · Updated 2 years ago
- ☆31 · Updated 2 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆97 · Updated 2 years ago
- Calculating Expected Time for training LLM. ☆38 · Updated 2 years ago
- ☆20 · Updated 4 years ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆52 · Updated 3 months ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following ☆78 · Updated last year
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis ☆11 · Updated last year
- Developing a Korean LLM: hate speech filtering, improving conversational skills, and fine-tuning with the RLHF method ☆19 · Updated 6 months ago
- Data Toolkit for Sailor Language Models ☆94 · Updated 9 months ago
- Code for the arXiv preprint "MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts" ☆24 · Updated 2 years ago
- ☆75 · Updated last year
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al. ☆165 · Updated 2 years ago
- [ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting ☆27 · Updated last month
- ☆24 · Updated last year
- ☆129 · Updated last year
- [ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)" ☆53 · Updated last year
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆86 · Updated last year
- Open-WikiTable: Dataset for Open Domain Question Answering with Complex Reasoning over Tables ☆27 · Updated 2 years ago
- 🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch. ☆17 · Updated 5 months ago