pacman100 / accelerate-deepspeed-testLinks
Testing DeepSpeed integration in š¤ Accelerate
ā11Updated 3 years ago
Alternatives and similar repositories for accelerate-deepspeed-test
Users that are interested in accelerate-deepspeed-test are comparing it to the libraries listed below
Sorting:
- This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured ā¦ā55Updated 4 months ago
- [ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervisionā93Updated 10 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPOā116Updated last year
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.ā12Updated last year
- [NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"ā23Updated 10 months ago
- evolve llm training instruction, from english instruction to any language.ā119Updated last year
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuningā94Updated 2 years ago
- ā154Updated last year
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messagesā49Updated 3 weeks ago
- SECOM: On Memory Construction and Retrieval for Personalized Conversational Agents, ICLR 2025ā34Updated 6 months ago
- ā17Updated last year
- ā20Updated 4 years ago
- [TACL 2024] Improving Probability-based Prompt Selection Through Unified Evaluation and Analysisā11Updated 9 months ago
- ā127Updated 11 months ago
- KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Modelsā25Updated last year
- ā31Updated 2 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesisā11Updated 2 years ago
- Implementation of stop sequencer for Huggingface Transformersā16Updated 2 years ago
- [AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Followingā78Updated 11 months ago
- ā20Updated last year
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with Lā¦ā43Updated 2 years ago
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuningā247Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedbackā97Updated 2 years ago
- A hackable, simple, and reseach-friendly GRPO Training Framework with high speed weight synchronization in a multinode environment.ā24Updated last week
- Alpaca-lora for huggingface implementation using Deepspeed and FullyShardedDataParallelā24Updated 2 years ago
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"ā55Updated 11 months ago
- Benchmarking library for RAGā224Updated last month
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsā15Updated 2 years ago
- ā10Updated 11 months ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.ā163Updated last year