pacman100 / accelerate-deepspeed-testLinks

Testing DeepSpeed integration in 🤗 Accelerate

☆11

Alternatives and similar repositories for accelerate-deepspeed-test

Users that are interested in accelerate-deepspeed-test are comparing it to the libraries listed below

Sorting:

lcw99 / evolve-instruct
evolve llm training instruction, from english instruction to any language.
☆118Updated last year
Azure / synthetic-qa-generation
This hands-on lab aims to alleviate some of that headache by demonstrating how to create/augment a QnA dataset from complex unstructured …
☆52Updated last month
gauss5930 / iDUS
An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.
☆12Updated last year
wandb / llm-kr-eval
☆20Updated 11 months ago
KyujinHan / Sakura-SOLAR-DPO
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Updated last year
kaistAI / LangBridge
[ACL 2024] LangBridge: Multilingual Reasoning Without Multilingual Supervision
☆89Updated 7 months ago
Zefty / rag-end2end-retriever
☆20Updated 3 years ago
FreedomIntelligence / MultilingualSIFT
MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning
☆91Updated last year
kaistAI / KtrlF
[NAACL 2024] Official repository for "KTRL+F: Knowledge-Augmented In-Document Search"
☆23Updated 8 months ago
hills-code / open-instruct
☆17Updated last year
HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆78Updated 3 months ago
qhjqhj00 / WebBrain
☆68Updated 2 years ago
alexa / places
This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis
☆11Updated 2 years ago
algoprog / SynTOD
Synthetic data generation for TODs
☆23Updated 11 months ago
BM-K / KoDiffCSE
Difference-based Contrastive Learning for Korean Sentence Embeddings
☆24Updated 2 years ago
kaistAI / Janus
[NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages
☆48Updated 6 months ago
StableFluffy / EasyLLMFeaturePorter
1-Click is all you need.
☆61Updated last year
Alsace08 / SumCoT
[ACL 2023] Code and Data Repo for Paper "Element-aware Summary and Summary Chain-of-Thought (SumCoT)"
☆53Updated last year
yxuansu / Contrastive_Search_Is_What_You_Need
[TMLR'23] Contrastive Search Is What You Need For Neural Text Generation
☆119Updated 2 years ago
cosmoquester / transformers-bart-pretrain
Script to pre-train hugginface transformers BART with Tensorflow 2
☆33Updated 2 years ago
mukhal / PromptRank
[ACL 2023] Few-shot Reranking for Multi-hop QA via Language Model Prompting
☆27Updated 2 years ago
hyunwoongko / stop-sequencer
Implementation of stop sequencer for Huggingface Transformers
☆16Updated 2 years ago
J-Seo / KoCommonGEN-V2
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Updated 10 months ago
amy-hyunji / Generative-Multihop-Retrieval
☆30Updated 2 years ago
Beomi / transformers-language-modeling
Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3
☆23Updated 4 years ago
neulab / data-agora
[arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"
☆33Updated 6 months ago
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
☆84Updated last year
gangiswag / llm-reranker
☆46Updated 5 months ago
joeljang / Pretraining_T5_custom_dataset
Continue Pretraining T5 on custom dataset based on available pretrained model checkpoints
☆38Updated 4 years ago
amazon-science / domain-knowledge-injection
☆35Updated last year