philschmid / deepspeed-sagemaker-example
☆20 · Updated last year
Alternatives and similar repositories for deepspeed-sagemaker-example:
Users interested in deepspeed-sagemaker-example are comparing it to the libraries listed below.
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆60 · Updated last year
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips. ☆217 · Updated this week
- ☆24 · Updated 9 months ago
- ☆88 · Updated last year
- ☆64 · Updated last year
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆111 · Updated last year
- experiments with inference on llama ☆104 · Updated 7 months ago
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness ☆97 · Updated last year
- ☆102 · Updated this week
- ☆50 · Updated last month
- ☆242 · Updated 3 months ago
- This project studies the performance and robustness of language models and task-adaptation methods. ☆142 · Updated 7 months ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆132 · Updated this week
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆154 · Updated last year
- ☆28 · Updated last year
- ☆97 · Updated 2 years ago
- A framework for few-shot evaluation of autoregressive language models. ☆102 · Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any Hugging Face text dataset. ☆93 · Updated last year
- Techniques used to run BLOOM at inference in parallel ☆37 · Updated 2 years ago
- Code for the NeurIPS LLM Efficiency Challenge ☆54 · Updated 9 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆66 · Updated 2 months ago
- A diff tool for language models ☆42 · Updated last year
- ☆50 · Updated this week
- ☆13 · Updated last year
- ☆55 · Updated 2 years ago
- Minimal PyTorch implementation of BM25 (with sparse tensors) ☆97 · Updated 10 months ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks ☆207 · Updated last year
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines ☆133 · Updated last year
- ☆73 · Updated last year