philschmid / deepspeed-sagemaker-example
☆18 · Updated last year
Related projects
Alternatives and complementary repositories for deepspeed-sagemaker-example
- ☆100 · Updated 2 months ago
- Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips. ☆209 · Updated this week
- ☆21 · Updated 3 years ago
- Example code for AWS Neuron SDK developers building inference and training applications ☆129 · Updated last month
- ☆95 · Updated last year
- ☆87 · Updated last year
- This project studies the performance and robustness of language models and task-adaptation methods. ☆141 · Updated 6 months ago
- ☆64 · Updated 11 months ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆60 · Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets. ☆64 · Updated last month
- ☆48 · Updated 2 weeks ago
- ☆34 · Updated 2 months ago
- ☆22 · Updated 8 months ago
- ☆73 · Updated last year
- ☆97 · Updated 2 years ago
- Dense hybrid representations for text retrieval ☆62 · Updated last year
- ☆14 · Updated last year
- A dataset focused on summarization of dialogs, which represents the rich domain of Twitter customer care conversations ☆29 · Updated 11 months ago
- Inquisitive Parrots for Search ☆179 · Updated 8 months ago
- ☆93 · Updated last year
- A minimal PyTorch Lightning OpenAI GPT w DeepSpeed Training! ☆110 · Updated last year
- ☆38 · Updated 3 weeks ago
- No Parameter Left Behind: How Distillation and Model Size Affect Zero-Shot Retrieval ☆27 · Updated 2 years ago
- ☆112 · Updated last month
- AuditNLG: Auditing Generative AI Language Modeling for Trustworthiness ☆97 · Updated last year
- Scalable training for dense retrieval models. ☆271 · Updated last year
- ☆55 · Updated last year
- Open source library for few shot NLP ☆77 · Updated last year
- Repository for the "Understanding and Mitigating Language Confusion in LLMs" paper ☆19 · Updated 4 months ago
- experiments with inference on llama ☆105 · Updated 5 months ago