☆26Mar 21, 2024Updated 2 years ago
Alternatives and similar repositories for training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed
Users that are interested in training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Mar 27, 2023Updated 3 years ago
- ☆10Sep 7, 2023Updated 2 years ago
- ☆91Jul 30, 2024Updated last year
- Tools to measure latency for LLM in Amazon Bedrook☆22Jan 20, 2026Updated 4 months ago
- ☆11Dec 8, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆13May 8, 2025Updated last year
- 自己整理了一些ML入门的知识,以及ML项目中的一些经验总结分享☆329Jun 6, 2026Updated last week
- c++ mosestokenizer☆18Mar 13, 2024Updated 2 years ago
- End-to-End Neural Event Coreference Resolution☆11Jun 18, 2023Updated 3 years ago
- ☆13Oct 19, 2023Updated 2 years ago
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 11 months ago
- Python code for Leetcode☆13Jul 16, 2017Updated 8 years ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 11 months ago
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆46Aug 4, 2025Updated 10 months ago
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- This is official implementation of "Curriculum Fine-tuning of Vision Foundation Model for Medical Image Classification Under Label Noise"…☆22Mar 18, 2025Updated last year
- [ACL2023] Source code for Dialogue Summarization with Static-Dynamic Structure Fusion Graph☆11Dec 17, 2023Updated 2 years ago
- Implementing a fast scaling and low cost Stable Diffusion inference solution with serverless and containers on AWS☆41May 21, 2024Updated 2 years ago
- Use Cloudfront to build a proxy for Bedrock to accelerate access from globally☆16Nov 26, 2024Updated last year
- EAFT(Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo☆102Jan 15, 2026Updated 5 months ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 3 years ago
- React 0.13 with ES6, Immutable.js and Flux, Isomorphic as well☆11Mar 10, 2015Updated 11 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Jul 29, 2025Updated 10 months ago
- ☆14Jun 28, 2023Updated 2 years ago
- Official implementation of "On the Effectiveness of Lipschitz-Driven Rehearsal in Continual Learning"☆14Oct 13, 2022Updated 3 years ago
- ☆20Dec 22, 2021Updated 4 years ago
- EMNLP 2022: Leveraging Locality in Abstractive Text Summarization☆18Oct 21, 2024Updated last year
- 报表引擎☆15Feb 21, 2019Updated 7 years ago
- Stuff related to scraping the Code Review StackExchange☆12Jan 19, 2023Updated 3 years ago
- ☆13Jun 20, 2018Updated 7 years ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 5 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Super Mario is a legendary game we all cherish! In this project, we will deploy Super Mario on Amazon EKS (Elastic Kubernetes Service) us…☆11Feb 3, 2026Updated 4 months ago
- ☆11Feb 3, 2025Updated last year
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated 2 years ago
- Elevator is an open source, on-disk key-value store. Provides high-performance bulk read-write operations over very large datasets while …☆70May 14, 2014Updated 12 years ago
- Dataset and code for TOIS 2022 "Follow the Timeline! Generating Abstractive and Extractive Timeline Summary in Chronological Order"☆37May 26, 2024Updated 2 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Apr 22, 2020Updated 6 years ago
- Yet another isomorphic react boilerplate. This one does not require node on server.☆10Apr 4, 2017Updated 9 years ago