☆26Mar 21, 2024Updated 2 years ago
Alternatives and similar repositories for training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed
Users that are interested in training-llm-on-sagemaker-for-multiple-nodes-with-deepspeed are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use.☆14Jul 3, 2023Updated 2 years ago
- Use the two different methods (deepspeed and SageMaker model parallelism library) to fine tune llama model on Sagemaker. Then deploy the …☆24Aug 1, 2023Updated 2 years ago
- Tools to measure latency for LLM in Amazon Bedrook☆22Jan 20, 2026Updated 2 months ago
- The Amazon S3 Transfer Plugin for Data Transfer Hub(https://github.com/awslabs/data-transfer-hub). Transfer objects from S3(in other part…☆49Jan 29, 2025Updated last year
- Learn how to build Alexa Skills with AWS Services.☆26May 20, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Hands-on workshop for distributed training and hosting on SageMaker☆153Nov 4, 2025Updated 4 months ago
- data and code for coling2018 paper☆22Oct 20, 2022Updated 3 years ago
- ☆15Jul 22, 2024Updated last year
- Implementation of the chain of responsibility design pattern☆30Nov 5, 2022Updated 3 years ago
- ☆14May 13, 2021Updated 4 years ago
- Source code for "SimCKP: Simple Contrastive Learning of Keyphrase Representations", Findings of EMNLP 2023☆12Jun 20, 2025Updated 9 months ago
- Generic build server☆64May 25, 2014Updated 11 years ago
- Collection of LLM completions for reasoning-gym task datasets☆31Jul 4, 2025Updated 8 months ago
- The official repo for DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph☆18Oct 13, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MLOps Implementing "Brain Computer Interface" on Kubernetes☆16Sep 30, 2022Updated 3 years ago
- A zero-shot faithfulness evaluation metric for text summarization☆11Oct 17, 2023Updated 2 years ago
- Combinational Class Activation Maps for Weakly Supervised Object Localization☆12Oct 25, 2020Updated 5 years ago
- [ACL2023] Source code for Dialogue Summarization with Static-Dynamic Structure Fusion Graph☆11Dec 17, 2023Updated 2 years ago
- A markdown native slides tool for academics building with agents.☆76Mar 20, 2026Updated last week
- 2D Object Tracking for automated driving using WAYMO data.☆15Jun 17, 2020Updated 5 years ago
- Use Cloudfront to build a proxy for Bedrock to accelerate access from globally☆15Nov 26, 2024Updated last year
- EAFT(Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting) official repo☆93Jan 15, 2026Updated 2 months ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 100行解决中文模糊实体识别with字典树和编辑距离 Chinese fuzzy entity matching with prefix tree and distance editing☆11Sep 25, 2023Updated 2 years ago
- A sample app to debug and validate cellular modems on balena devices☆13Jun 5, 2019Updated 6 years ago
- ☆16Jul 29, 2025Updated 8 months ago
- All notebook for FastAI learning purposes.☆15Jun 11, 2019Updated 6 years ago
- Caching for Graphql Resolvers☆19Nov 21, 2019Updated 6 years ago
- EMNLP 2022: Leveraging Locality in Abstractive Text Summarization☆18Oct 21, 2024Updated last year
- ☆19Sep 15, 2025Updated 6 months ago
- Stuff related to scraping the Code Review StackExchange☆12Jan 19, 2023Updated 3 years ago
- ☆13Jun 20, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"☆28May 28, 2024Updated last year
- Elevator is an open source, on-disk key-value store. Provides high-performance bulk read-write operations over very large datasets while …☆70May 14, 2014Updated 11 years ago
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Apr 22, 2020Updated 5 years ago
- A simple one file python script that executes AI processes defined in YML.☆14Mar 26, 2023Updated 3 years ago
- ☆28Apr 25, 2025Updated 11 months ago
- Stable Diffusion web UI☆17Dec 18, 2023Updated 2 years ago
- Official implementation of the ACL 2022 paper "Learning Non-Autoregressive Models from Search for Unsupervised Sentence Summarization"☆14Dec 26, 2022Updated 3 years ago