☆14Aug 29, 2023Updated 2 years ago
Alternatives and similar repositories for aws-neuron-reference-for-megatron-lm
Users that are interested in aws-neuron-reference-for-megatron-lm are comparing it to the libraries listed below.
- ☆24Nov 18, 2025Updated 7 months ago
- ☆24Jun 4, 2026Updated last month
- ☆66Apr 9, 2026Updated 2 months ago
- ☆23Aug 21, 2025Updated 10 months ago
- Repo for our AKBC-2021 paper: Abg-CoQA: Clarifying Ambiguity in Conversational Question Answering☆11Oct 10, 2021Updated 4 years ago
- Scripts for WASSA-2017 Shared Task on Emotion Intensity☆14Oct 4, 2017Updated 8 years ago
- ☆17Jun 25, 2026Updated last week
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆23May 1, 2022Updated 4 years ago
- Implementation of NAACL 2024 paper Unveiling the Generalization Power of Fine-Tuned Large Language Models☆11Mar 14, 2024Updated 2 years ago
- Repository containing scripts/helpers for configuring a Raspberry Pi to work with XMOS mic frontend☆14Jul 31, 2023Updated 2 years ago
- The official code to reproduce results from the NAACL 2019 paper: White-to-Black: Efficient Distillation of Black-Box Adversarial Attacks☆12Jun 4, 2019Updated 7 years ago
- Codebase accompanying the paper 'Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts', (Emelin, D…☆11Feb 14, 2023Updated 3 years ago
- 🔍 Code Search Tools & Experiments☆12Jun 4, 2026Updated last month
- OpenQASM 3 + OpenPulse in Python☆30May 27, 2026Updated last month
- A guide to structured generation using constrained decoding☆18Jun 9, 2024Updated 2 years ago
- random command line tools for deep learning☆10May 13, 2026Updated last month
- Template repository of a machine-learning Python project powered by FastAPI and PyTorch☆15Aug 26, 2021Updated 4 years ago
- Examples for using AI21's models through Amazon SageMaker☆32Dec 1, 2024Updated last year
- Powering AWS purpose-built machine learning chips. Blazing fast and cost effective, natively integrated into PyTorch and TensorFlow and i…☆613Jun 18, 2026Updated 2 weeks ago
- source code of Multiple-instance Learning Paraphrase (MultiP) Model for Twitter☆13Jun 10, 2016Updated 10 years ago
- ☆65Apr 25, 2025Updated last year
- ☆24Feb 20, 2024Updated 2 years ago
- A tool to build a graph from a codebase☆14Feb 19, 2025Updated last year
- NLP examples(almost Japanese) on AWS☆12May 31, 2022Updated 4 years ago
- Sample code used in the book 「AWS生成AIアプリ構築実践ガイド」 (a practical guide to building generative AI apps on AWS)☆19May 14, 2026Updated last month
- Openfold inference architecture for Amazon EKS☆11Oct 1, 2024Updated last year
- CMP314 Optimizing NLP models with Amazon EC2 Inf1 instances in Amazon Sagemaker☆14Dec 20, 2023Updated 2 years ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆30Feb 4, 2025Updated last year
- Amazon Elastic Inference tools and utilities.☆17Apr 8, 2020Updated 6 years ago
- JAX exponential map normalising flows on sphere☆17Oct 4, 2020Updated 5 years ago
- A Conversational Information Seeking (CIS) Paper Reading List Maintained by Chuan Meng.☆29Sep 27, 2022Updated 3 years ago
- Code and scripts for NAACL 2022 industry track paper "Fast and Light-weight Answer Text Retrieval in Dialogue Systems". Built on top of C…☆13Sep 17, 2025Updated 9 months ago
- A tool to help you generate java call graph.☆10Apr 14, 2021Updated 5 years ago
- Cloud Queue for Quantum Devices☆14Jan 5, 2026Updated 5 months ago
- The SQL-RL-GEN is an algorithm based on a Reinforcement Learning approach with a reward function generated by a LLM to guide the agent's …☆25Sep 18, 2025Updated 9 months ago
- rdiv!(::AbstractMatrix, ::UpperTriangular) and ldiv!(::LowerTriangular, ::AbstractMatrix)☆11Nov 18, 2024Updated last year
- https://openreview.net/forum?id=OC1o4_OI6Jw☆13May 27, 2022Updated 4 years ago
- Julia implementation of flash-attention operation for neural networks.☆11May 31, 2023Updated 3 years ago
- ☆15Feb 26, 2024Updated 2 years ago