☆14Aug 29, 2023Updated 2 years ago
Alternatives and similar repositories for aws-neuron-reference-for-megatron-lm
Users that are interested in aws-neuron-reference-for-megatron-lm are comparing it to the libraries listed below
Sorting:
- ☆23Nov 18, 2025Updated 3 months ago
- ☆23Aug 21, 2025Updated 6 months ago
- ☆17Feb 19, 2024Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆17Jul 1, 2022Updated 3 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Example code for AWS Neuron SDK developers building inference and training applications☆158Jan 15, 2026Updated last month
- ☆24Feb 20, 2024Updated 2 years ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- JAX exponential map normalising flows on sphere☆17Oct 4, 2020Updated 5 years ago
- ☆63Dec 20, 2025Updated 2 months ago
- "Moshpit SGD: Communication-Efficient Decentralized Training on Heterogeneous Unreliable Devices", official implementation☆30Feb 4, 2025Updated last year
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Here, I provided the solution for exercises of IBM Quantum Challenge 2020☆10Oct 27, 2020Updated 5 years ago
- Material associated with Physics Report "Data science applications to string theory"☆11Jun 20, 2023Updated 2 years ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- Neural Error Mitigation of Near-Term Quantum Simulations (arXiv:2105.08086)☆10Jul 6, 2022Updated 3 years ago
- A schedule language for large model training☆152Aug 21, 2025Updated 6 months ago
- Repository containing scripts/helpers for configuring a Raspberry Pi to work with XMOS mic frontend☆14Jul 31, 2023Updated 2 years ago
- angle-sequence☆12Apr 3, 2020Updated 5 years ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- ☆14Feb 26, 2024Updated 2 years ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation"☆10May 20, 2022Updated 3 years ago
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- ☆13Updated this week
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- Sparse symmetric indefinite solver implemented with a runtime system☆13May 11, 2020Updated 5 years ago
- ☆10Jul 22, 2024Updated last year
- ☆12Aug 15, 2023Updated 2 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- Simple tutorial to get familiar with how to program quantum computers using Qiskit☆11Sep 9, 2019Updated 6 years ago
- 書籍「AWS生成AIアプリ構築実践ガイド」で使用するサンプルコード☆18Aug 21, 2025Updated 6 months ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- Material for the course Theories of Quantum Matter at the University of Cambridge☆11Jan 20, 2023Updated 3 years ago
- Pytorch implementation of HCNAF: Hyper-Conditioned Neural Autoregressive Flow (CVPR 2020)☆15Jun 14, 2020Updated 5 years ago
- The contrastive token loss function for reducing generative repetition of autoregressive neural language models.☆13May 11, 2022Updated 3 years ago
- A guide to structured generation using constrained decoding☆14Jun 9, 2024Updated last year
- MLflow App Using React, Hooks, RabbitMQ, FastAPI Server, Celery, Microservices☆11Sep 25, 2022Updated 3 years ago
- Ab Initio Energies☆10Nov 22, 2025Updated 3 months ago