NVIDIA / nvidia-dlfw-inspect
The tool facilitates debugging convergence issues and testing new algorithms and recipes for training LLMs with NVIDIA libraries such as Transformer Engine, Megatron-LM, and NeMo.
☆18 · Updated 4 months ago
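For context, the tool is typically imported into a training script and pointed at a YAML config that selects which debug features to enable for which layers. The sketch below follows the usage pattern documented for Transformer Engine's precision-debug tooling; the module path, argument names, and file paths are assumptions to verify against the repository's README.

```python
# Minimal sketch (assumed API, per the Transformer Engine precision-debug docs):
# initialize the inspection framework before building the model, pointing it at
# a YAML config that selects debug features (e.g. logging tensor statistics).
import nvdlfw_inspect.api as debug_api

debug_api.initialize(
    config_file="./debug_config.yaml",  # placeholder: feature-selection config
    feature_dirs=["./debug_features"],  # placeholder: directories with feature implementations
)

# ...then build and train the Transformer Engine / Megatron-LM model as usual;
# the enabled features hook into the layers named in the config.
```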
Alternatives and similar repositories for nvidia-dlfw-inspect
Users interested in nvidia-dlfw-inspect are comparing it to the libraries listed below.
- ArcticInference: vLLM plugin for high-throughput, low-latency inference ☆391 · Updated this week
- LLM KV cache compression made easy ☆876 · Updated 2 weeks ago
- Applied AI experiments and examples for PyTorch ☆315 · Updated 5 months ago
- KernelBench: Can LLMs Write GPU Kernels? - Benchmark + Toolkit with Torch -> CUDA (+ more DSLs) ☆792 · Updated 3 weeks ago
- Accelerating MoE with IO and Tile-aware Optimizations ☆569 · Updated 3 weeks ago
- 🚀 Collection of components for development, training, tuning, and inference of foundation models leveraging PyTorch native components. ☆219 · Updated last week
- 🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash… ☆280 · Updated 2 months ago
- TPU inference for vLLM, with unified JAX and PyTorch support. ☆228 · Updated this week
- Perplexity GPU Kernels ☆560 · Updated 3 months ago
- Training library for Megatron-based models with bidirectional Hugging Face conversion capability ☆419 · Updated this week
- ☆232 · Updated 2 months ago
- Cataloging released Triton kernels. ☆292 · Updated 5 months ago
- Load compute kernels from the Hub ☆397 · Updated this week
- Fast low-bit matmul kernels in Triton ☆427 · Updated last week
- Fault tolerance for PyTorch (HSDP, LocalSGD, DiLoCo, Streaming DiLoCo) ☆475 · Updated last week
- PyTorch-native distributed training library for LLMs/VLMs with out-of-the-box Hugging Face support ☆288 · Updated this week
- Code for data-aware compression of DeepSeek models ☆70 · Updated 2 months ago
- A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS ☆251 · Updated 9 months ago
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime. ☆830 · Updated last week
- Scalable toolkit for efficient model reinforcement ☆1,293 · Updated last week
- Efficient LLM Inference over Long Sequences ☆394 · Updated 7 months ago
- Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core. ☆161 · Updated 2 weeks ago
- FlexAttention-based, minimal vLLM-style inference engine for fast Gemma 2 inference. ☆334 · Updated 3 months ago
- JAX backend for SGL ☆234 · Updated this week
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs) ☆273 · Updated last week
- kernels, of the mega variety ☆665 · Updated last week
- ☆286 · Updated last week
- A Quirky Assortment of CuTe Kernels ☆781 · Updated this week
- ☆236 · Updated last year
- A unified library for building, evaluating, and storing speculative decoding algorithms for LLM inference in vLLM ☆228 · Updated this week