NVIDIA-NeMo / NemotronLinks
Developer Asset Hub for NVIDIA Nemotron — A one-stop resource for training recipes, usage cookbooks, and full end-to-end reference examples to build with Nemotron models
☆33Updated 3 weeks ago
Alternatives and similar repositories for Nemotron
Users that are interested in Nemotron are comparing it to the libraries listed below
Sorting:
- An interface library for RL post training with environments.☆753Updated this week
- This NVIDIA RAG blueprint serves as a reference solution for a foundational Retrieval Augmented Generation (RAG) pipeline.☆377Updated this week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆294Updated this week
- 📚 Tutorial on building a modern search app for Amazon e-commerce products leveraging tabular semantic search and natural language querie…☆86Updated 7 months ago
- Scalable data pre processing and curation toolkit for LLMs☆1,233Updated this week
- This is suite of the hands-on training materials that shows how to scale CV, NLP, time-series forecasting workloads with Ray.☆446Updated last year
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆141Updated last week
- Fine-tune an LLM to perform batch inference and online serving.☆114Updated 6 months ago
- Starter pack for NeurIPS LLM Efficiency Challenge 2023.☆128Updated 2 years ago
- dLLM: Simple Diffusion Language Modeling☆1,022Updated this week
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆48Updated last year
- Collection of reference workflows for building intelligent agents with NIMs☆179Updated 10 months ago
- A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗☆1,101Updated this week
- Hugging Face Deep Learning Containers (DLCs) for Google Cloud☆156Updated 2 weeks ago
- ☆223Updated last month
- How to quickly serve an LLM using Fast API, Celery, and Redis☆16Updated 2 years ago
- ☆176Updated this week
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆133Updated 3 months ago
- An open-source tool for LLM prompt optimization.☆711Updated 2 weeks ago
- ☆266Updated 5 months ago
- Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.☆210Updated this week
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 4 months ago
- ☆20Updated last year
- ☆912Updated 3 weeks ago
- Scalable toolkit for efficient model reinforcement☆1,048Updated this week
- How to serve ML predictions 100x faster☆59Updated last year
- ☆79Updated 2 months ago
- Accelerate your Gen AI with NVIDIA NIM and NVIDIA AI Workbench☆191Updated 6 months ago
- Utils for Unsloth https://github.com/unslothai/unsloth☆173Updated this week
- Banishing LLM Hallucinations Requires Rethinking Generalization☆275Updated last year