NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

☆15,490

Alternatives and similar repositories for NeMo

Users who are interested in NeMo are comparing it to the libraries listed below.

NVIDIA / Megatron-LM
Ongoing research training transformer models at scale
☆13,312 Updated this week
triton-lang / triton
Development repository for the Triton language and compiler
☆16,642 Updated this week
speechbrain / speechbrain
A PyTorch-based Speech Toolkit
☆10,299 Updated last week
facebookresearch / xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
☆9,866 Updated last week
espnet / espnet
End-to-End Speech Processing Toolkit
☆9,395 Updated last week
NVIDIA / FasterTransformer
Transformer related optimization, including BERT, GPT
☆6,280 Updated last year
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆31,738 Updated 2 months ago
OpenNMT / CTranslate2
Fast inference engine for Transformer models
☆3,974 Updated 4 months ago
Dao-AILab / flash-attention
Fast and memory-efficient exact attention
☆18,997 Updated this week
NVIDIA / TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizati…
☆11,437 Updated this week
NVIDIA / DeepLearningExamples
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enter…
☆14,460 Updated last year
google-research / text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
☆6,410 Updated 3 months ago
jax-ml / jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
☆33,265 Updated this week
bitsandbytes-foundation / bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
☆7,490 Updated last week
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,442 Updated this week
triton-inference-server / server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
☆9,672 Updated this week
huggingface / trl
Train transformer language models with reinforcement learning.
☆15,174 Updated this week
UKPLab / sentence-transformers
State-of-the-Art Text Embeddings
☆17,397 Updated 2 weeks ago
microsoft / unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
☆21,661 Updated last month
google / sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
☆11,184 Updated this week
huggingface / accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…
☆9,055 Updated last week
Lightning-AI / pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
☆30,017 Updated this week
facebookresearch / fairscale
PyTorch extensions for high performance and large scale training.
☆3,364 Updated 3 months ago
huggingface / transformers
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model…
☆148,585 Updated this week
huggingface / optimum
🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization…
☆3,035 Updated this week
sgl-project / sglang
SGLang is a fast serving framework for large language models and vision language models.
☆17,106 Updated this week
deepspeedai / DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆39,840 Updated this week
gradio-app / gradio
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
☆39,552 Updated this week
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆55,787 Updated this week
microsoft / LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
☆12,575 Updated 8 months ago