at-aaims / forge

☆15

Alternatives and similar repositories for forge

Users who are interested in forge are comparing it to the repositories listed below.

axonn-ai / axonn
A parallel framework for training deep neural networks
☆62Updated 4 months ago
AMDResearch / hpcfund
AMD HPC Research Fund Cloud
☆14Updated 2 months ago
gevtushenko / llm.c
LLM training in simple, raw C/CUDA
☆99Updated last year
at-aaims / OpenMxP
This is the open source version of HPL-MXP. The code performance has been verified on Frontier
☆17Updated last week
spcl / CheckEmbed
Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"
☆20Updated last month
groq / mlagility
Machine Learning Agility (MLAgility) benchmark and benchmarking tools
☆39Updated 2 months ago
aime-team / pytorch-benchmarks
A benchmark framework for PyTorch
☆26Updated 4 months ago
7shoe / AdaParse
Adaptive Parallel PDF Parsing and Resource Scaling Engine
☆48Updated last month
sambanova / tutorials
☆12Updated last year
graphcore / distributed-kge-poplar
The application is an end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …
☆18Updated last month
graphcore / Gradient-HuggingFace
Tasks and tutorials using Graphcore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace
☆16Updated last year
EmbeddedLLM / vllm
vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs
☆87Updated this week
salykova / sgemm.cu
High-Performance SGEMM on CUDA devices
☆97Updated 5 months ago
mag- / gpu_benchmark
GPU benchmark
☆63Updated 5 months ago
abacusai / gh200-llm
Docker image for NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning
☆46Updated 4 months ago
huggingface / optimum-graphcore
Blazing fast training of 🤗 Transformers on Graphcore IPUs
☆85Updated last year
HazyResearch / train-tk
train with kittens!
☆61Updated 8 months ago
ROCm / hipRAND
[DEPRECATED] Moved to ROCm/rocm-libraries repo
☆26Updated last week
apple / ml-hypercloning
☆48Updated 8 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
foundation-model-stack / fms-model-optimizer
FMS Model Optimizer is a framework for developing reduced precision neural network models.
☆20Updated this week
PiotrNawrot / nano-sparse-attention
The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.
☆78Updated last month
LambdaLabsML / llama
Inference code for LLaMA models
☆42Updated 2 years ago
fw-ai / llama-cuda-graph-example
Example of applying CUDA graphs to LLaMA-v2
☆12Updated last year
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆73Updated 2 weeks ago
yixiaoer / tpu-training-example
☆14Updated last year
UoB-HPC / performance-portability
Data and reproducibility scripts for the UoB-HPC Performance Portability studies
☆17Updated last year
nlpodyssey / rwkv.f90
Port of the RWKV-LM model in Fortran (Back to the Future!)
☆49Updated last year
lianakoleva / no-libtorch-compile
☆21Updated 4 months ago
LLM360 / amber-data-prep
Data preparation code for Amber 7B LLM
☆91Updated last year