at-aaims / forge
☆12Updated last year
Alternatives and similar repositories for forge:
Users that are interested in forge are comparing it to the libraries listed below
- AMD HPC Research Fund Cloud☆13Updated 2 weeks ago
- A parallel framework for training deep neural networks☆58Updated last month
- train with kittens!☆56Updated 5 months ago
- Example of applying CUDA graphs to LLaMA-v2☆12Updated last year
- Make triton easier☆47Updated 10 months ago
- An introduction to LLM Sampling☆77Updated 4 months ago
- LLM training in simple, raw C/CUDA☆92Updated 11 months ago
- Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace☆14Updated last year
- Inference code for LLaMA models☆42Updated 2 years ago
- look how they massacred my boy☆63Updated 6 months ago
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆16Updated last year
- ☆13Updated 10 months ago
- Gpu benchmark☆59Updated 2 months ago
- Latent Large Language Models☆17Updated 7 months ago
- Graph-indexed Pandas DataFrames for analyzing hierarchical performance data☆32Updated 5 months ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated 10 months ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆39Updated last month
- ☆11Updated this week
- Experiment of using Tangent to autodiff triton☆78Updated last year
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆16Updated 10 months ago
- Python bindings for OpenSHMEM☆16Updated this week
- ReLM is a Regular Expression engine for Language Models☆103Updated last year
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆21Updated 9 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week
- ☆48Updated last year
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated this week
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- ☆13Updated last year
- ☆28Updated last month
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆16Updated this week