at-aaims / forgeLinks
☆15Updated 2 months ago
Alternatives and similar repositories for forge
Users that are interested in forge are comparing it to the libraries listed below
Sorting:
- A parallel framework for training deep neural networks☆62Updated 4 months ago
- AMD HPC Research Fund Cloud☆14Updated 2 months ago
- LLM training in simple, raw C/CUDA☆99Updated last year
- This is the open source version of HPL-MXP. The code performance has been verified on Frontier☆17Updated last week
- Official Implementation of "CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks"☆20Updated last month
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆39Updated 2 months ago
- A benchmark framework for Pytorch☆26Updated 4 months ago
- Adaptive Parallel PDF Parsing and Resource Scaling Engine☆48Updated last month
- ☆12Updated last year
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated last month
- Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace☆16Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆87Updated this week
- High-Performance SGEMM on CUDA devices☆97Updated 5 months ago
- Gpu benchmark☆63Updated 5 months ago
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆46Updated 4 months ago
- Blazing fast training of 🤗 Transformers on Graphcore IPUs☆85Updated last year
- train with kittens!☆61Updated 8 months ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆26Updated last week
- ☆48Updated 8 months ago
- look how they massacred my boy☆63Updated 9 months ago
- FMS Model Optimizer is a framework for developing reduced precision neural network models.☆20Updated this week
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference.☆78Updated last month
- Inference code for LLaMA models☆42Updated 2 years ago
- Example of applying CUDA graphs to LLaMA-v2☆12Updated last year
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆73Updated 2 weeks ago
- ☆14Updated last year
- Data and reproducibility scripts for the UoB-HPC Performance Portability studies☆17Updated last year
- Port of the RWKV-LM model in Fortran (Back to the Future!)☆49Updated last year
- ☆21Updated 4 months ago
- Data preparation code for Amber 7B LLM☆91Updated last year