Blazing fast training of π€ Transformers on Graphcore IPUs
β87Mar 11, 2024Updated 2 years ago
Alternatives and similar repositories for optimum-graphcore
Users that are interested in optimum-graphcore are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A new repo to demonstrate tutorials for using HuggingFace on Graphcore IPUs.β12May 3, 2023Updated 2 years ago
- The application is a end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise β¦β18Mar 12, 2026Updated last week
- Easy and lightning fast training of π€ Transformers on Habana Gaudi processor (HPU)β207Mar 16, 2026Updated last week
- Example code and applications for machine learning on Graphcore IPUsβ333Mar 5, 2024Updated 2 years ago
- Training material for IPU users: tutorials, feature examples, simple applicationsβ88Apr 6, 2023Updated 2 years ago
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,332Mar 13, 2026Updated last week
- β13Apr 30, 2024Updated last year
- The OGB-LSC is the Large Scale Competition by Open Graph Benchmark to help accelerate research into machine learning on graph structured β¦β79Jul 25, 2024Updated last year
- Accelerated inference of π€ models using FuriosaAI NPU chips.β27Jun 9, 2025Updated 9 months ago
- π€ Optimum Intel: Accelerate inference with Intel optimization toolsβ549Mar 16, 2026Updated last week
- Make your own mask. My mask protects you. Your mask protects me.β27Oct 20, 2022Updated 3 years ago
- Documentation Sprint for the fastai deep learning libraryβ15May 11, 2022Updated 3 years ago
- This is a simple torch implementation of the high performance Multi-Query Attentionβ16Aug 23, 2023Updated 2 years ago
- Use OpenAI with HuggingChat by emulating the text_generation_inference_serverβ44Jun 25, 2023Updated 2 years ago
- Hugging Face Jobsβ19Jul 11, 2025Updated 8 months ago
- MLCommons Science benchmarking working groupβ13May 19, 2023Updated 2 years ago
- Artifacts of VLDB'22 paper "COMET: A Novel Memory-Efficient Deep Learning TrainingFramework by Using Error-Bounded Lossy Compression"β10Aug 2, 2022Updated 3 years ago
- ποΈ Retroactively fix your Zoom recordings with a click! Won 1st Place, Best Use of GCP, Best Start-Up, and Best Entrepreneurial Hack at β¦β10Feb 10, 2022Updated 4 years ago
- hybrid computing engine executed by both GPU and multicore to accelerate PH matrix reductionβ13Dec 2, 2019Updated 6 years ago
- End-to-end example of training, exporting and deploying a fastai model to a native iOS appβ11Mar 2, 2023Updated 3 years ago
- All-in-one repository for Fine-tuning & Pretraining (Large) Language Modelsβ15Mar 8, 2023Updated 3 years ago
- Discover how to build vision transformer from scratch with this comprehensive tutorial. Follow our step-by-step guide to create your own β¦β11Apr 14, 2023Updated 2 years ago
- Document parameters using commentsβ10Aug 6, 2021Updated 4 years ago
- β12Feb 11, 2026Updated last month
- python package of rocm-smi-libβ24Dec 15, 2025Updated 3 months ago
- A diff tool for language modelsβ44Dec 28, 2023Updated 2 years ago
- Repository for "Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators"β12Mar 25, 2025Updated 11 months ago
- The ROCdebug-agent is a library that can be loaded by ROCm Platform Runtime to provide some debugging functionality.β32Feb 27, 2026Updated 3 weeks ago
- A tutorial example for nbdevβ15Feb 26, 2022Updated 4 years ago
- TPU support for the fastai libraryβ13Apr 15, 2021Updated 4 years ago
- RACE is a multi-dimensional benchmark for code generation that focuses on Readability, mAintainability, Correctness, and Efficiency.β12Oct 12, 2024Updated last year
- Awesome Quantization Paper lists with Codesβ10Feb 24, 2021Updated 5 years ago
- WIPE implementationβ13Nov 26, 2023Updated 2 years ago
- A portable implementation of SZ lossy compression for AMD GPUs and Hygon DCUs.β10Feb 26, 2025Updated last year
- β24Updated this week
- HDF5 Cache VOL connector for caching data on fast storage layers and moving data asynchronously to the parallel file system to hide I/O oβ¦β21Feb 10, 2026Updated last month
- Ahead of Time (AOT) Triton Math Libraryβ94Updated this week
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.β15Sep 4, 2024Updated last year
- β14Jun 4, 2024Updated last year