facebookresearch/FAMBench

Benchmarks to capture important workloads.

☆32

Alternatives and similar repositories for FAMBench

Users who are interested in FAMBench are comparing it to the repositories listed below.

ROCm / rocmProfileData
☆30 · Jun 16, 2026 · Updated last month
zdevito / custom_loader
☆12 · May 25, 2021 · Updated 5 years ago
mingfeima / pytorch_profiler_parser
parser script to process pytorch autograd profiler result, convert json file to excel.
☆15 · Oct 8, 2019 · Updated 6 years ago
suo / lintrunner
☆31 · May 10, 2026 · Updated 2 months ago
ParCoreLab / CPU-Free-model
Source code for the CPU-Free model - a fully autonomous execution model for multi-GPU applications that completely excludes the involveme…
☆21 · Apr 25, 2024 · Updated 2 years ago
t-vi / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆17 · Jan 16, 2024 · Updated 2 years ago
intel / flexmalloc
Flexible memory allocation tool for multi-tiered memory systems
☆15 · Jan 7, 2026 · Updated 6 months ago
mlcommons / inference_results_v4.0
This repository contains the results and code for the MLPerf™ Inference v4.0 benchmark.
☆11 · Jul 24, 2025 · Updated 11 months ago
olcf / NVIDIA-tensor-core-examples
☆20 · Nov 7, 2019 · Updated 6 years ago
bertmaher / llama2.so
Inference Llama 2 with a model compiled to native code by TorchInductor
☆14 · Feb 8, 2024 · Updated 2 years ago
gagolews / lmlcr
Lightweight Machine Learning Classics with R (Book Draft)
☆16 · Jun 13, 2022 · Updated 4 years ago
parlab-tuwien / lockfree-linked-list
A more Pragmatic Implementation of the Lock-free, Ordered, Linked List
☆19 · Dec 20, 2020 · Updated 5 years ago
libxsmm / parlooper
PArallelLOOPgEneratoR: Threaded Loops Code Generation Infrastructure targeting Tensor Contraction Applications such as GEMMs, Convolution…
☆19 · Jan 22, 2026 · Updated 5 months ago
google / pyctr
☆22 · Jul 31, 2019 · Updated 6 years ago
hao-ai-lab / flash-attention-fp4
NVFP4 Flash-Attention 4 on BlackWell
☆28 · Updated this week
jasontbradshaw / irobot
A human-friendly implementation of the iRobot Open Interface version 2 API.
☆14 · May 14, 2016 · Updated 10 years ago
UofT-EcoSystem / Tempo
Memory footprint reduction for transformer models
☆11 · Jan 24, 2023 · Updated 3 years ago
iree-org / iree-nvgpu
☆48 · Mar 5, 2024 · Updated 2 years ago
mlcommons / inference_policies
Issues related to MLPerf® Inference policies, including rules and suggested changes
☆62 · Jul 7, 2026 · Updated 2 weeks ago
EECS150 / project_skeleton_sp20
EECS 151/251A FPGA Project Skeleton for Spring 2020
☆12 · May 6, 2020 · Updated 6 years ago
sharan-dce / autograd
Auto-differentiation library for C++
☆12 · Jan 16, 2022 · Updated 4 years ago
ROCm / FlyDSL
FlyDSL is the Python front‑end of the project: Flexible LaYout DSL.
☆237 · Updated this week
reger-men / HPL_GPU
High-Performance Linpack Benchmark adopted version for GPU backend
☆12 · Sep 12, 2022 · Updated 3 years ago
NVIDIA / apt-packaging-cuda-keyring
CUDA keyring packaging for Debian
☆14 · Apr 14, 2023 · Updated 3 years ago
meta-pytorch / multipy
torch::deploy (multipy for non-torch uses) is a system that lets you get around the GIL problem by running multiple Python interpreters i…
☆179 · Dec 16, 2025 · Updated 7 months ago
mrcat2018 / AutodiffEngine
AutodiffEngine
☆13 · Apr 1, 2019 · Updated 7 years ago
cxphong / Build-gstreamer-Raspberry-Pi-3
Build gstreamer on Raspberry Pi 3
☆14 · Nov 2, 2018 · Updated 7 years ago
ekondis / cl2-reduce-bench
A test case for evaluating the performance of the workgroup reduction operation in OpenCL 2.0
☆10 · Nov 26, 2020 · Updated 5 years ago
mk1-project / quickreduce
QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.
☆38 · Aug 29, 2025 · Updated 10 months ago
amd / UIF
☆61 · Sep 15, 2023 · Updated 2 years ago
antkillerfarm / antkillerfarm_crazy
antkillerfarm's crazy magic
☆18 · Oct 3, 2024 · Updated last year
ezyang / ai-blindspots
Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.
☆13 · Mar 20, 2025 · Updated last year
suzukimain / auto_diffusers
diffusers with search engine
☆12 · Jan 13, 2026 · Updated 6 months ago
jansel / pytorch-jit-paritybench
☆42 · Dec 10, 2024 · Updated last year
ZeusYang / NBodySimulation
N-body simulation based on CUDA.
☆14 · Jun 20, 2019 · Updated 7 years ago
Deep-Learning-Profiling-Tools / triton-samples
☆14 · Mar 8, 2025 · Updated last year
ROCm / AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (N…
☆12 · Jun 24, 2024 · Updated 2 years ago
MHageH / c_uart_interface
A heavy modification of the original c_uart_interface_example, works on ARM Cortex-M4 STM32F4 (as an offboard processor)
☆11 · Jul 8, 2016 · Updated 10 years ago
ROCm / hipTensor
AMD’s C++ library for accelerating tensor primitives
☆49 · Updated this week