TTC: A high-performance Compiler for Tensor Transpositions
☆21 Oct 19, 2017 Updated 8 years ago
Alternatives and similar repositories for TTC
Users that are interested in TTC are comparing it to the libraries listed below.
- Tensor Contraction Code Generator ☆40 Aug 14, 2017 Updated 8 years ago
- Automatic High-Order Optimization for Tensors ☆22 Apr 14, 2023 Updated 3 years ago
- Communication Avoiding Numerical Dense Matrix Computations ☆11 Dec 20, 2020 Updated 5 years ago
- High-Performance Tensor Transpose library ☆205 May 13, 2023 Updated 3 years ago
- Sparse matrix-matrix multiplication on CPU+GPU systems. ☆13 Mar 17, 2014 Updated 12 years ago
- ☆12 Mar 1, 2024 Updated 2 years ago
- Test failing snippets from Nim's issues ☆12 Sep 14, 2018 Updated 7 years ago
- LLVM Version Manager ☆11 Apr 21, 2017 Updated 9 years ago
- ☆14 Feb 1, 2017 Updated 9 years ago
- heterogeneous BLAST (H-BLAST), a fast parallel search tool for a heterogeneous computer that couples CPUs and GPUs, to accelerate BLASTX… ☆12 Jun 20, 2018 Updated 7 years ago
- Evaluation Kit of Joint Recovery of Dense Correspondence and Cosegmentation in Two Images (CVPR 2016) ☆12 Apr 25, 2018 Updated 8 years ago
- 2D & 3D Jump Flooding Algorithm and 2D Centroidal Voronoi Tessellation based on taichi ☆11 Nov 30, 2020 Updated 5 years ago
- The Surprisingly ParalleL spArse Tensor Toolkit. ☆73 Mar 3, 2022 Updated 4 years ago
- Resources for the introductory GPU programming course of the HPC-AI specialized master's program ☆23 Jan 11, 2024 Updated 2 years ago
- Omnivore Optimizer and Distributed CcT ☆13 Jun 17, 2016 Updated 9 years ago
- C/C++ Programming ☆11 May 7, 2024 Updated 2 years ago
- Source code of our implementation of the concurrent RMA ☆12 May 23, 2019 Updated 7 years ago
- Catamount is a compute graph analysis tool to load, construct, and modify deep learning models and to symbolically analyze their compute … ☆14 May 18, 2021 Updated 5 years ago
- A CUDA implementation of the PageRank Pipeline Benchmark ☆32 Jan 31, 2017 Updated 9 years ago
- Library containing advanced collection types and miscellaneous utilities involving iteration ☆20 Jan 20, 2019 Updated 7 years ago
- utf-8 string for Nim ☆15 Jul 1, 2019 Updated 6 years ago
- Continuum Dynamics Evaluation and Test Suite ☆15 Aug 29, 2017 Updated 8 years ago
- C++ library for tensor computations ☆37 Apr 27, 2023 Updated 3 years ago
- Towards Hardware and Software Continuous Integration ☆13 Jun 8, 2020 Updated 6 years ago
- Org export engine for Jekyll on Markdown ☆12 May 12, 2022 Updated 4 years ago
- Oz-style dataflow (single-assignment) variables and streams for Scala ☆42 Nov 20, 2009 Updated 16 years ago
- When you want to be a brilliant man, you should write down something interesting thing for recall. ☆12 Dec 18, 2022 Updated 3 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 Jul 7, 2017 Updated 8 years ago
- SSE intrinsics implementation for ECL & SBCL ☆22 Mar 2, 2016 Updated 10 years ago
- Optimizations on Graph500 ☆10 Jul 15, 2016 Updated 9 years ago
- EmerCoin SSH PKI and distributed ACL ☆15 Mar 4, 2017 Updated 9 years ago
- Experimentation code for the article "Building Topic Models Based on Anchor Words" based on the paper "Learning Topic Models: Going beyon… ☆15 May 13, 2014 Updated 12 years ago
- Functional operations for iterators and slices, similar to sequtils, in Nim ☆22 Sep 15, 2022 Updated 3 years ago
- Linux kernel source tree with fast swap patches. ☆20 Nov 19, 2013 Updated 12 years ago
- Parallel implementation of k-means clustering using MPI4PY and PyCUDA. ☆10 Mar 11, 2019 Updated 7 years ago
- BLAS OpenCL implementation. ☆17 Apr 8, 2015 Updated 11 years ago
- A collection of string sorting algorithms ☆58 May 16, 2026 Updated 3 weeks ago
- A LaTeX version of the UTBM internship report covers using TikZ ☆13 Mar 21, 2018 Updated 8 years ago
- High-Performance Machine Learning Primitives ☆13 Apr 17, 2021 Updated 5 years ago