Pastorsin / python-hpc-study
Incremental optimizations to the N-Body problem in order to evaluate and compare the performance of Python translators in the HPC environment.
☆14Updated 2 years ago
Alternatives and similar repositories for python-hpc-study
Users who are interested in python-hpc-study are comparing it to the repositories listed below:
- Roblox-Oxygen is a cutting-edge plugin designed to enhance the graphical performance and experience of Roblox games. It offers advanced r…☆17Updated last year
- 💻 As a Frontend Development Intern at Shen AI (Aug – Oct 2024), I built the company website using React.js and worked with the design te…☆15Updated 4 months ago
- Apple-Ware: Delivers 80% UNC and Level 7 capabilities, backed by a responsive support team and a sleek, intuitive interface. Regular upda…☆22Updated 11 months ago
- High-performance technical indicators library for financial analysis, optimized with Numba☆12Updated 3 months ago
- "Optimizing Performance and Energy Efficiency in Massively Parallel Systems" PhD Dissertation repository.☆31Updated 2 years ago
- Optimizing loading training data from cloud bucket storage for cloud-based distributed deep learning. Official repository for Quantifying…☆12Updated 3 years ago
- Implementing a comprehensive Quantitative Momentum Strategy to optimize portfolio allocation. The strategy integrates two key financial i…☆14Updated 2 years ago
- Library with JIT (Just-in-time) compilation support to optimize performance of small and medium matrix multiplication☆15Updated 4 years ago
- Sparse Matrix Factorization (SMF) is a key component in many machine learning problems and there exist a variety of applications in real-w…☆12Updated 9 years ago
- Repository for running LLMs efficiently on Mac silicon (M1, M2, M3). Features Jupyter notebook for Meta-Llama-3 setup using MLX framework…☆12Updated last year
- An investment portfolio of stocks is created using Long Short-Term Memory (LSTM) stock price prediction and optimized weights. The perfor…☆35Updated last year
- DPLASMA is a highly optimized, accelerator-aware, implementation of a dense linear algebra package for distributed heterogeneous systems…☆16Updated 4 months ago
- ☆24Updated 2 weeks ago
- Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios.☆103Updated last year
- ☆29Updated last week
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆61Updated 2 weeks ago
- Benchmarks to capture important workloads.☆31Updated 7 months ago
- Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU.☆638Updated this week
- A throughput-oriented high-performance serving framework for LLMs☆893Updated last week
- llama3.cuda is a pure C/CUDA implementation for Llama 3 model.☆344Updated 5 months ago
- ☆16Updated 5 months ago
- Fastest kernels written from scratch☆355Updated last week
- This project implements an advanced pairs trading strategy using statistical arbitrage techniques. It leverages Bayesian optimization to …☆42Updated last year
- The GBM Plus API Python library aims to provide all current API calls to interface with the GBM Plus/Homebroker platform.☆32Updated 3 years ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆12Updated 5 months ago
- Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators☆465Updated this week
- Pretrain, finetune and serve LLMs on Intel platforms with Ray☆132Updated this week
- NVIDIA tools guide☆142Updated 8 months ago
- Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O☆500Updated 2 weeks ago
- Training MLP on MNIST in 1.5 seconds with pure CUDA☆46Updated 10 months ago