mikex86 / scicore
A tiny deep learning library written in Java
☆25Updated 2 years ago
Alternatives and similar repositories for scicore:
Users that are interested in scicore are comparing it to the libraries listed below
- A fork of llama3.c used to do some R&D on inferencing☆19Updated 3 months ago
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 6 months ago
- asynchronous/distributed speculative evaluation for llama3☆39Updated 7 months ago
- The Finite Field Assembly Programming Language☆35Updated this week
- ☆97Updated 11 months ago
- ☆19Updated 7 months ago
- ☆18Updated 8 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆127Updated last year
- Tensor library with autograd using only Rust's standard library☆67Updated 9 months ago
- C++ raytracer that supports custom models. Supports running the calculations on the CPU using C++11 threads or in the GPU via CUDA.☆75Updated 2 years ago
- minimal diffusion transformer in pytorch.☆16Updated 5 months ago
- throwaway GPT inference☆140Updated 10 months ago
- ☆46Updated 7 months ago
- ☆242Updated last year
- ☆9Updated 3 weeks ago
- Minimal C++ implementation of GPT2☆40Updated last year
- a Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization in pure C.☆21Updated 8 months ago
- Learning about CUDA by writing PTX code.☆125Updated last year
- A star for organising blocks and playing with transformers.☆23Updated 11 months ago
- Can I make an *optimizing* compiler under 1k lines of code?☆55Updated last month
- A formalization of first-order logic and Peano's axioms in Python☆20Updated last year
- C implementation of the L-Mul f32/f16 multiplications from paper: https://arxiv.org/html/2410.00907☆27Updated 5 months ago
- Learn CUDA with PyTorch☆19Updated 2 months ago
- Reference Kernels for the Leaderboard☆23Updated 3 weeks ago
- This repository contain the simple llama3 implementation in pure jax.☆58Updated last month
- Proof-of-concept of global switching between numpy/jax/pytorch in a library.☆18Updated 9 months ago
- Rust Implementation of micrograd☆51Updated 8 months ago
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆177Updated last year
- A package for defining deep learning models using categorical algebraic expressions.☆60Updated 8 months ago
- pytorch from scratch in pure C/CUDA and python☆40Updated 5 months ago