chsasank / llama.lisp
Lisp dialect designed for HPC and AI
☆18Updated this week
Alternatives and similar repositories for llama.lisp:
Users that are interested in llama.lisp are comparing it to the libraries listed below
- Make triton easier☆47Updated 9 months ago
- NAACL '24 (Best Demo Paper RunnerUp) / MlSys @ NeurIPS '23 - RedCoast: A Lightweight Tool to Automate Distributed Training and Inference☆64Updated 3 months ago
- Learn CUDA with PyTorch☆19Updated 2 months ago
- ☆12Updated 3 years ago
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆62Updated last week
- Standalone commandline CLI tool for compiling Triton kernels☆17Updated 6 months ago
- Explore training for quantized models☆17Updated 2 months ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Updated 9 months ago
- ☆17Updated last year
- ☆21Updated 3 weeks ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆40Updated last year
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆68Updated 10 months ago
- Write a fast kernel and run it on Discord. See how you compare against the best!☆35Updated this week
- This repository contain the simple llama3 implementation in pure jax.☆58Updated last month
- FlexAttention w/ FlashAttention3 Support☆26Updated 5 months ago
- Jax like function transformation engine but micro, microjax☆30Updated 5 months ago
- [wip] Deep Learning Compiler based on Polyhedral Compiler, Light-weight IRs, and Optimizing Pattern Matcher. (development is on hold unti…☆202Updated this week
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- Example ML projects that use the Determined library.☆30Updated 6 months ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11Updated 10 months ago
- Benchmarks of different devices I have come across☆22Updated 3 months ago
- train with kittens!☆55Updated 5 months ago
- ML/DL Math and Method notes☆59Updated last year
- ☆13Updated 9 months ago
- LLM training in simple, raw C/CUDA☆14Updated 3 months ago
- Introduction to Quantization☆20Updated last year
- Senna is an advanced AI-powered search engine designed to provide users with immediate answers to their queries by leveraging natural lan…☆19Updated 6 months ago
- ☆27Updated 2 months ago
- Training hybrid models for dummies.☆20Updated 2 months ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆127Updated last year