chsasank / llama.lisp
Lisp dialect designed for HPC and AI
☆18Updated last week
Alternatives and similar repositories for llama.lisp
Users who are interested in llama.lisp are comparing it to the repositories listed below
- See https://github.com/cuda-mode/triton-index/ instead!☆11Updated last year
- Make triton easier☆46Updated last year
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆15Updated 2 years ago
- Learn CUDA with PyTorch☆27Updated last week
- fast.ai APL study group notes☆28Updated 2 years ago
- The compressor-retriever architecture for language model OS☆15Updated 9 months ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻☆14Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆59Updated 8 months ago
- Standalone command-line tool for compiling Triton kernels☆17Updated 9 months ago
- RoBERTa Marathi Language model trained from scratch during huggingface 🤗 x flax community week☆28Updated 3 years ago
- Presents comprehensive benchmarks of XLA-compatible pre-trained models in Keras.☆37Updated last year
- ☆12Updated 4 years ago
- microjax: a micro, JAX-like function transformation engine☆33Updated 8 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆15Updated last year
- First-order logic theorem prover supporting unification with approximate vector similarity☆12Updated 2 years ago
- ☆22Updated last year
- This repository contains a simple Llama 3 implementation in pure JAX.☆67Updated 4 months ago
- A pipeline for using API calls to agnostically convert unstructured data into structured training data☆30Updated 9 months ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆27Updated 2 years ago
- Efficiently computing & storing token n-grams from large corpora☆24Updated 8 months ago
- ☆23Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆84Updated last year
- E2E AutoML Model Compression Package☆46Updated 3 months ago
- An attempt at the implementation of GLOM, Geoffrey Hinton's paper for emergent part-whole hierarchies from data☆37Updated 4 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- Minimal, clean code in pure C for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.☆21Updated 11 months ago
- Program synthesis with neuro-symbolic differentiable interpreters☆14Updated last year
- ☆30Updated 6 months ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆30Updated 2 years ago
- Interesting ATP Proofs☆13Updated 3 years ago