sar-mo / CS2051-HonorsDiscreteMathLinks
A collection of resources for CS 2051, an undergraduate Honors Discrete Mathematics course at Georgia Tech.
☆10Updated 2 years ago
Alternatives and similar repositories for CS2051-HonorsDiscreteMath
Users that are interested in CS2051-HonorsDiscreteMath are comparing it to the libraries listed below
Sorting:
- Puzzles for exploring transformers☆367Updated 2 years ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi…☆349Updated last year
- Resources from the EleutherAI Math Reading Group☆54Updated 6 months ago
- Simple Transformer in Jax☆140Updated last year
- Just large language models. Hackable, with as little abstraction as possible. Done for my own purposes, feel free to rip.☆44Updated last year
- Extract full next-token probabilities via language model APIs☆247Updated last year
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆225Updated 3 weeks ago
- ☆450Updated 10 months ago
- A puzzle to learn about prompting☆132Updated 2 years ago
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆138Updated last year
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆192Updated last year
- Solve puzzles to improve your tinygrad skills!☆142Updated 5 months ago
- A blog where I write about research papers and blog posts I read.☆12Updated 9 months ago
- FastAsk is a Python package that installs an easy to use command to your terminal to get a quick answer to a question, using either OpenA…☆56Updated 8 months ago
- Solve puzzles. Learn CUDA.☆64Updated last year
- Fast bare-bones BPE for modern tokenizer training☆164Updated 2 months ago
- Tutorials on tinygrad☆406Updated 3 weeks ago
- A set of Python scripts that makes your experience on TPU better☆54Updated last year
- ☆275Updated last year
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆172Updated last year
- seqax = sequence modeling + JAX☆166Updated last month
- Annotated version of the Mamba paper☆488Updated last year
- Helpers and such for working with Lambda Cloud☆51Updated last year
- A comprehensive deep dive into the world of tokens☆226Updated last year
- What would you do with 1000 H100s...☆1,094Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆813Updated last month
- An implementation of the transformer architecture onto an Nvidia CUDA kernel☆189Updated last year
- JAX implementation of the Llama 2 model☆219Updated last year
- Stream of my favorite papers and links☆42Updated 5 months ago
- ☆530Updated last year