Jackmin801 / DS-Assignment-2022Links
☆5Updated 3 years ago
Alternatives and similar repositories for DS-Assignment-2022
Users that are interested in DS-Assignment-2022 are comparing it to the libraries listed below
Sorting:
- Programming League National 2022☆18Updated 3 years ago
- The 2021 Programming League Contest (University of Malaya)☆9Updated 3 years ago
- Prune transformer layers☆69Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆185Updated last week
- Mixed precision training from scratch with Tensors and CUDA☆24Updated last year
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated 2 months ago
- ☆14Updated this week
- ☆64Updated 8 months ago
- ☆24Updated 8 months ago
- rl from zero pretrain, can it be done? we'll see.☆37Updated this week
- Learn CUDA with PyTorch☆25Updated this week
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- Learning about CUDA by writing PTX code.☆131Updated last year
- PTX-Tutorial Written Purely By AIs (Deep Research of Openai and Claude 3.7)☆67Updated 2 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆61Updated last month
- Solve puzzles. Learn CUDA.☆64Updated last year
- Small scale distributed training of sequential deep learning models, built on Numpy and MPI.☆133Updated last year
- This is a repo covers ai research papers pseudocodes☆14Updated last year
- ☆23Updated 10 months ago
- Pytorch/XLA SPMD Test code in Google TPU☆23Updated last year
- The official repo for "LLoCo: Learning Long Contexts Offline"☆117Updated 11 months ago
- Template repo for Python projects, especially those focusing on machine learning and/or deep learning.☆15Updated 2 weeks ago
- CS2030S Programming Methodology module in NUS☆13Updated 4 years ago
- ☆20Updated last year
- Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of…☆134Updated 10 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated last month
- Accelerate, Optimize performance with streamlined training and serving options with JAX.☆276Updated this week
- working implimention of deepseek MLA☆42Updated 5 months ago
- CUDA and Triton implementations of Flash Attention with SoftmaxN.☆70Updated last year
- Docker image NVIDIA GH200 machines - optimized for vllm serving and hf trainer finetuning☆42Updated 3 months ago