kmohan321 / Research_Papers
☆24Updated last week
Alternatives and similar repositories for Research_Papers:
Users who are interested in Research_Papers are comparing it to the repositories listed below
- Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish☆167Updated 5 months ago
- This repo has all the basic things you'll need in order to understand the complete vision transformer architecture and its various implementa…☆190Updated last week
- ☆98Updated 8 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆59Updated 2 months ago
- small auto-grad engine inspired by Karpathy's micrograd and PyTorch☆229Updated last month
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆37Updated 2 months ago
- Collection of autoregressive model implementations☆76Updated this week
- Solving data for LLMs - Create quality synthetic datasets!☆142Updated 2 months ago
- Learnings and programs related to CUDA☆50Updated this week
- A curated list of resources for learning and exploring Triton, OpenAI's programming language for writing efficient GPU code.☆136Updated this week
- ☆96Updated 4 months ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 3 months ago
- look how they massacred my boy☆63Updated 2 months ago
- An introduction to LLM Sampling☆75Updated 3 weeks ago
- a tiny vectorstore implementation built with numpy.☆58Updated 8 months ago
- Simple Transformer in Jax☆127Updated 6 months ago
- ☆121Updated 4 months ago
- smolLM with Entropix sampler on pytorch☆147Updated 2 months ago
- ☆96Updated 2 months ago
- ☆94Updated 3 months ago
- ☆83Updated 3 months ago
- Cerule - A Tiny Mighty Vision Model☆67Updated 4 months ago
- In this repository I have code and brief explanations of the attempts that I made at the ARC-AGI (2024) challenges :)☆23Updated last month
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆79Updated 7 months ago
- Notebooks for fine-tuning PaliGemma☆85Updated 2 weeks ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆142Updated this week
- Video+code lecture on building nanoGPT from scratch☆64Updated 6 months ago
- ☆27Updated 6 months ago
- Set of scripts to finetune LLMs☆36Updated 9 months ago
- ☆74Updated 3 months ago