huggingface / ai-deadlines
⏰ AI conference deadline countdowns
☆244Updated last week
Alternatives and similar repositories for ai-deadlines:
Users that are interested in ai-deadlines are comparing it to the libraries listed below
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆135Updated 2 weeks ago
- minimal GRPO implementation from scratch☆62Updated last week
- Reproduction of DeepSeek-R1☆121Updated this week
- An introduction to LLM Sampling☆77Updated 3 months ago
- From scratch implementation of a vision language model in pure PyTorch☆205Updated 10 months ago
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆420Updated 3 months ago
- ☆173Updated 3 months ago
- ☆209Updated this week
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆277Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆167Updated this week
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆268Updated 3 weeks ago
- Build your own visual reasoning model☆312Updated this week
- Visualizations of the theory behind diffusion models.☆151Updated 11 months ago
- ☆149Updated 7 months ago
- This repo contains the code for the paper "Intuitive physics understanding emerges fromself-supervised pretraining on natural videos"☆111Updated last month
- "LLM from Zero to Hero: An End-to-End Large Language Model Journey from Data to Application!"☆27Updated this week
- Notebooks for fine tuning pali gemma☆97Updated 2 months ago
- An extension of the nanoGPT repository for training small MOE models.☆106Updated 2 weeks ago
- ☆208Updated this week
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning"☆300Updated 4 months ago
- Let's build better datasets, together!☆257Updated 3 months ago
- code for training & evaluating Contextual Document Embedding models☆176Updated 2 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆269Updated 9 months ago
- ☆124Updated this week
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch☆161Updated 2 months ago
- Complete implementation of Llama2 with/without KV cache & inference 🚀☆47Updated 10 months ago
- documentation for content creation☆188Updated last month
- Notebook and Scripts that showcase running quantized diffusion models on consumer GPUs☆38Updated 4 months ago
- ☆105Updated 3 months ago
- ☆120Updated 4 months ago