huggingface / ai-deadlines
⏰ AI conference deadline countdowns
☆283 Updated 2 weeks ago
Alternatives and similar repositories for ai-deadlines
Users who are interested in ai-deadlines are comparing it to the repositories listed below
- PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning ☆518 Updated last week
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources ☆146 Updated this week
- Large multi-modal models (L3M) pre-training. ☆170 Updated 2 weeks ago
- documentation for content creation ☆223 Updated this week
- ☆270 Updated 5 months ago
- Reproduction of DeepSeek-R1 ☆238 Updated 5 months ago
- ☆199 Updated 9 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆343 Updated 3 months ago
- Build your own visual reasoning model ☆412 Updated last month
- Code for ExploreTom ☆86 Updated 3 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆296 Updated last month
- Best practices & guides on how to write distributed pytorch training code ☆487 Updated 7 months ago
- Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models. TMLR 2025. ☆107 Updated 3 weeks ago
- MLGym: A New Framework and Benchmark for Advancing AI Research Agents ☆556 Updated last month
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆456 Updated last month
- Post-training with Tinker ☆550 Updated this week
- This repo contains the code for the paper "Intuitive physics understanding emerges from self-supervised pretraining on natural videos" ☆186 Updated 7 months ago
- ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning" ☆111 Updated 3 months ago
- Conference schedule, top papers, and analysis of the data for NeurIPS 2023! ☆119 Updated last year
- ☆609 Updated 5 months ago
- Fine tune Gemma 3 on an object detection task ☆85 Updated 2 months ago
- ☆45 Updated 4 months ago
- All credits go to HuggingFace's Daily AI papers (https://huggingface.co/papers) and the research community. 🔉Audio summaries here (https… ☆196 Updated last week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference. ☆290 Updated 2 months ago
- ☆237 Updated last month
- Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation (NeurIPS 2025) ☆461 Updated last week
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, spars… ☆342 Updated 9 months ago
- Public repository for "The Surprising Effectiveness of Test-Time Training for Abstract Reasoning" ☆329 Updated 10 months ago
- [EMNLP 2025 Demo] TinyScientist: A Lightweight Framework for Building Research Agents ☆106 Updated this week
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r… ☆236 Updated this week