alrojo / me
about me
☆21 Updated 2 weeks ago
Related projects
Alternatives and complementary repositories for me
- List of AI Internships ☆101 Updated last year
- ☆149 Updated 6 months ago
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat… ☆48 Updated last year
- Deep Learning & Information Bottleneck ☆50 Updated last year
- Collections of CS PhD Application Fee Waivers of schools in North America ☆337 Updated 11 months ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆34 Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆43 Updated 5 months ago
- A repository for compiling graduate application materials for prospective computer science graduate students (Masters & PhD). ☆169 Updated this week
- 📝 A not-so-fancy but still a pretty research CV ☆61 Updated 3 years ago
- Framework code with wandb, checkpointing, logging, configs, experimental protocols. Useful for fine-tuning models or training from scratc… ☆146 Updated last year
- ☆75 Updated 9 months ago
- Automatically fetches diffusion NLP papers from arXiv. More paper information can be found in another repository, "Diffusion_NLP_Papers". ☆64 Updated this week
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023) ☆79 Updated last year
- Omnigrok: Grokking Beyond Algorithmic Data ☆48 Updated last year
- PyTorch code for experiments on Linear Transformers ☆13 Updated 9 months ago
- ☆61 Updated 2 years ago
- List of AI Residency & Research programs, Ph.D Fellowships, Research Internships ☆154 Updated 4 years ago
- ☆21 Updated 4 months ago
- ☆108 Updated last year
- ☆96 Updated 3 months ago
- ☆102 Updated last month
- A curated list for awesome discrete diffusion models resources. ☆61 Updated this week
- NeuroSurgeon is a package that enables researchers to uncover and manipulate subnetworks within models in Huggingface Transformers ☆36 Updated 3 months ago
- Efficient LLM inference on Slurm clusters using vLLM. ☆37 Updated this week
- ☆35 Updated 7 months ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models". ☆84 Updated last year
- A curated list of fellowships for graduate students in Computer Science and related fields. ☆51 Updated 3 months ago
- NanoGPT-like codebase for LLM training ☆73 Updated this week
- Personal implementation of ASIF by Antonio Norelli ☆24 Updated 5 months ago
- ☆48 Updated 8 months ago