Kaffaljidhmah2 / Arxiv-Recommender
☆41Updated last year
Related projects ⓘ
Alternatives and complementary repositories for Arxiv-Recommender
- ☆52Updated 9 months ago
- OpenReivew Submission Visualization (ICLR 2024/2025)☆140Updated 3 weeks ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆73Updated 4 months ago
- Welcome to the 'In Context Learning Theory' Reading Group☆22Updated this week
- `dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.☆30Updated last week
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆95Updated 2 months ago
- ☆44Updated 10 months ago
- ICLR2024 statistics☆46Updated 11 months ago
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆143Updated 2 weeks ago
- The implementation for MLSys 2023 paper: "Cuttlefish: Low-rank Model Training without All The Tuning"☆43Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆29Updated 8 months ago
- Stick-breaking attention☆32Updated last week
- [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View☆27Updated 3 weeks ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆63Updated last year
- [SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates☆58Updated 2 weeks ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆127Updated last month
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆64Updated 5 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆28Updated 10 months ago
- ☆28Updated 7 months ago
- ICLR2023 statistics☆60Updated 11 months ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆52Updated 2 months ago
- ☆26Updated last year
- ☆12Updated last week
- [ICML 2021] This is the official github repo for training L_inf dist nets with high certified accuracy.☆41Updated 2 years ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆77Updated 2 weeks ago
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- ☆59Updated 3 years ago
- DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models☆55Updated 2 weeks ago
- This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…☆51Updated this week
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆156Updated 3 months ago