Kaffaljidhmah2 / Arxiv-Recommender
☆50Updated last year
Alternatives and similar repositories for Arxiv-Recommender:
Users that are interested in Arxiv-Recommender are comparing it to the libraries listed below
- Welcome to the 'In Context Learning Theory' Reading Group☆26Updated 5 months ago
- Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…☆176Updated 3 months ago
- A brief and partial summary of RLHF algorithms.☆127Updated last month
- official implementation of ICLR'2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆50Updated 2 weeks ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆97Updated 9 months ago
- ☆62Updated 4 months ago
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆71Updated 2 years ago
- ☆54Updated 5 months ago
- Lightweight Adapting for Black-Box Large Language Models☆22Updated last year
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆35Updated 8 months ago
- ☆50Updated last year
- ☆33Updated last week
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆34Updated last week
- summer school materials☆44Updated last year
- OpenReivew Submission Visualization (ICLR 2024/2025)☆152Updated 6 months ago
- An index of algorithms for reinforcement learning from human feedback (rlhf))☆93Updated last year
- Efficient empirical NTKs in PyTorch☆18Updated 2 years ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆20Updated last year
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆120Updated 7 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆54Updated 10 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Updated last year
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆76Updated 5 months ago
- [NeurIPS'24 Spotlight] Observational Scaling Laws☆54Updated 6 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆89Updated 2 weeks ago
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆72Updated 7 months ago
- ☆82Updated last year
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)☆62Updated 6 months ago
- official code for paper Probing the Decision Boundaries of In-context Learning in Large Language Models. https://arxiv.org/abs/2406.11233…☆17Updated 7 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆24Updated 2 months ago