Kaffaljidhmah2 / Arxiv-Recommender
☆49Updated last year
Alternatives and similar repositories for Arxiv-Recommender:
Users who are interested in Arxiv-Recommender are comparing it to the repositories listed below
- ☆61Updated 4 months ago
- ☆54Updated 4 months ago
- ☆81Updated last year
- Welcome to the 'In Context Learning Theory' Reading Group☆28Updated 4 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆119Updated 6 months ago
- [ACL'24, Outstanding Paper] Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!☆34Updated 7 months ago
- Official implementation of the ICLR 2025 paper: Rethinking Bradley-Terry Models in Preference-based Reward Modeling: Foundations, Theory, and…☆34Updated last week
- A curated list of awesome resources dedicated to Scaling Laws for LLMs☆70Updated last year
- A brief and partial summary of RLHF algorithms.☆124Updated 2 weeks ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆74Updated 4 months ago
- OpenReview Submission Visualization (ICLR 2024/2025)☆152Updated 5 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆34Updated last year
- GenRM-CoT: Data release for verification rationales☆51Updated 5 months ago
- Lightweight Adapting for Black-Box Large Language Models☆21Updated last year
- ICLR2024 statistics☆47Updated last year
- Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep☆82Updated 8 months ago
- ☆32Updated last week
- ☆37Updated last year
- ☆50Updated last year
- Efficient empirical NTKs in PyTorch☆18Updated 2 years ago
- Code for "Towards Revealing the Mystery behind Chain of Thought: a Theoretical Perspective"☆19Updated last year
- Official GitHub repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024]☆130Updated 6 months ago
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆91Updated 8 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆23Updated 2 months ago
- This is an official implementation of the paper "Building Math Agents with Multi-Turn Iterative Preference Learning" with multi-turn DP…☆23Updated 3 months ago
- Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"☆103Updated last year
- [ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization☆70Updated 7 months ago
- ☆78Updated last year