ablghtianyi / ICL_Modular_ArithmeticLinks
☆19Updated 4 months ago
Alternatives and similar repositories for ICL_Modular_Arithmetic
Users that are interested in ICL_Modular_Arithmetic are comparing it to the libraries listed below
Sorting:
- ☆20Updated last year
- ☆34Updated 6 months ago
- Unofficial Implementation of Selective Attention Transformer☆17Updated 9 months ago
- ☆18Updated this week
- Offcial Repo of Paper "Eliminating Position Bias of Language Models: A Mechanistic Approach""☆14Updated last month
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆12Updated 4 months ago
- ☆20Updated 3 months ago
- [NeurIPS 2024] Can LLMs Learn by Teaching for Better Reasoning? A Preliminary Study☆52Updated 8 months ago
- Reinforcing General Reasoning without Verifiers☆76Updated last month
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆81Updated 9 months ago
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆39Updated 9 months ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 6 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆27Updated 5 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆33Updated last week
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆29Updated 3 months ago
- The repository contains code for Adaptive Data Optimization☆25Updated 7 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆15Updated 4 months ago
- ☆83Updated 11 months ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆19Updated 5 months ago
- An official implementation of "Catastrophic Failure of LLM Unlearning via Quantization" (ICLR 2025)☆29Updated 5 months ago
- Resa: Transparent Reasoning Models via SAEs☆41Updated last month
- ☆47Updated 5 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆32Updated 3 months ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆15Updated 7 months ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆35Updated last month
- ☆27Updated 5 months ago
- SCoRe: Training Language Models to Self-Correct via Reinforcement Learning☆12Updated 6 months ago
- Code for "Reasoning to Learn from Latent Thoughts"☆114Updated 4 months ago
- ☆24Updated last month
- ☆22Updated 2 months ago