katiekang1998 / reasoning_generalization
☆16Updated last month
Related projects ⓘ
Alternatives and complementary repositories for reasoning_generalization
- ☆55Updated last month
- ☆62Updated 3 months ago
- A repository for research on medium sized language models.☆74Updated 5 months ago
- Efficient Scaling laws and collaborative pretraining.☆13Updated this week
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 5 months ago
- The repository contains code for Adaptive Data Optimization☆18Updated last month
- GoldFinch and other hybrid transformer components☆39Updated 4 months ago
- ☆39Updated 10 months ago
- The official implementation of Self-Exploring Language Models (SELM)☆55Updated 5 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆24Updated 7 months ago
- ☆35Updated 3 weeks ago
- Genetics for Language Models☆12Updated 4 months ago
- ☆63Updated 4 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆74Updated this week
- Official implementation of ECCV24 paper: POA☆24Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago
- Triton Implementation of HyperAttention Algorithm☆46Updated 11 months ago
- ☆49Updated 6 months ago
- Pytorch implementation for "Compressed Context Memory For Online Language Model Interaction" (ICLR'24)☆50Updated 7 months ago
- ☆25Updated 2 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"☆35Updated 10 months ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆36Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆78Updated 8 months ago
- ☆22Updated 2 weeks ago
- ☆53Updated 10 months ago
- ☆41Updated 2 weeks ago
- ☆53Updated 3 weeks ago
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI☆26Updated last week
- From GaLore to WeLore: How Low-Rank Weights Non-uniformly Emerge from Low-Rank Gradients. Ajay Jaiswal, Lu Yin, Zhenyu Zhang, Shiwei Liu,…☆43Updated 4 months ago
- ☆57Updated last week