ALT-JS / OthelloSAE
CS194-196 Course Project
☆13 Updated last month
Alternatives and similar repositories for OthelloSAE:
Users who are interested in OthelloSAE are comparing it to the repositories listed below.
- Official PyTorch Implementation for Task Vectors are Cross-Modal ☆22 Updated 3 months ago
- ☆35 Updated last month
- Control LLM ☆13 Updated this week
- Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts" ☆14 Updated 2 weeks ago
- ☆15 Updated 8 months ago
- OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement ☆30 Updated this week
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆23 Updated last week
- ☆16 Updated 2 months ago
- This repo contains code for the paper "Both Text and Images Leaked! A Systematic Analysis of Data Contamination in Multimodal LLM" ☆13 Updated this week
- implementation of dualformer ☆13 Updated 3 weeks ago
- ☆18 Updated 4 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia… ☆19 Updated 3 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data" ☆17 Updated last year
- Official implementation of ECCV24 paper: POA ☆24 Updated 7 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆26 Updated 11 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆21 Updated 3 weeks ago
- Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024) ☆29 Updated 8 months ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features" ☆10 Updated last month
- ☆31 Updated 2 months ago
- [NAACL 2025] Source code for MMEvalPro, a more trustworthy and efficient benchmark for evaluating LMMs ☆23 Updated 6 months ago
- The official implementation of Cross-Task Experience Sharing (COPS) ☆21 Updated 5 months ago
- ☆13 Updated 2 months ago
- Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models" ☆36 Updated last year
- DeciMamba: Exploring the Length Extrapolation Potential of Mamba (ICLR 2025) ☆24 Updated 8 months ago
- ☆21 Updated 8 months ago
- Codes for ReFocus: Visual Editing as a Chain of Thought for Structured Image Understanding ☆19 Updated 2 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning" ☆12 Updated 5 months ago
- official code for "BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning" ☆33 Updated 2 months ago
- Official implementation of the paper "MMInA: Benchmarking Multihop Multimodal Internet Agents" ☆41 Updated last month
- ☆40 Updated 4 months ago