CS194-196 Course Project
☆14Feb 20, 2025Updated last year
Alternatives and similar repositories for OthelloSAE
Users that are interested in OthelloSAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repository of paper "Mojito: LLM-Aided Motion Instructor with Jitter-Reduced Inertial Tokens".☆24May 12, 2025Updated last year
- ☆15Oct 5, 2025Updated 8 months ago
- ☆26Jul 2, 2023Updated 3 years ago
- ☆77Mar 6, 2025Updated last year
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Materials for "Multi-property Steering of Large Language Models with Dynamic Activation Composition"☆14Nov 22, 2024Updated last year
- ☆18Oct 8, 2023Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated last year
- Learning to Skip the Middle Layers of Transformers☆17Aug 7, 2025Updated 10 months ago
- A benchmark for mechanistic discovery of circuits in Transformers☆17Dec 15, 2024Updated last year
- ☆34Jun 17, 2024Updated 2 years ago
- Sparse Autoencoders (SAE) vs CLIP fine-tuning fun.☆18Dec 19, 2024Updated last year
- Code and data for "A Systematic Assessment of Syntactic Generalization in Neural Language Models"☆30Jun 18, 2021Updated 5 years ago
- ☆13Mar 26, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Jul 23, 2024Updated last year
- ☆32Jun 5, 2025Updated last year
- [ICML'25] "Rethinking Addressing in Language Models via Contextualized Equivariant Positional Encoding" by Jiajun Zhu, Peihao Wang, Ruisi…☆15Jun 6, 2025Updated last year
- VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning☆37May 9, 2026Updated last month
- Repository of GUI Action Narrator☆13Apr 8, 2025Updated last year
- KernelBench v2: Can LLMs Write GPU Kernels? - Benchmark with Torch -> Triton (and more!) problems☆24Jul 4, 2025Updated last year
- ☆11Oct 8, 2023Updated 2 years ago
- Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers☆32Mar 1, 2025Updated last year
- Tempo: Small Vision-Language Models are Smart Compressors for Long Video Understanding (ECCV 2026)☆72Updated this week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- I use OpenAi Robotics environment Fetch to train a robot to lift, slide, move objectives to defined targets. I do this using Deep Determi…☆32Feb 6, 2020Updated 6 years ago
- FeatureAlignment = Alignment + Mechanistic Interpretability☆35Mar 8, 2025Updated last year
- Policy Optimization is awesome, let’s put a tree on it! 🌲🌟☆22Jul 4, 2025Updated last year
- The official implementation of Preference Data Reward-Augmentation.☆18May 1, 2025Updated last year
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆20Mar 4, 2025Updated last year
- ☆37Jul 9, 2025Updated 11 months ago
- [ECCV 2024] M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts.☆61Oct 1, 2024Updated last year
- [CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding☆44Mar 15, 2024Updated 2 years ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆29Feb 25, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Command helper for slurm system. Act as if you are on compute node.☆16Feb 1, 2025Updated last year
- MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment☆35Jul 1, 2024Updated 2 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated 2 years ago
- Algorithms for approximate attention in LLMs☆22Apr 14, 2025Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆32Nov 12, 2024Updated last year
- Code for the ACL 2021 paper "Structural Guidance for Transformer Language Models"☆14Sep 17, 2025Updated 9 months ago