☆45Feb 22, 2026Updated last month
Alternatives and similar repositories for OpenSAE
Users that are interested in OpenSAE are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NIPS 2025 DB Spotlight] AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios☆30Dec 1, 2025Updated 4 months ago
- ☆12Apr 25, 2024Updated last year
- Source code for EMNLP2022 paper "Finding Skill Neurons in Pre-trained Transformers via Prompt Tuning".☆18Mar 13, 2023Updated 3 years ago
- Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models☆27Jul 14, 2025Updated 9 months ago
- ☆62Oct 29, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code and dataset for the ACL 2021 paper "TWAG: A Topic-guided Wikipedia Abstract Generator"☆20Aug 9, 2021Updated 4 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models☆11Apr 9, 2024Updated 2 years ago
- ☆59Aug 1, 2023Updated 2 years ago
- ☆25Apr 7, 2026Updated last week
- Code for "In-Context Former: Lightning-fast Compressing Context for Large Language Model" (Findings of EMNLP 2024)☆21Nov 21, 2024Updated last year
- Efficient Dictionary Learning with Switch Sparse Autoencoders (SAEs)☆25Dec 1, 2024Updated last year
- Group project "Algorithms for large-scale optimal transport". Implement ADMMs and Sinkhorn's Algorithms.☆11Jan 28, 2019Updated 7 years ago
- [CIKM 2025] LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking☆17Sep 6, 2025Updated 7 months ago
- ☆43Jun 26, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆14Apr 19, 2024Updated last year
- Python Implementation of Hierarchical Mesh Decomposition using Fuzzy Clustering and Cuts☆17Dec 17, 2022Updated 3 years ago
- [KDD 2025] AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning☆16May 27, 2025Updated 10 months ago
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆248Apr 6, 2026Updated last week
- Resource, Evaluation and Detection Papers for ChatGPT☆455Mar 21, 2024Updated 2 years ago
- A simple Python implementation of ngram sunburst (nested pie chart) visualization showed in CoQA paper☆13Mar 12, 2019Updated 7 years ago
- [ICLR24] The open-source repo of THU-KEG's KoLA benchmark.☆55Sep 28, 2023Updated 2 years ago
- The data and source code for the paper "MoocRadar: A Fine-grained and Multi-aspect Knowledge Repository for Improving Cognitive Student M…☆59Oct 7, 2024Updated last year
- This is my codes that can visualize the psnr image in testing videos.☆15Apr 4, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆30May 22, 2025Updated 10 months ago
- ☆20Apr 10, 2025Updated last year
- Code for the paper: Improving Multi-Document Summarization through Referenced Flexible Extraction with Credit-Awareness☆12Oct 22, 2023Updated 2 years ago
- Repository of paper "Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis" (ACL 2025 Main)☆19Jul 19, 2025Updated 8 months ago
- ☆159Dec 30, 2025Updated 3 months ago
- ☆33Jun 18, 2025Updated 9 months ago
- Codebase for LLM Textual Hallucination Benchmark☆79Apr 25, 2025Updated 11 months ago
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆85Mar 14, 2026Updated last month
- Quality Shapes Extraction from very large Knowledge Graphs☆13Nov 15, 2025Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Sparsify transformers with SAEs and transcoders☆705Apr 6, 2026Updated last week
- blog☆17Oct 2, 2024Updated last year
- This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).☆16Dec 14, 2023Updated 2 years ago
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆13Jan 26, 2025Updated last year
- Data and code for the paper: Finding Safety Neurons in Large Language Models☆25Jan 29, 2026Updated 2 months ago
- Dataset and baseline for Coling 2022 long paper (oral): "ConFiguRe: Exploring Discourse-level Chinese Figures of Speech"☆13Jul 27, 2023Updated 2 years ago
- Source code for EMNLP 2023 paper "Probabilistic Tree-of-thought Reasoning for Answering Knowledge-intensive Complex Questions".☆23Mar 21, 2024Updated 2 years ago