A mechanistic approach for understanding and detecting factual errors of large language models.
☆49Jul 6, 2024Updated last year
Alternatives and similar repositories for mechanistic-error-probe
Users that are interested in mechanistic-error-probe are comparing it to the libraries listed below.
- quica is a tool to run inter coder agreement pipelines in an easy and effective way. Multiple measures are run and results are collected…☆23Nov 9, 2020Updated 5 years ago
- ☆83Mar 26, 2024Updated 2 years ago
- Some numerical optimization methods implemented in Haskell☆47Jun 24, 2020Updated 5 years ago
- 👁️ Isometric 3D Graphing / Rendering module for Haskell☆15Sep 2, 2017Updated 8 years ago
- ☆10Nov 1, 2022Updated 3 years ago
- ☆10Nov 8, 2022Updated 3 years ago
- Concept for a fast event system, using JIT and GPU acceleration☆12Feb 8, 2020Updated 6 years ago
- Shared code for training sentence embeddings with Flax / JAX☆28Jul 15, 2021Updated 4 years ago
- Comparative Analysis of Graph Neural Networks for Node Regression task on Wiki-Squirrel dataset (Bachelor's Research Project)☆12Nov 6, 2025Updated 5 months ago
- A repository to keep tools, scripts, data for SMART task.☆11May 24, 2022Updated 3 years ago
- ☆21Mar 19, 2024Updated 2 years ago
- IAI Style Guide☆11Jun 27, 2025Updated 9 months ago
- Belief in the Machine: Investigating Epistemological Blind Spots of Language Models☆32Apr 19, 2025Updated 11 months ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 7 months ago
- Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"☆47May 31, 2024Updated last year
- ☆10Jul 13, 2024Updated last year
- ☆58Jun 30, 2023Updated 2 years ago
- ☆15Apr 26, 2025Updated 11 months ago
- [ICME 2019] Source code and datasets for "Semi-supervised Compatibility Learning Across Categories for Clothing Matching"☆10Apr 26, 2024Updated last year
- Data and download script to accompany LREC2020 paper "Automated Fact-Checking of Claims from Wikipedia"☆13Jul 19, 2023Updated 2 years ago
- ☆23Jun 13, 2024Updated last year
- Official implementation of UnifiedReward & UnifiedReward-Think☆18Jun 18, 2025Updated 9 months ago
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆23Feb 6, 2025Updated last year
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- This is the code corresponding to our publication introducing ConvDecoder with physics-based regularization (CD+r) for MRI☆10Feb 6, 2026Updated 2 months ago
- Simple phoenix setup for padded window management☆13Apr 25, 2018Updated 7 years ago
- Mental state inference from observable behavior☆15Dec 3, 2021Updated 4 years ago
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 2 years ago
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- ☆10Apr 14, 2021Updated 4 years ago
- ☆20Apr 12, 2024Updated 2 years ago
- A Domain-Specific Language, Jailbreak Attack Synthesizer and Dynamic LLM Redteaming Toolkit☆27Dec 5, 2024Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆22Feb 23, 2026Updated last month
- Implementation of the AAAI-21 Workshop on Scientific Document Understanding paper "A Paragraph-level Multi-task Learning Model for Scient…☆15Oct 9, 2023Updated 2 years ago
- ☆13Jul 12, 2024Updated last year
- Comparison-based Machine Learning in Python☆21Jun 16, 2024Updated last year
- This repo contains the ToMnet+ model for preference inference. Developed by Yun-Shiuan, Edwinn, Hsin-Yi, and Elaine.☆10Feb 24, 2023Updated 3 years ago
- [WSDM 2025] Source code for "Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Cali…☆14Oct 14, 2025Updated 5 months ago
- This repository contains data, code and models for contextual noncompliance.☆25Jul 18, 2024Updated last year