ejmichaud / neural-verification
MI and Formal Verification of NNs on Algorithmic tasks!
☆13Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for neural-verification
- Tutorial on neural theorem proving☆157Updated 10 months ago
- Tools for studying developmental interpretability in neural networks.☆77Updated last week
- Benchmark for undergraduate-level formal mathematics☆94Updated last month
- ☆59Updated last year
- llmstep: [L]LM proofstep suggestions in Lean 4.☆119Updated last year
- Formalizing stochastic doubly-efficient debate☆92Updated last month
- Retrieval-Augmented Theorem Provers for Lean☆229Updated 2 months ago
- LLMs + Lean, on your laptop or in the cloud☆125Updated 3 weeks ago
- AI for Mathematics (AI4Math) paper list☆135Updated last month
- Mechanistic Interpretability Visualizations using React☆200Updated 4 months ago
- https://albertqjiang.github.io/Portal-to-ISAbelle/☆51Updated last year
- ☆34Updated this week
- ☆61Updated last year
- Python wrapper for lean-gym☆11Updated last year
- ☆177Updated last year
- [COLM 2024] A Survey on Deep Learning for Theorem Proving☆135Updated 2 months ago
- ☆188Updated last month
- ☆44Updated this week
- ☆146Updated last month
- A scalable abstraction learning library☆74Updated 11 months ago
- ☆108Updated last year
- Formal to Formal Mathematics Benchmark☆317Updated last year
- Materials for ConceptARC paper☆77Updated 2 weeks ago
- An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.☆61Updated this week
- An environment for learning formal mathematical reasoning from scratch☆53Updated 3 months ago
- LLM verified with Monte Carlo Tree Search☆251Updated 2 months ago
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆200Updated 9 months ago
- The official code release for Don't Trust: Verify -- Grounding LLM Quantitative Reasoning with Autoformalization☆24Updated 6 months ago
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).☆158Updated last month
- ☆105Updated last month