☆42Aug 3, 2025Updated 6 months ago
Alternatives and similar repositories for ReForm
Users that are interested in ReForm are comparing it to the libraries listed below
- ☆23Updated this week
- DafnyBench: A Benchmark for Formal Software Verification☆59Dec 12, 2024Updated last year
- (Mirror) A Machine-to-Machine Interaction System for Lean 4☆52Feb 9, 2026Updated 2 weeks ago
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆46Dec 25, 2025Updated 2 months ago
- [ICLR'25 Spotlight] Rethinking and improving autoformalization: towards a faithful metric and a Dependency Retrieval-based approach☆27May 20, 2025Updated 9 months ago
- [IJCAI 2024] QiMeng-CPU-v1: Automated CPU Design by Learning from Input-Output Examples☆27May 4, 2025Updated 9 months ago
- ☆16Oct 27, 2024Updated last year
- A Machine-to-Machine Interaction System for Lean 4.☆133Updated this week
- Generic interface for hooking up to any Interactive Theorem Prover (ITP) and collecting data for training ML models for AI in formal theo…☆18Feb 19, 2026Updated last week
- AlphaVerus: Formally Verified Code Generation through Self-Improving Translation and Treefinement☆24May 14, 2025Updated 9 months ago
- CLEVER: Code Lean Evaluation for Verified End-to-end Reasoning☆37Dec 18, 2025Updated 2 months ago
- Verina (Verifiable Code Generation Arena) is a high-quality benchmark enabling a comprehensive and modular evaluation of code, specificat…☆49Jan 25, 2026Updated last month
- Official implementation of "Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving"☆29May 8, 2025Updated 9 months ago
- ☆42Dec 16, 2025Updated 2 months ago
- ☆78Jan 22, 2026Updated last month
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- ☆26Feb 11, 2026Updated 2 weeks ago
- ☆76Jan 8, 2026Updated last month
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- ☆18Jun 10, 2025Updated 8 months ago
- LLM Evaluation Benchmark on Hardware Formal Verification☆36Apr 3, 2025Updated 10 months ago
- ☆34May 9, 2025Updated 9 months ago
- The official implementation of the paper "DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents"☆29Oct 23, 2025Updated 4 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- NaturalProver: Grounded Mathematical Proof Generation with Language Models☆39Mar 24, 2023Updated 2 years ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 4 months ago
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆30Updated this week
- ☆11Jun 22, 2025Updated 8 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- Clover: Closed-Loop Verifiable Code Generation☆42May 12, 2025Updated 9 months ago
- ☆56May 21, 2025Updated 9 months ago
- The raw UserRL repo under construction☆95Sep 25, 2025Updated 5 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆50Feb 10, 2025Updated last year
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆51Jul 15, 2025Updated 7 months ago
- ☆12Dec 15, 2025Updated 2 months ago
- [CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval☆21Jun 23, 2025Updated 8 months ago
- This is the official implementation for MA-LoT.☆19Aug 4, 2025Updated 6 months ago
- Official Implementation of HIMA (COLM'25)☆19Nov 25, 2025Updated 3 months ago