lab-v2 / pyreason-gym
An OpenAI Gym wrapper for PyReason to use in a Grid World reinforcement learning setting
☆30 · Updated last year
Alternatives and similar repositories for pyreason-gym:
Users who are interested in pyreason-gym are comparing it to the libraries listed below.
- ☆14 · Updated last year
- ☆23 · Updated last month
- 🧮 Algebraic Positional Encodings. ☆11 · Updated 2 months ago
- Elevate your language models with insightful diversity metrics. ☆11 · Updated last year
- ☆43 · Updated last year
- An environment for learning formal mathematical reasoning from scratch ☆65 · Updated 7 months ago
- REBUS: A Robust Evaluation Benchmark of Understanding Symbols ☆13 · Updated 7 months ago
- A Scalable Approximate Method for Probabilistic Neurosymbolic Inference ☆14 · Updated 2 months ago
- Clover: Closed-Loop Verifiable Code Generation ☆32 · Updated 10 months ago
- Evaluation of neuro-symbolic engines ☆35 · Updated 8 months ago
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem. ☆16 · Updated 5 months ago
- ☆23 · Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from PyTorch and Zeta ☆13 · Updated 4 months ago
- Get language models to generate responses in a specific format reliably. Open source implementation of Synchromesh: Reliable code generat… ☆28 · Updated last year
- You should use PySR to find scaling laws. Here's an example. ☆33 · Updated last year
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf ☆22 · Updated 10 months ago
- Harmonic Datasets ☆37 · Updated 8 months ago
- GPT-based language channel for NARS (ONA) ☆29 · Updated 2 months ago
- LMQL implementation of tree of thoughts ☆34 · Updated last year
- Residual Quantization Autoencoder, used for interpreting LLMs ☆11 · Updated 3 months ago
- Code for the paper "Learning to Prove Theorems by Learning to Generate Theorems" ☆32 · Updated 4 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 · Updated last year
- ☆11 · Updated last month
- Official code for paper: Conservative objective models are a special kind of contrastive divergence-based energy model ☆14 · Updated last year
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Updated last year
- ☆18 · Updated 11 months ago
- ☆25 · Updated 3 years ago
- This is the official repository for all the code of TheoremLlama ☆39 · Updated 5 months ago
- Certified Reasoning with Language Models ☆31 · Updated last year
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆16 · Updated last year