DAMO-NLP-SG / CaRing
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs
☆34Updated last year
Alternatives and similar repositories for CaRing:
Users that are interested in CaRing are comparing it to the libraries listed below
- the instructions and demonstrations for building a formal logical reasoning capable GLM☆53Updated 7 months ago
- Evaluate the Quality of Critique☆34Updated 10 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆32Updated 6 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆69Updated last year
- ☆21Updated 3 weeks ago
- [arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"☆33Updated 4 months ago
- ☆19Updated 2 weeks ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆47Updated 3 months ago
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆26Updated last year
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 5 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆47Updated last year
- ☆35Updated last year
- ☆47Updated 4 months ago
- Aioli: A unified optimization framework for language model data mixing☆23Updated 3 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆41Updated last year
- ☆28Updated 5 months ago
- ☆22Updated 4 months ago
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- ☆24Updated 3 months ago
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆92Updated last week
- ☆43Updated 8 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- ☆52Updated 9 months ago
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated last year
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆33Updated last month
- ☆29Updated 3 months ago
- Benchmarking Benchmark Leakage in Large Language Models☆51Updated 11 months ago