☆25 · Apr 3, 2025 · Updated last year
Alternatives and similar repositories for reversal-curse-binding
Users that are interested in reversal-curse-binding are comparing it to the libraries listed below.
- An Autonomous Curriculum Reinforcement Learning framework that steers agents to continually learn in specific environments with zero huma… ☆34 · May 13, 2026 · Updated 2 weeks ago
- MUFFIN: Curating Multi-Faceted Instructions for Improving Instruction-Following ☆16 · Oct 31, 2024 · Updated last year
- Efficient Symptom Inquiring and Diagnosis via Adaptive Alignment of Reinforcement Learning and Classification [AI in Medicine Journal] ☆12 · May 20, 2022 · Updated 4 years ago
- Parse Symcat (http://www.symcat.com) symptoms and conditions and generate valid Synthea (https://github.com/synthetichealth/synthea) modu… ☆16 · Jan 28, 2021 · Updated 5 years ago
- Repository for Sparse Universal Transformers ☆20 · Oct 23, 2023 · Updated 2 years ago
- Learning from preferences is a common paradigm for fine-tuning language models. Yet, many algorithmic design decisions come into play. Ou… ☆32 · Apr 20, 2024 · Updated 2 years ago
- [ACL'25 (Findings)] Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents ☆29 · Feb 17, 2026 · Updated 3 months ago
- Beyond KV Caching: Shared Attention for Efficient LLMs ☆20 · Jul 19, 2024 · Updated last year
- ☆21 · Aug 27, 2023 · Updated 2 years ago
- ☆93 · Aug 18, 2024 · Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆54 · Feb 23, 2024 · Updated 2 years ago
- ☆16 · Feb 22, 2025 · Updated last year
- Website for release of TellMeWhy dataset for why question answering ☆14 · Nov 11, 2022 · Updated 3 years ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044 ☆35 · Oct 3, 2024 · Updated last year
- ☆13 · Nov 15, 2022 · Updated 3 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks ☆10 · Nov 27, 2024 · Updated last year
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales" ☆12 · Sep 12, 2023 · Updated 2 years ago
- [NeurIPS'25 D&B] Mind2Web-2 Benchmark: Evaluating Agentic Search with Agent-as-a-Judge ☆111 · May 17, 2026 · Updated last week
- Transformer implemented with graph attention network (GAT) layers from PyTorch Geometric ☆18 · Aug 14, 2022 · Updated 3 years ago
- ☆11 · Jun 2, 2022 · Updated 3 years ago
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf ☆27 · May 25, 2024 · Updated 2 years ago
- [NeurIPS 2024 Spotlight] Code and data for the paper "Finding Transformer Circuits with Edge Pruning". ☆68 · Aug 15, 2025 · Updated 9 months ago
- Wenzhou-Kean University AI-LAB ☆10 · Jun 6, 2022 · Updated 3 years ago
- ☆18 · Jul 20, 2022 · Updated 3 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales" ☆16 · Aug 11, 2023 · Updated 2 years ago
- ☆18 · Jan 3, 2025 · Updated last year
- ☆12 · May 26, 2022 · Updated 4 years ago
- Official Implementation of "GEAR: Augmenting Language Models with Generalizable and Efficient Tool Resolution" ☆20 · Apr 3, 2024 · Updated 2 years ago
- Symmetric Encryption with Language Models ☆13 · Jun 13, 2023 · Updated 2 years ago
- [NeurIPS 2025] What Makes a Reward Model a Good Teacher? An Optimization Perspective ☆44 · Sep 18, 2025 · Updated 8 months ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion ☆13 · Jul 26, 2023 · Updated 2 years ago
- Official code for the paper Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception. The code is based on t… ☆20 · Aug 5, 2025 · Updated 9 months ago
- The implementation for ACL 2022 paper ☆20 · Aug 14, 2022 · Updated 3 years ago
- Code for the Ask4Help project ☆22 · Nov 24, 2022 · Updated 3 years ago
- Perform bayesian distribution regression ☆13 · Mar 19, 2018 · Updated 8 years ago
- ☆19 · Nov 7, 2022 · Updated 3 years ago
- [AACL 2023] Official implementation of paper "Towards LLM-based Fact Verification on News Claims with a Hierarchical Step-by-Step Prompti… ☆21 · Apr 1, 2024 · Updated 2 years ago
- ☆25 · Dec 13, 2024 · Updated last year
- Code for the CIKM'23 paper "A Retrieve-and-Read Framework for Knowledge Graph Link Prediction" ☆13 · Mar 23, 2025 · Updated last year