โ17Nov 3, 2024Updated last year
Alternatives and similar repositories for prm
Users that are interested in prm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.โ64Oct 3, 2024Updated last year
- [๐๐๐๐๐ ๐ ๐ข๐ง๐๐ข๐ง๐ ๐ฌ ๐๐๐๐ & ๐๐๐ ๐๐๐๐ ๐๐๐๐๐ ๐๐ซ๐๐ฅ] ๐๐ฏ๐ฉ๐ข๐ฏ๐ค๐ช๐ฏ๐จ ๐๐ข๐ต๐ฉ๐ฆ๐ฎ๐ข๐ต๐ช๐ค๐ข๐ญ ๐๐ฆ๐ข๐ด๐ฐ๐ฏ๐ช๐ฏโฆโ51May 4, 2024Updated 2 years ago
- โ27Apr 11, 2023Updated 3 years ago
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweightingโ24Jul 30, 2024Updated last year
- โ20Dec 14, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer โข AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ๐ป Terminal-Agent with Human-in-the-Loop Learningโ39Jan 16, 2026Updated 3 months ago
- Implementations of Influential Recommender Systemโ11Oct 29, 2024Updated last year
- [NeurIPS'24] Official code for *๐ฏDART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*โ121Dec 10, 2024Updated last year
- The official repository for paper "MLLM-Protector: Ensuring MLLMโs Safety without Hurting Performance"โ46Apr 21, 2024Updated 2 years ago
- [Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routingโ58Apr 6, 2026Updated 3 weeks ago
- โ44Jun 25, 2025Updated 10 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningโ126May 6, 2025Updated last year
- AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agentsโ54Jan 28, 2025Updated last year
- Fetch a random wallpaper from Konachan.โ10Jun 4, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI โข AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradientโ67Aug 3, 2025Updated 9 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"โ17Mar 2, 2026Updated 2 months ago
- Align, a general text alignment functionโ15Dec 7, 2023Updated 2 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Populaโฆโ11Oct 18, 2022Updated 3 years ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don'tโฆโ134Jul 10, 2024Updated last year
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"โ12Aug 16, 2022Updated 3 years ago
- Neural theorem proving evaluation via the Lean REPLโ23Jul 12, 2025Updated 9 months ago
- โ39May 2, 2024Updated 2 years ago
- [ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent systemโ162Updated this week
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for Skill Set Optimizationโ14Jul 26, 2024Updated last year
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"โ11Sep 20, 2024Updated last year
- โ59Nov 18, 2024Updated last year
- โ30Dec 27, 2024Updated last year
- โ17Mar 22, 2024Updated 2 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plannโฆโ14Nov 3, 2023Updated 2 years ago
- Implementation of AdaCQR(COLING 2025)โ15Dec 30, 2024Updated last year
- LLM as World Models using Bayesian inferenceโ17May 27, 2025Updated 11 months ago
- ๆฌ้กน็ฎๅฉ็จๆทฑๅบฆๅญฆไน ๆๆฏ๏ผๅฎๆถๆฃๆตไบบไฝ3Dๅงฟๆ๏ผๅนถๅบไบๆญค้ขๆตๆชๆฅไบบไฝๅจไฝใ้็จmmposeๆกๆถไธๅค่ฟ็จๆๆฏๅฎ็ฐๅ็ซฏๅฟซ้้ขๆต๏ผๅฉ็จๆททๅ็ฐๅฎHololens2ๅคดๆดๆพ็คบๅจๆพ็คบไบบ็ฉๅจไฝ๏ผๅๅฐๅฎๆถๆๅ๏ผๅฎๆถ้ขๆต๏ผๅฎๆถๆพ็คบใโ12Oct 30, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean โข AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".โ83Jan 14, 2025Updated last year
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approachโ32Nov 6, 2023Updated 2 years ago
- Keep your cat happy with toys you earned from focusing! (SwiftUI iOS App)โ38Oct 4, 2021Updated 4 years ago
- The source code and dataset for our paper "Integrating Relation Constraints with Neural Relation Extractors" which is publicated at AAAI โฆโ15Mar 25, 2020Updated 6 years ago
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)โ18Nov 24, 2022Updated 3 years ago
- [EMNLP 2025 Findings] Retrieval-Augmented Machine Translation with Unstructured Knowledgeโ15Sep 4, 2025Updated 8 months ago
- ICDM2022ๅคง่งๆจก็ตๅๅพไธ็้ฃ้ฉๅๅๆฃๆตๆฏ่ต๏ผ็ฌฌๅ ญๅ๏ผโ19Sep 21, 2022Updated 3 years ago