โ17Nov 3, 2024Updated last year
Alternatives and similar repositories for prm
Users that are interested in prm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.โ61Oct 3, 2024Updated last year
- [๐๐๐๐๐ ๐ ๐ข๐ง๐๐ข๐ง๐ ๐ฌ ๐๐๐๐ & ๐๐๐ ๐๐๐๐ ๐๐๐๐๐ ๐๐ซ๐๐ฅ] ๐๐ฏ๐ฉ๐ข๐ฏ๐ค๐ช๐ฏ๐จ ๐๐ข๐ต๐ฉ๐ฆ๐ฎ๐ข๐ต๐ช๐ค๐ข๐ญ ๐๐ฆ๐ข๐ด๐ฐ๐ฏ๐ช๐ฏโฆโ51May 4, 2024Updated last year
- โ27Apr 11, 2023Updated 3 years ago
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweightingโ24Jul 30, 2024Updated last year
- โ20Dec 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways โข AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ๐ป Terminal-Agent with Human-in-the-Loop Learningโ39Jan 16, 2026Updated 3 months ago
- [NeurIPS'24] Official code for *๐ฏDART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*โ121Dec 10, 2024Updated last year
- The official repository for paper "MLLM-Protector: Ensuring MLLMโs Safety without Hurting Performance"โ45Apr 21, 2024Updated last year
- โ41Jun 25, 2025Updated 9 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningโ124May 6, 2025Updated 11 months ago
- AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agentsโ54Jan 28, 2025Updated last year
- Fetch a random wallpaper from Konachan.โ10Jun 4, 2018Updated 7 years ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)โ23Jun 2, 2025Updated 10 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"โ17Mar 2, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Align, a general text alignment functionโ15Dec 7, 2023Updated 2 years ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!โ72Apr 1, 2025Updated last year
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Populaโฆโ11Oct 18, 2022Updated 3 years ago
- Repo for EmbedLLM: Learning Compact Representations of Large Language Modelsโ29Sep 25, 2025Updated 6 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don'tโฆโ132Jul 10, 2024Updated last year
- Codes and data for CIKM 2022 paper "RuDi: Explaining Behavior Sequence Models by Automatic Statistics Generation and Rule Distillation"โ12Aug 16, 2022Updated 3 years ago
- Neural theorem proving evaluation via the Lean REPLโ23Jul 12, 2025Updated 9 months ago
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)โ50Apr 19, 2024Updated last year
- โ16May 16, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient โข AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Repository for Skill Set Optimizationโ14Jul 26, 2024Updated last year
- UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experienceโ49Apr 3, 2026Updated last week
- โ13Jun 17, 2024Updated last year
- Code Repository for the EMCL-PKDD 2021 "Multitask Recalibrated Aggregation Network for Medical Code Prediction)โ13Sep 7, 2021Updated 4 years ago
- โ30Dec 27, 2024Updated last year
- โ16Mar 22, 2024Updated 2 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plannโฆโ14Nov 3, 2023Updated 2 years ago
- Implementation of AdaCQR(COLING 2025)โ15Dec 30, 2024Updated last year
- LLM as World Models using Bayesian inferenceโ17May 27, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to vโฆโ15Apr 14, 2025Updated last year
- ๆฌ้กน็ฎๅฉ็จๆทฑๅบฆๅญฆไน ๆๆฏ๏ผๅฎๆถๆฃๆตไบบไฝ3Dๅงฟๆ๏ผๅนถๅบไบๆญค้ขๆตๆชๆฅไบบไฝๅจไฝใ้็จmmposeๆกๆถไธๅค่ฟ็จๆๆฏๅฎ็ฐๅ็ซฏๅฟซ้้ขๆต๏ผๅฉ็จๆททๅ็ฐๅฎHololens2ๅคดๆดๆพ็คบๅจๆพ็คบไบบ็ฉๅจไฝ๏ผๅๅฐๅฎๆถๆๅ๏ผๅฎๆถ้ขๆต๏ผๅฎๆถๆพ็คบใโ12Oct 30, 2023Updated 2 years ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".โ83Jan 14, 2025Updated last year
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approachโ32Nov 6, 2023Updated 2 years ago
- The source code and dataset for our paper "Integrating Relation Constraints with Neural Relation Extractors" which is publicated at AAAI โฆโ15Mar 25, 2020Updated 6 years ago
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)โ18Nov 24, 2022Updated 3 years ago
- ICDM2022ๅคง่งๆจก็ตๅๅพไธ็้ฃ้ฉๅๅๆฃๆตๆฏ่ต๏ผ็ฌฌๅ ญๅ๏ผโ19Sep 21, 2022Updated 3 years ago