โ17Nov 3, 2024Updated last year
Alternatives and similar repositories for prm
Users that are interested in prm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.โ64Oct 3, 2024Updated last year
- [๐๐๐๐๐ ๐ ๐ข๐ง๐๐ข๐ง๐ ๐ฌ ๐๐๐๐ & ๐๐๐ ๐๐๐๐ ๐๐๐๐๐ ๐๐ซ๐๐ฅ] ๐๐ฏ๐ฉ๐ข๐ฏ๐ค๐ช๐ฏ๐จ ๐๐ข๐ต๐ฉ๐ฆ๐ฎ๐ข๐ต๐ช๐ค๐ข๐ญ ๐๐ฆ๐ข๐ด๐ฐ๐ฏ๐ช๐ฏโฆโ51May 4, 2024Updated 2 years ago
- โ27Apr 11, 2023Updated 3 years ago
- This is the official implementation of ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweightingโ24Jul 30, 2024Updated last year
- โ20Dec 14, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting โข AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Implementations of Influential Recommender Systemโ12Oct 29, 2024Updated last year
- [NeurIPS'24] Official code for *๐ฏDART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*โ121Dec 10, 2024Updated last year
- ้ๅฏนๆ็ปๅ ธ็่กจๆ ผๅQ learning็ฎๆณ่ฟ่กไบๅค็ฐ๏ผ่ฝๅคๆฏๆgymไธญๅคงๅคๆฐ็็ฆปๆฃๅจไฝๅ็ถๆ็ฉบ้ด็็ฏๅข๏ผ่ญฌๅฆCliffWalking-v0ใโ10Jan 2, 2021Updated 5 years ago
- The official repository for paper "MLLM-Protector: Ensuring MLLMโs Safety without Hurting Performance"โ46Apr 21, 2024Updated 2 years ago
- A iOS and watchOS focus timer app ๐โ33Oct 27, 2024Updated last year
- โ45Jun 25, 2025Updated 11 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learningโ126May 6, 2025Updated last year
- AgentOccam: A Simple Yet Strong Baseline for LLM-Based Web Agentsโ57Jan 28, 2025Updated last year
- [Findings@ACL'26] LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routingโ64Apr 6, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean โข AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)โ23Jun 2, 2025Updated last year
- Official implementation for DenseMixer: Improving MoE Post-Training with Precise Router Gradientโ67Aug 3, 2025Updated 10 months ago
- Code for NAACL 2025 paper "AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge"โ16Mar 2, 2026Updated 3 months ago
- Align, a general text alignment functionโ15Dec 7, 2023Updated 2 years ago
- Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!โ71Apr 1, 2025Updated last year
- Repo for EmbedLLM: Learning Compact Representations of Large Language Modelsโ32Sep 25, 2025Updated 8 months ago
- [NAACL 2024 Outstanding Paper] Source code for the NAACL 2024 paper entitled "R-Tuning: Instructing Large Language Models to Say 'I Don'tโฆโ136Jul 10, 2024Updated last year
- Neural theorem proving evaluation via the Lean REPLโ24Jul 12, 2025Updated 11 months ago
- [ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system; [arxiv] MetaAgent-X: End-to-End Reinforcement Learning Automatic Multโฆโ185May 15, 2026Updated last month
- Virtual machines for every use case on DigitalOcean โข AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code and data for the paper "Can Large Language Models Understand Real-World Complex Instructions?"(AAAI2024)โ51Apr 19, 2024Updated 2 years ago
- โ16May 16, 2025Updated last year
- Repository for Skill Set Optimizationโ14Jul 26, 2024Updated last year
- Data and code for paper "ODSum: New Benchmarks for Open Domain Multi-Document Summarization"โ11Sep 20, 2024Updated last year
- โ57Nov 18, 2024Updated last year
- Code Repository for the EMCL-PKDD 2021 "Multitask Recalibrated Aggregation Network for Medical Code Prediction)โ13Sep 7, 2021Updated 4 years ago
- โ30Dec 27, 2024Updated last year
- โ17Mar 22, 2024Updated 2 years ago
- Implementation of AdaCQR(COLING 2025)โ15Dec 30, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer โข AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations" along with a UI to vโฆโ15Apr 14, 2025Updated last year
- LLM as World Models using Bayesian inferenceโ20May 27, 2025Updated last year
- ๆฌ้กน็ฎๅฉ็จๆทฑๅบฆๅญฆไน ๆๆฏ๏ผๅฎๆถๆฃๆตไบบไฝ3Dๅงฟๆ๏ผๅนถๅบไบๆญค้ขๆตๆชๆฅไบบไฝๅจไฝใ้็จmmposeๆกๆถไธๅค่ฟ็จๆๆฏๅฎ็ฐๅ็ซฏๅฟซ้้ขๆต๏ผๅฉ็จๆททๅ็ฐๅฎHololens2ๅคดๆดๆพ็คบๅจๆพ็คบไบบ็ฉๅจไฝ๏ผๅๅฐๅฎๆถๆๅ๏ผๅฎๆถ้ขๆต๏ผๅฎๆถๆพ็คบใโ12Oct 30, 2023Updated 2 years ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".โ84Jan 14, 2025Updated last year
- Keep your cat happy with toys you earned from focusing! (SwiftUI iOS App)โ38Oct 4, 2021Updated 4 years ago
- PyTorch implementation of experiments in the paper Aligning Language Models with Human Preferences via a Bayesian Approachโ32Nov 6, 2023Updated 2 years ago
- The source code and dataset for our paper "Integrating Relation Constraints with Neural Relation Extractors" which is publicated at AAAI โฆโ15Mar 25, 2020Updated 6 years ago