☆20Dec 14, 2024Updated last year
Alternatives and similar repositories for ER-PRM
Users that are interested in ER-PRM are comparing it to the libraries listed below
- [arxiv: 2512.19673] Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies☆61Feb 6, 2026Updated last month
- Docs of NLP/deep Learning/machine learning, etc. https://siat-nlp.github.io/docs☆11Jul 17, 2019Updated 6 years ago
- [TMLR] Triple Preference Optimization☆30Feb 19, 2025Updated last year
- The attention map viewer for LLaMA models.☆37Dec 16, 2023Updated 2 years ago
- Open-source radar station vision code (2021) from the Evolution team at Guilin University of Electronic Technology☆11Sep 3, 2021Updated 4 years ago
- ☆13Oct 11, 2024Updated last year
- A Flexible Framework for Generative Recommendation☆27Updated this week
- COVID-19 Risk Estimation for L.A. County using a Bayesian Time-varying SIR-model☆12Feb 17, 2023Updated 3 years ago
- Code and models for EMNLP 2024 paper "WPO: Enhancing RLHF with Weighted Preference Optimization"☆41Sep 24, 2024Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Nov 11, 2024Updated last year
- An Easy Annotation Tool for Natural Language Processing☆11May 17, 2024Updated last year
- ☆13Jun 25, 2025Updated 8 months ago
- Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer"☆15Jul 17, 2024Updated last year
- This repository contains the replication package of our paper "Assessing the Security of GitHub Copilot’s Generated Code - A Targeted Rep…☆10Nov 16, 2023Updated 2 years ago
- ☆11Oct 12, 2021Updated 4 years ago
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training☆14Oct 25, 2024Updated last year
- An approximate implementation of the OpenAI paper - An Empirical Model of Large-Batch Training for MNIST☆11Nov 19, 2022Updated 3 years ago
- ☆11Feb 11, 2026Updated 3 weeks ago
- Automated Design of Agentic Systems☆10Sep 7, 2024Updated last year
- ☆12Jul 25, 2023Updated 2 years ago
- Code of the paper "Synthesizing Aspect-Driven Recommendation Explanations from Reviews", IJCAI'20☆10Apr 5, 2024Updated last year
- ☆11Aug 8, 2018Updated 7 years ago
- This repository has been redirected into https://kuaisar.github.io/.☆11Oct 12, 2023Updated 2 years ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- ☆10Apr 15, 2023Updated 2 years ago
- [ICML'25] MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents☆21Jul 31, 2025Updated 7 months ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…☆15Oct 16, 2023Updated 2 years ago
- ☆11May 28, 2024Updated last year
- This is the source code for Efficient Sequential Recommendation for Long Term User Interest Via Personalization.☆23Nov 18, 2025Updated 3 months ago
- LLMGeo: Benchmarking Large Language Models on Image Geolocation In-the-wild☆16Oct 31, 2024Updated last year
- INDICT: Code Generation with Internal Dialogues of Critiques for Both Security and Helpfulness☆14Nov 10, 2025Updated 3 months ago
- ☆10Jan 21, 2020Updated 6 years ago
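- Open-source vision code for the intelligent logistics handling track of the 2021 National Undergraduate Engineering Training Integration Ability Competition.☆13Sep 27, 2022Updated 3 years ago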
- The code of paper "Nonlinear Hybrid Planning with Deep Net Learned Transition Models and Mixed-Integer Linear Programming." published on …☆10Apr 27, 2018Updated 7 years ago
- ☆14Feb 20, 2025Updated last year
- Birdiebot Target Perception And Decision Making Framework☆13Aug 29, 2022Updated 3 years ago
- Dockerized openconnect client. Compatible with Cisco Anyconnect (CSD). Exposes socks5 proxy.☆13Oct 16, 2020Updated 5 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards☆37Oct 3, 2025Updated 5 months ago
- Training diffusion model with CIFAR10 dataset (insight from 13 papers)☆15Aug 5, 2025Updated 7 months ago