Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27Jul 12, 2023Updated 2 years ago
Alternatives and similar repositories for prm800k-denorm
Users that are interested in prm800k-denorm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Token-level adaptation of LoRA matrices for downstream task generalization.☆15Apr 14, 2024Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- ☆74Sep 5, 2023Updated 2 years ago
- Learning Formal Mathematics from Intrinsic Motivation☆36Jul 10, 2025Updated 11 months ago
- 🏆 Ambassador Paper for Innovative Use of NLP for Building Educational Applications 2023: Is ChatGPT a Good Teacher Coach? Measuring Zero…☆14Jul 21, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆26Dec 20, 2023Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark☆429Sep 12, 2023Updated 2 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆32Oct 12, 2023Updated 2 years ago
- Get Telemetry Data from YAMCS in OpenMCT☆10Sep 15, 2017Updated 8 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context …☆22Mar 1, 2024Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 4 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- RASP-L in Haskell for my fellow rascals☆20Dec 3, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Pydantic-based HTTP forms☆19Jun 2, 2025Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆195Jul 9, 2025Updated 11 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆35Apr 22, 2024Updated 2 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations☆32Jun 1, 2023Updated 3 years ago
- Debian packaging for NNCP [archived], moved to https://salsa.debian.org/go-team/packages/nncp☆14Feb 18, 2023Updated 3 years ago
- ☆22Aug 27, 2023Updated 2 years ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training☆287May 26, 2024Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Solving Inequality Proofs with Large Language Models.☆59Dec 15, 2025Updated 6 months ago
- [EMNLP 2024] Official repository for paper "From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis"☆22Oct 15, 2024Updated last year
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf☆27May 25, 2024Updated 2 years ago
- Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems☆16Dec 27, 2023Updated 2 years ago
- Learning to code TensorFlow☆10Jan 14, 2018Updated 8 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models☆12Dec 9, 2022Updated 3 years ago
- Less is More: High-value Data Selection for Visual Instruction Tuning☆19Jan 18, 2025Updated last year
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆183Jul 8, 2025Updated 11 months ago
- ☆25Apr 21, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- A cog model for the all-mpnet-base-v2 sentence-transformers embedding model.☆15Jan 3, 2024Updated 2 years ago
- ☆26Jun 4, 2026Updated 2 weeks ago
- Code for Contrastive Preference Learning (CPL)☆182Nov 22, 2024Updated last year
- This is MPE-pytorch, fix some bugs.☆11Apr 26, 2020Updated 6 years ago
- Code repository for the c-BTM paper☆109Sep 26, 2023Updated 2 years ago
- Codebase for Mechanistic Mode Connectivity☆13Jul 14, 2023Updated 2 years ago