Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 · Jul 12, 2023 · Updated 2 years ago
Alternatives and similar repositories for prm800k-denorm
Users that are interested in prm800k-denorm are comparing it to the libraries listed below.
- Token-level adaptation of LoRA matrices for downstream task generalization. ☆15 · Apr 14, 2024 · Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 · Feb 11, 2024 · Updated 2 years ago
- ☆15 · Sep 7, 2022 · Updated 3 years ago
- ☆74 · Sep 5, 2023 · Updated 2 years ago
- Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles. ☆41 · May 6, 2026 · Updated 3 weeks ago
- 🏆 Ambassador Paper for Innovative Use of NLP for Building Educational Applications 2023: Is ChatGPT a Good Teacher Coach? Measuring Zero… ☆14 · Jul 21, 2024 · Updated last year
- ☆26 · Dec 20, 2023 · Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark ☆430 · Sep 12, 2023 · Updated 2 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang ☆14 · Jan 4, 2024 · Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words ☆38 · May 26, 2022 · Updated 4 years ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback ☆12 · Jul 13, 2022 · Updated 3 years ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆147 · Sep 20, 2024 · Updated last year
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models" ☆22 · Aug 14, 2025 · Updated 9 months ago
- Auto-Video maker handling many AI's ☆11 · Mar 18, 2024 · Updated 2 years ago
- Easy to deploy your LLM (large language model) server with no public address GPU machine. ☆15 · Apr 30, 2024 · Updated 2 years ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning ☆195 · Jul 9, 2025 · Updated 10 months ago
- LLM plugin for models hosted by Anyscale Endpoints ☆35 · Apr 22, 2024 · Updated 2 years ago
- Code for "Revisiting Batch Norm Initialization". ☆12 · Jul 14, 2022 · Updated 3 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆83 · Sep 10, 2023 · Updated 2 years ago
- ☆22 · Aug 27, 2023 · Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Jul 3, 2024 · Updated last year
- inference code for mixtral-8x7b-32kseqlen ☆104 · Dec 12, 2023 · Updated 2 years ago
- ☆10 · Feb 12, 2024 · Updated 2 years ago
- Solving Inequality Proofs with Large Language Models. ☆58 · Dec 15, 2025 · Updated 5 months ago
- [EMNLP 2024] Official repository for paper "From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis" ☆22 · Oct 15, 2024 · Updated last year
- Code for the paper LeanReasoner: Boosting Complex Logical Reasoning with Lean: https://arxiv.org/pdf/2403.13312.pdf ☆27 · May 25, 2024 · Updated 2 years ago
- Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems ☆16 · Dec 27, 2023 · Updated 2 years ago
- Learning to code TensorFlow ☆10 · Jan 14, 2018 · Updated 8 years ago
- A crowd-powered database system, with SQL-like query interface, multi-goal optimization ☆11 · Sep 4, 2017 · Updated 8 years ago
- ☆43 · Dec 31, 2023 · Updated 2 years ago
- Code for Semi-crowdsourced Clustering with Deep Generative Models ☆12 · Dec 9, 2022 · Updated 3 years ago
- Less is More: High-value Data Selection for Visual Instruction Tuning ☆19 · Jan 18, 2025 · Updated last year
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025] ☆183 · Jul 8, 2025 · Updated 10 months ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data. ☆11 · Nov 3, 2020 · Updated 5 years ago
- A cog model for the all-mpnet-base-v2 sentence-transformers embedding model. ☆15 · Jan 3, 2024 · Updated 2 years ago
- ☆25 · Jan 1, 2025 · Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆72 · Mar 26, 2023 · Updated 3 years ago
- tinygrad port of the RWKV large language model. ☆44 · Mar 9, 2025 · Updated last year
- Code for Contrastive Preference Learning (CPL) ☆182 · Nov 22, 2024 · Updated last year