Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format
☆27 Jul 12, 2023 Updated 2 years ago
Alternatives and similar repositories for prm800k-denorm
Users that are interested in prm800k-denorm are comparing it to the libraries listed below.
- Token-level adaptation of LoRA matrices for downstream task generalization. ☆15 Apr 14, 2024 Updated last year
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations. ☆12 Feb 11, 2024 Updated 2 years ago
- Repository for Skill Set Optimization ☆14 Jul 26, 2024 Updated last year
- ☆15 Sep 7, 2022 Updated 3 years ago
- Eh, simple and works. ☆27 Dec 9, 2023 Updated 2 years ago
- ☆74 Sep 5, 2023 Updated 2 years ago
- Inverse Constitutional AI [ICLR 2025]: compressing pairwise preference data into a short constitution of principles. ☆41 Mar 2, 2026 Updated 3 weeks ago
- Learning Formal Mathematics from Intrinsic Motivation ☆37 Jul 10, 2025 Updated 8 months ago
- 🏆 Ambassador Paper for Innovative Use of NLP for Building Educational Applications 2023: Is ChatGPT a Good Teacher Coach? Measuring Zero… ☆14 Jul 21, 2024 Updated last year
- ☆25 Dec 20, 2023 Updated 2 years ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents" ☆31 Oct 12, 2023 Updated 2 years ago
- Run evaluation on LLMs using human-eval benchmark ☆429 Sep 12, 2023 Updated 2 years ago
- Inference Llama 2 in one file of pure C ☆14 Jul 24, 2023 Updated 2 years ago
- Get Telemetry Data from YAMCS in OpenMCT ☆10 Sep 15, 2017 Updated 8 years ago
- Official github repo for the paper "Compression Represents Intelligence Linearly" [COLM 2024] ☆147 Sep 20, 2024 Updated last year
- Teaching Models to Express Their Uncertainty in Words ☆38 May 26, 2022 Updated 3 years ago
- RASP-L in Haskell for my fellow rascals ☆20 Dec 3, 2023 Updated 2 years ago
- LLM training in simple, raw C/CUDA ☆18 May 6, 2024 Updated last year
- LLM plugin for models hosted by Anyscale Endpoints ☆35 Apr 22, 2024 Updated last year
- Auto-Video maker handling many AI's ☆11 Mar 18, 2024 Updated 2 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ☆33 Jun 1, 2023 Updated 2 years ago
- Debian packaging for NNCP [archived], moved to https://salsa.debian.org/go-team/packages/nncp ☆14 Feb 18, 2023 Updated 3 years ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes. ☆83 Sep 10, 2023 Updated 2 years ago
- (ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training ☆285 May 26, 2024 Updated last year
- ☆22 Aug 27, 2023 Updated 2 years ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆21 Jul 3, 2024 Updated last year
- ☆10 Feb 12, 2024 Updated 2 years ago
- [EMNLP 2024] Official repository for paper "From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis" ☆21 Oct 15, 2024 Updated last year
- Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems ☆16 Dec 27, 2023 Updated 2 years ago
- Less is More: High-value Data Selection for Visual Instruction Tuning ☆17 Jan 18, 2025 Updated last year
- Extension for AUTOMATIC1111/stable-diffusion-webui for pasting images from clipboard in any WebUI form. ☆16 Nov 22, 2023 Updated 2 years ago
- ☆25 Apr 21, 2021 Updated 4 years ago
- A cog model for the all-mpnet-base-v2 sentence-transformers embedding model. ☆15 Jan 3, 2024 Updated 2 years ago
- ☆25 Jan 1, 2025 Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions ☆72 Mar 26, 2023 Updated 3 years ago
- Code for Contrastive Preference Learning (CPL) ☆180 Nov 22, 2024 Updated last year
- ☆14 Oct 31, 2023 Updated 2 years ago
- Save a picture as Webp file in Comfy + Workflow loading ☆43 Jun 21, 2024 Updated last year
- InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning ☆285 Aug 20, 2023 Updated 2 years ago