guidance-ai / jsonschemabench
☆29Updated this week
Alternatives and similar repositories for jsonschemabench:
Users that are interested in jsonschemabench are comparing it to the libraries listed below
- OLMost every training recipe you need to perform data interventions with the OLMo family of models.☆23Updated this week
- Very minimal (and stateless) agent framework☆42Updated 3 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆21Updated 5 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated 3 weeks ago
- Lightweight tools for quick and easy LLM demo's☆26Updated 7 months ago
- Training hybrid models for dummies.☆20Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM☆71Updated 7 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 5 months ago
- ☆48Updated 5 months ago
- Simple GRPO scripts and configurations.☆58Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- ☆66Updated 10 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated last year
- ☆13Updated 4 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- ☆24Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 7 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆11Updated 2 weeks ago
- ☆22Updated 10 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆34Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 7 months ago
- ☆51Updated last week
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆27Updated 2 months ago
- ☆20Updated last year
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆33Updated last week
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- ☆41Updated 4 months ago