changjonathanc / llmprocLinks
LLMProc: Unix-inspired runtime that treats LLMs as processes.
☆24Updated last week
Alternatives and similar repositories for llmproc
Users that are interested in llmproc are comparing it to the libraries listed below
Sorting:
- ☆62Updated 3 weeks ago
- look how they massacred my boy☆63Updated 8 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- Lego for GRPO☆28Updated 3 weeks ago
- Entropy Based Sampling and Parallel CoT Decoding☆17Updated 8 months ago
- Small, simple agent task environments for training and evaluation☆18Updated 7 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 7 months ago
- Train your own SOTA deductive reasoning model☆94Updated 3 months ago
- ☆96Updated last week
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 3 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆69Updated 4 months ago
- ☆38Updated 10 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆27Updated 2 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆68Updated 3 months ago
- ☆69Updated 3 months ago
- Approximating the joint distribution of language models via MCTS☆21Updated 7 months ago
- entropix style sampling + GUI☆26Updated 7 months ago
- Modify Entropy Based Sampling to work with Mac Silicon via MLX☆50Updated 7 months ago
- ☆14Updated last week
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆80Updated last month
- Modded vLLM to run pipeline parallelism over public networks☆37Updated last month
- rl from zero pretrain, can it be done? we'll see.☆49Updated this week
- Train Large Language Models on MLX.☆91Updated this week
- Prompt design in Python☆60Updated 6 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆69Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆80Updated 3 months ago
- SIMD quantization kernels☆71Updated last week
- train entropix like a champ!☆20Updated 8 months ago
- Letting Claude Code develop his own MCP tools :)☆109Updated 3 months ago