antirez / LLM-FTC-samplingLinks
First token cutoff sampling inference example
☆30Updated last year
Alternatives and similar repositories for LLM-FTC-sampling
Users that are interested in LLM-FTC-sampling are comparing it to the libraries listed below
Sorting:
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 9 months ago
- Benchmarks comparing PyTorch and MLX on Apple Silicon GPUs☆85Updated 11 months ago
- ☆20Updated 3 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 8 months ago
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Updated 4 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 5 months ago
- MLX support for the Open Neural Network Exchange (ONNX)☆52Updated last year
- Because it's there.☆16Updated 9 months ago
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆30Updated 9 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆29Updated last year
- ☆13Updated last year
- Training hybrid models for dummies.☆23Updated 5 months ago
- Latent Large Language Models☆18Updated 9 months ago
- ☆66Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 9 months ago
- A collection of optimizers for MLX☆36Updated 3 weeks ago
- Ongoing research training transformer models at scale☆37Updated last year
- NanoGPT (124M) quality in 2.67B tokens☆28Updated last month
- ☆38Updated last year
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆71Updated 4 months ago
- Tools for formatting large language model prompts.☆13Updated last year
- Lightweight tools for quick and easy LLM demo's☆28Updated 9 months ago
- ANE accelerated embedding models!☆18Updated 6 months ago
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 3 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆20Updated 6 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆53Updated 4 months ago
- LLama implementations benchmarking framework☆12Updated last year
- ☆63Updated last month