Pivotal Token Search
☆145Dec 20, 2025Updated 2 months ago
Alternatives and similar repositories for pts
Users who are interested in pts are comparing it to the libraries listed below.
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- ☆18Dec 9, 2025Updated 2 months ago
- ☆17Dec 16, 2024Updated last year
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- ☆56Nov 6, 2024Updated last year
- private-machine is an AI companion system with emotion, needs and goals simulation. Very silly, not based on real science.☆29Updated this week
- ☆21Jul 25, 2025Updated 7 months ago
- Using deep research workflow to generate datasets for finetuning LLMs.☆38Oct 9, 2025Updated 4 months ago
- Git-native prompt management and testing framework for production LLM workflows☆18Jul 2, 2025Updated 7 months ago
- ☆15Apr 9, 2025Updated 10 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Come raise an AI cat with its own dedicated memory!☆10Oct 25, 2024Updated last year
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 2 weeks ago
- The rag pipeline for optimizing dynamic data editing.☆20Oct 30, 2025Updated 4 months ago
- A daily benchmark to regression-test cloud LLMs☆17Aug 7, 2025Updated 6 months ago
- Screenshot LLM is a Python application that leverages the power of AI to analyze screenshots. Built with PyQt6 for a user-friendly interf…☆45Nov 4, 2024Updated last year
- [NeurIPS'25 Oral] Query-agnostic KV cache eviction: 3–4× reduction in memory and 2× decrease in latency (Qwen3/2.5, Gemma3, LLaMA3)☆204Feb 11, 2026Updated 2 weeks ago
- Optimizing inference proxy for LLMs☆3,342Jan 28, 2026Updated last month
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 10 months ago
- Run Open Source Local AI Models in Excel with Ollama☆24Aug 11, 2025Updated 6 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆56Feb 10, 2025Updated last year
- code for training and using chess embeddings models☆13Jun 9, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- ☆11Feb 20, 2025Updated last year
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Feb 11, 2026Updated 2 weeks ago
- Official implementation of UnifiedReward & UnifiedReward-Think☆18Jun 18, 2025Updated 8 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆41Jan 27, 2026Updated last month
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 4 months ago
- win32 native frontend for llama-cli☆12Nov 2, 2024Updated last year
- ☆21Feb 13, 2025Updated last year
- Speech-to-text typing for Linux/Wayland using Whisper.☆38Dec 6, 2025Updated 2 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Feb 17, 2026Updated last week
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆102Jul 19, 2025Updated 7 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Nov 4, 2024Updated last year
- LLMProxy is an intelligent large language model backend routing proxy service.☆22Dec 6, 2025Updated 2 months ago
- Chinese version of hf-alignment-handbook: a complete set of SFT, DPO, ORPO, and CPT training tutorials for large models.☆14Aug 25, 2024Updated last year
- ☆15Jun 4, 2025Updated 8 months ago
- ☆15Feb 1, 2025Updated last year