Caveman Compression is a semantic compression method for LLM contexts. It removes predictable grammar while preserving the unpredictable, factual content that defines meaning.
☆488Dec 3, 2025Updated 4 months ago
Alternatives and similar repositories for caveman-compression
Users that are interested in caveman-compression are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of experimental Retrieval Augmented Generation (RAG) Techniques to elevate your pipelines, all with code and intuitive expla…☆36Jul 21, 2025Updated 8 months ago
- WebAISum is a Python script that allows you to summarize web pages using AI models. It supports both local models like Ollama and remote …☆15Apr 28, 2024Updated last year
- An extendable, interactive command launcher. Inspired by spotlight & flashlight.☆12May 12, 2023Updated 2 years ago
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 6 months ago
- Open source static analysis toolkit for LLM agent plans☆12Aug 9, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 6 months ago
- Simple and Ideal Circuit Simulation☆13Dec 4, 2017Updated 8 years ago
- Sparse Inferencing for transformer based LLMs☆217Mar 25, 2026Updated 3 weeks ago
- ☆13Apr 8, 2026Updated last week
- speech to text gui for different (mostly Whisper, also Voxtral) models and backends, including whisper.cpp, mlx-whisper, faster-whisper, …☆11Dec 7, 2025Updated 4 months ago
- 🚀 YOOtheme Pro - Starter Plugin for Wordpress and Joomla☆19Nov 25, 2025Updated 4 months ago
- A loader that lets you try running LLMs built for WebGPU.☆29Dec 20, 2023Updated 2 years ago
- macOS accessibility API showcase.☆11Jun 27, 2025Updated 9 months ago
- My website & blog with articles about coding, tech, functional programming, …☆10Mar 28, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- REAP expert pruning for MoE LLMs on Apple Silicon via MLX☆53Mar 16, 2026Updated last month
- "Learning-based One-line intelligence Owner Network Connectivity Tool"☆15Apr 19, 2023Updated 2 years ago
- Example showing how to run a LLM fully inside an AWS Lambda Function☆23Jan 13, 2024Updated 2 years ago
- A PyTorch implementation of a conditional Denoising Diffusion Probabilistic Model (DDPM) for multi-modal trajectory prediction. This proj…☆37Feb 20, 2026Updated last month
- Genesis is a groundbreaking physics platform designed for robotics and embodied AI applications that combines unprecedented simulation sp…☆34Sep 5, 2025Updated 7 months ago
- ☆13Jan 14, 2026Updated 3 months ago
- Python implementation of the Vercel AI SDK's "Data Stream Protocol".☆26Jun 15, 2025Updated 10 months ago
- A Swift package for seamless integration with OpenAI's API, enabling advanced chat capabilities and token management in your applications☆34Jul 25, 2024Updated last year
- Implementation of Hippoformer, Integrating Hippocampus-inspired Spatial Memory with Transformers☆50Feb 5, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [CVPR 2025] Official Implementation of LOCORE: Image Re-ranking with Long-Context☆15Apr 15, 2025Updated last year
- ☆18Mar 17, 2026Updated 3 weeks ago
- A web application for studying Ancient Greek texts with integrated lexical, syntactic, and morphological analysis tools.☆22Dec 1, 2025Updated 4 months ago
- Train and run transformers directly on Apple's Neural Engine in Swift bypass coreml entirely☆94Updated this week
- ☆40Feb 14, 2026Updated 2 months ago
- An extension to original C# client implementation.☆23Jul 30, 2018Updated 7 years ago
- [COLM '24] Source-Aware Training Enables Knowledge Attribution in Language Models☆18Apr 1, 2025Updated last year