Picovoice / llm-compression-benchmark
LLM Compression Benchmark
☆21Updated last month
Alternatives and similar repositories for llm-compression-benchmark:
Users who are interested in llm-compression-benchmark are comparing it to the libraries listed below:
- This project implements a demonstrator agent that compares the Cache-Augmented Generation (CAG) Framework with traditional Retrieval-Augm…☆28Updated 3 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.☆42Updated 10 months ago
- Iterate fast on your RAG pipelines☆22Updated last month
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆28Updated 3 weeks ago
- Simple GRPO scripts and configurations.☆59Updated last month
- run ollama & gguf easily with a single command☆50Updated 10 months ago
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- ☆52Updated last month
- ☆126Updated 7 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 2 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs☆18Updated last year
- GraphRag vs Embeddings☆13Updated 8 months ago
- ☆27Updated 7 months ago
- This is the code that went into our practical dive using mamba as information extraction☆53Updated last year
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- ☆66Updated 10 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆62Updated 2 months ago
- ☆48Updated 4 months ago
- Scripts to create your own moe models using mlx☆89Updated last year
- Training hybrid models for dummies.☆20Updated 2 months ago
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ☆64Updated last year
- Chat Markup Language conversation library☆55Updated last year
- Lightweight OpenAI wrapper using FastAPI. Add rate limits to OpenAI usage, optionally log and store all API calls, and share regulated Op…☆13Updated last year
- ☆22Updated 9 months ago
- ☆83Updated 3 months ago
- Fine-tuning Mistral 7B using Hugging Face, Weights and Biases, Choline, and Vast AI☆37Updated last year
- Implementation of mamba with rust☆85Updated last year
- py2dataset analyzes source code to generate structured datasets describing code content and behavior. It extracts information from Python…☆53Updated 8 months ago
- Simple LLM inference server☆20Updated 9 months ago
- ☆20Updated last year