chigkim / Ollama-MMLU-ProLinks
☆109Updated 5 months ago
Alternatives and similar repositories for Ollama-MMLU-Pro
Users that are interested in Ollama-MMLU-Pro are comparing it to the libraries listed below
Sorting:
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- Easily view and modify JSON datasets for large language models☆86Updated 8 months ago
- ☆135Updated last month
- A fast batching API to serve LLM models☆189Updated last year
- An extension that lets the AI take the wheel, allowing it to use the mouse and keyboard, recognize UI elements, and prompt itself :3...no…☆127Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆193Updated last year
- AI management tool☆119Updated last year
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆100Updated 7 months ago
- ☆209Updated last month
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- Distributed Inference for mlx LLm☆100Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆112Updated last year
- Open source LLM UI, compatible with all local LLM providers.☆177Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆266Updated 10 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆49Updated 3 months ago
- This small API downloads and exposes access to NeuML's txtai-wikipedia and full wikipedia datasets, taking in a query and returning full …☆103Updated 5 months ago
- ☆51Updated 11 months ago
- For inferring and serving local LLMs using the MLX framework☆110Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆100Updated 3 months ago
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆29Updated last year
- run ollama & gguf easily with a single command☆52Updated last year
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆81Updated this week
- A multimodal, function calling powered LLM webui.☆216Updated last year
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆44Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆86Updated last year
- automatically quant GGUF models☆219Updated last month
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆67Updated last year
- ☆30Updated last year