ggml-org / ci
CI for ggml and related projects
☆25Updated this week
Alternatives and similar repositories for ci:
Users that are interested in ci are comparing it to the libraries listed below
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 2 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- Web browser version of StarCoder.cpp☆44Updated last year
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆24Updated last year
- AirLLM 70B inference with single 4GB GPU☆12Updated 7 months ago
- Command line tool for Deep Infra cloud ML inference service☆29Updated 9 months ago
- The official Python library for Formulaic☆16Updated 11 months ago
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆36Updated this week
- A command-line utility to manage MLX models between your Hugging Face cache and LM Studio.☆32Updated last month
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 3 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆73Updated 3 months ago
- Training hybrid models for dummies.☆20Updated 2 months ago
- ☆31Updated last year
- ☆22Updated 5 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 7 months ago
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆16Updated 2 weeks ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 9 months ago
- First token cutoff sampling inference example☆29Updated last year
- Access the Cohere Command R family of models☆34Updated 2 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆59Updated 7 months ago
- ☆99Updated 7 months ago
- LangChain + LiteLLM that works☆39Updated last month
- Port of Suno AI's Bark in C/C++ for fast inference☆53Updated 11 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆28Updated 2 months ago
- Simple Implementation of a Transformer in the new framework MLX by Apple☆20Updated 4 months ago
- llama.cpp gguf file parser for javascript☆33Updated 3 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆20Updated 3 weeks ago