ggml-org / ci
CI for ggml and related projects
☆28Updated this week
Alternatives and similar repositories for ci
Users that are interested in ci are comparing it to the libraries listed below
Sorting:
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- AirLLM 70B inference with single 4GB GPU☆12Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆26Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Tensor library for machine learning☆17Updated last year
- ☆19Updated last month
- Port of Facebook's LLaMA model in C/C++☆20Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated 11 months ago
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- Rust crate for some audio utilities☆23Updated 2 months ago
- The Swarm Ecosystem☆20Updated 9 months ago
- Training hybrid models for dummies.☆21Updated 3 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 6 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 8 months ago
- llama.cpp gguf file parser for javascript☆42Updated 5 months ago
- Command line tool for Deep Infra cloud ML inference service☆30Updated 11 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- ☆31Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆36Updated last year
- First token cutoff sampling inference example☆30Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 10 months ago
- Web browser version of StarCoder.cpp☆45Updated last year
- Proof of concept for running moshi/hibiki using webrtc☆18Updated 2 months ago
- The official Python library for Formulaic☆16Updated last year
- cortex.llamacpp is a high-efficiency C++ inference engine for edge computing. It is a dynamic library that can be loaded by any server a…☆40Updated this week
- GRDN.AI app for garden optimization☆70Updated last year
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- Access fireworks.ai models via API☆11Updated last year
- ☆24Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated last year