ggml-org / ci
CI for ggml and related projects
☆22 Updated this week
Alternatives and similar repositories for ci:
Users who are interested in ci are comparing it to the libraries listed below.
- GGML implementation of BERT model with Python bindings and quantization. ☆53 Updated 11 months ago
- A super simple web interface to perform blind tests on LLM outputs. ☆27 Updated 11 months ago
- Trying to deconstruct RWKV in understandable terms ☆14 Updated last year
- Training hybrid models for dummies. ☆20 Updated last month
- AirLLM 70B inference with single 4GB GPU ☆12 Updated 6 months ago
- Command line tool for Deep Infra cloud ML inference service ☆29 Updated 8 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth… ☆28 Updated 3 weeks ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon. ☆28 Updated 3 weeks ago
- Web browser version of StarCoder.cpp ☆43 Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO ☆27 Updated this week
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated 7 months ago
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace. ☆23 Updated 7 months ago
- ☆44 Updated 6 months ago
- GRDN.AI app for garden optimization ☆70 Updated last year
- Exploration: using technology to aid people who lack both the ability to speak and fine motor control. ☆14 Updated 3 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆29 Updated last month
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). … ☆45 Updated 4 months ago
- ☆31 Updated last year
- A library for simplifying fine tuning with multi gpu setups in the Huggingface ecosystem. ☆16 Updated 3 months ago
- Experiments with BitNet inference on CPU ☆53 Updated 10 months ago
- ☆34 Updated last year
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast ☆143 Updated 5 months ago
- First token cutoff sampling inference example ☆29 Updated last year
- Tensor library for Zig ☆10 Updated 2 months ago
- LLM-based code completion engine ☆178 Updated 3 weeks ago
- Github repo for Peifeng's internship project ☆13 Updated last year
- Port of Facebook's LLaMA model in C/C++ ☆32 Updated 11 months ago