tairov / lamatune
LLama implementations benchmarking framework
☆12Updated last year
Alternatives and similar repositories for lamatune:
Users that are interested in lamatune are comparing it to the libraries listed below
- ☆26Updated 4 months ago
- ANE accelerated embedding models!☆16Updated 5 months ago
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated last year
- KAN (Kolmogorov–Arnold Networks) in the MLX framework for Apple Silicon☆16Updated last week
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 8 months ago
- The Swarm Ecosystem☆20Updated 9 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆48Updated last year
- ☆38Updated last year
- ☆15Updated last year
- ☆18Updated last month
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆12Updated 6 months ago
- Training hybrid models for dummies.☆21Updated 3 months ago
- ☆31Updated last year
- ☆19Updated last month
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated 11 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last month
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆49Updated last year
- First token cutoff sampling inference example☆30Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Light WebUI for lm.rs☆23Updated 6 months ago
- Proof of concept for running moshi/hibiki using webrtc☆18Updated 2 months ago
- Run Llama 2 using MLX on macOS☆34Updated last year
- Tensor library for Zig☆12Updated 5 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆52Updated 3 months ago
- Because it's there.☆16Updated 7 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- ☆13Updated last year