tairov / lamatune

LLama implementations benchmarking framework

☆12

Alternatives and similar repositories for lamatune

Users who are interested in lamatune are comparing it to the libraries listed below.

FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66 Updated last year
kyegomez / Exa
Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…
☆27 Updated 8 months ago
distantmagic / structured
Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp
☆45 Updated last year
kanpuriyanawab / picograd
Rust Implementation of micrograd
☆52 Updated last year
spirobel / bunny-llama
iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh
☆50 Updated last year
dylibso / chainsocket
Proof of concept for a generative AI application framework powered by WebAssembly and Extism
☆14 Updated last year
Narsil / hf-chat
☆26 Updated 7 months ago
iamlemec / bert.cpp
GGML implementation of BERT model with Python bindings and quantization.
☆55 Updated last year
huggingface / ember
ANE accelerated embedding models!
☆18 Updated 7 months ago
fxnai / fxn
Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore
☆64 Updated last week
guidance-ai / llgtrt
TensorRT-LLM server with Structured Outputs (JSON) built with Rust
☆55 Updated 2 months ago
Vaibhavs10 / fast-llm.rs
☆138 Updated last year
Oxen-AI / GRPO-With-Cargo-Feedback
This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback
☆97 Updated 4 months ago
Maximilian-Winter / llama_cpp_function_calling
☆31 Updated last year
dorjeduck / momograd
A Learning Journey: Micrograd in Mojo 🔥
☆61 Updated 8 months ago
mzbac / mlx-lora
☆38 Updated last year
hyparam / hyllama
llama.cpp gguf file parser for javascript
☆43 Updated 7 months ago
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58 Updated last year
charlesfrye / cuda-substrings
Because it's there.
☆16 Updated 9 months ago
codelion / pts
Pivotal Token Search
☆109 Updated last week
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆24 Updated 7 months ago
kyutai-labs / moshi-webrtc
Proof of concept for running moshi/hibiki using webrtc
☆20 Updated 4 months ago
jimexist / surya-rs
Rust implementation of Surya
☆58 Updated 4 months ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20 Updated 3 months ago
ivanleomk / modal-grpo
☆20 Updated 3 months ago
zhuzilin / faster-nougat
Implementation of nougat that focuses on processing pdf locally.
☆81 Updated 5 months ago
simonw / llm-mlx-llama
Run Llama 2 using MLX on macOS
☆34 Updated last year
samuel-vitorino / lm.rs-webui
Light WebUI for lm.rs
☆24 Updated 8 months ago
nath1295 / MLX-Textgen
A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.
☆89 Updated 2 weeks ago
jbarrow / tinyhnsw
build your own vector database -- the littlest hnsw
☆61 Updated 6 months ago