tairov / lamatune
LLama implementations benchmarking framework
☆12 Updated last year
Alternatives and similar repositories for lamatune
Users who are interested in lamatune are comparing it to the libraries listed below.
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆27 Updated 8 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆45 Updated last year
- Rust Implementation of micrograd ☆52 Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh ☆50 Updated last year
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism ☆14 Updated last year
- ☆26 Updated 7 months ago
- GGML implementation of BERT model with Python bindings and quantization. ☆55 Updated last year
- ANE accelerated embedding models! ☆18 Updated 7 months ago
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore ☆64 Updated last week
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆55 Updated 2 months ago
- ☆138 Updated last year
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback ☆97 Updated 4 months ago
- ☆31 Updated last year
- A Learning Journey: Micrograd in Mojo 🔥 ☆61 Updated 8 months ago
- ☆38 Updated last year
- llama.cpp gguf file parser for javascript ☆43 Updated 7 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆58 Updated last year
- Because it's there. ☆16 Updated 9 months ago
- Pivotal Token Search ☆109 Updated last week
- Nexusflow function call, tool use, and agent benchmarks. ☆24 Updated 7 months ago
- Proof of concept for running moshi/hibiki using webrtc ☆20 Updated 4 months ago
- Rust implementation of Surya ☆58 Updated 4 months ago
- Using modal.com to process FineWeb-edu data ☆20 Updated 3 months ago
- ☆20 Updated 3 months ago
- Implementation of nougat that focuses on processing pdf locally. ☆81 Updated 5 months ago
- Run Llama 2 using MLX on macOS ☆34 Updated last year
- Light WebUI for lm.rs ☆24 Updated 8 months ago
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX. ☆89 Updated 2 weeks ago
- build your own vector database -- the littlest hnsw ☆61 Updated 6 months ago