tairov / lamatune
LLama implementations benchmarking framework
☆12 · Updated last year
Alternatives and similar repositories for lamatune
Users who are interested in lamatune are comparing it to the repositories listed below
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism ☆14 · Updated last year
- ANE accelerated embedding models! ☆18 · Updated 6 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆17 · Updated 9 months ago
- The Swarm Ecosystem ☆21 · Updated 10 months ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance. ☆12 · Updated 7 months ago
- First token cutoff sampling inference example ☆30 · Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Updated 7 months ago
- Light WebUI for lm.rs ☆23 · Updated 8 months ago
- ☆26 · Updated 6 months ago
- Nexusflow function call, tool use, and agent benchmarks. ☆20 · Updated 6 months ago
- ☆15 · Updated last year
- ☆20 · Updated 3 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh ☆49 · Updated last year
- Using modal.com to process FineWeb-edu data ☆20 · Updated 2 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 · Updated last year
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables ☆20 · Updated last month
- Proof of concept for running moshi/hibiki using webrtc ☆19 · Updated 3 months ago
- Rust bindings for CTranslate2 ☆14 · Updated 2 years ago
- Rust Implementation of micrograd ☆52 · Updated 11 months ago
- Because it's there. ☆16 · Updated 9 months ago
- Run Python functions on desktop, mobile, web, and in the cloud. https://fxn.ai/explore ☆64 · Updated this week
- A collection of optimizers for MLX ☆36 · Updated 3 weeks ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust ☆38 · Updated last year
- ☆13 · Updated last year
- A Learning Journey: Micrograd in Mojo 🔥 ☆61 · Updated 8 months ago
- 🛠 Self-hosted, fast, and consistent remote configuration for apps. ☆15 · Updated 2 years ago
- Building large language foundational model ☆9 · Updated 3 months ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ☆33 · Updated 2 years ago
- a version of baby agi using dspy and typed predictors ☆17 · Updated last year
- ☆16 · Updated last year