tairov / lamatune
LLama implementations benchmarking framework
☆12Updated last year
Alternatives and similar repositories for lamatune:
Users that are interested in lamatune are comparing it to the libraries listed below
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- ANE accelerated embedding models!☆16Updated 4 months ago
- The Swarm Ecosystem☆20Updated 8 months ago
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism☆14Updated last year
- Tensor library for Zig☆11Updated 5 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Light WebUI for lm.rs☆23Updated 6 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- ☆19Updated 3 weeks ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆12Updated 5 months ago
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆48Updated last year
- ☆26Updated 4 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 7 months ago
- ☆15Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 2 weeks ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆39Updated 2 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆18Updated last month
- Example implementation of Iteration of Tought - Gives a star if you like the project☆40Updated 3 months ago
- Training hybrid models for dummies.☆20Updated 3 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- ☆34Updated this week
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆59Updated 11 months ago
- alternative way to calculating self attention☆18Updated 10 months ago
- OpenAI GPT hosted Agent Framework for Windows and MacOS☆36Updated 9 months ago
- MLX binary vectors and associated algorithms.☆14Updated last month
- An example implementation of RLHF (or, more accurately, RLAIF) built on MLX and HuggingFace.☆25Updated 10 months ago
- Very minimal (and stateless) agent framework☆42Updated 3 months ago
- Rust bindings for CTranslate2☆14Updated last year
- a version of baby agi using dspy and typed predictors☆17Updated last year