jozsefszalma / homelabLinks
The bare metal in my basement
☆11Updated 9 months ago
Alternatives and similar repositories for homelab
Users that are interested in homelab are comparing it to the libraries listed below
Sorting:
- Extensive introductory writeup on Zig language functionalities☆10Updated last year
- Scripts to create your own moe models using mlx☆90Updated last year
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆53Updated last year
- ☆12Updated 11 months ago
- Demo of an "always-on" AI assistant.☆24Updated last year
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Embedding models from Jina AI☆64Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated last year
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- A super simple web interface to perform blind tests on LLM outputs.☆28Updated last year
- inference code for mixtral-8x7b-32kseqlen☆101Updated last year
- LLaVA server (llama.cpp).☆182Updated last year
- ☆20Updated last year
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆29Updated 5 months ago
- Pivotal Token Search☆123Updated last month
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 4 months ago
- Scrape details about Code Interpreter to track any changes☆69Updated 4 months ago
- Port of Facebook's LLaMA model in C/C++☆22Updated last year
- Tools for the LLaMA language model☆12Updated 2 years ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- ☆22Updated 2 years ago
- Fast inference of Instruct tuned LLaMa on your personal devices.☆22Updated 2 years ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆52Updated last year
- utilities for loading and running text embeddings with onnx☆44Updated 2 weeks ago
- ☆40Updated 2 years ago
- Cerule - A Tiny Mighty Vision Model☆68Updated 11 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆56Updated last year
- Improve prompts for e.g. GPT3 and GPT-J using templates and hyperparameter optimization.☆42Updated 2 years ago
- Proxy server for triton gRPC server that inferences embedding model in Rust☆21Updated last year