anordin95 / run-llama-locally
Run and explore Llama models locally with minimal dependencies on CPU
☆181 Updated last month
Related projects
Alternatives and complementary repositories for run-llama-locally
- ☆223 Updated last month
- Financial instrument definitions built with Python and Pydantic ☆188 Updated 2 weeks ago
- ☆162 Updated 4 months ago
- This is a Python implementation for stitching images. ☆226 Updated last month
- Generate Cool-Looking Mazes and Animations Illustrating the A* Pathfinding Algorithm ☆172 Updated 3 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆250 Updated 11 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient … ☆208 Updated last month
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆271 Updated last week
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆268 Updated last month
- ☆155 Updated last week
- minimal yet working VPN daemon for Linux ☆107 Updated last week
- Lamport's Bakery Algorithm Demonstrated in Python ☆95 Updated 9 months ago
- Sequential Logic ☆68 Updated last week
- Better Bookmarks Search w/ Transformers ☆192 Updated 9 months ago
- Detect whether or not an audio file was generated by NotebookLM ☆119 Updated 2 weeks ago
- ai for jq ☆234 Updated last month
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs ☆84 Updated last month
- Deep learning accelerator architectures requiring half the multipliers ☆262 Updated 7 months ago
- Web-Development 41, a static web server with live-reload ☆138 Updated last week
- CLI based packet reader in Python. ☆100 Updated 2 months ago
- ☆92 Updated 2 weeks ago
- An open source alternative to some of Slack AI's premium features. Summarize channels and threads any time you want. ☆152 Updated 3 weeks ago
- LLM Analytics ☆613 Updated 3 weeks ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more. ☆106 Updated 11 months ago
- Optimally allocate poker chips using constrained, nonlinear optimization ☆173 Updated last month
- ☆260 Updated 7 months ago
- ✨ rudimentary simulation of the three-body problem ☆152 Updated 7 months ago
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library ☆789 Updated this week
- Ask GPT to run a command ☆193 Updated 2 months ago
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl… ☆138 Updated this week