Inference code for LLaMA models
☆46Mar 5, 2023Updated 3 years ago
Alternatives and similar repositories for llama
Users that are interested in llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Inference code for LLaMA models☆35Mar 7, 2023Updated 3 years ago
- Inference code for facebook LLaMA models with Wrapyfi support☆128Mar 16, 2023Updated 3 years ago
- Annotation de la jurisprudence des CA Fr☆12May 4, 2018Updated 8 years ago
- Inference on CPU code for LLaMA models☆136Mar 19, 2023Updated 3 years ago
- Fork of Facebooks LLaMa model to run on CPU☆766Mar 6, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Quantized inference code for LLaMA models☆1,039Mar 17, 2023Updated 3 years ago
- Hill Space is All You Need☆17Jul 11, 2025Updated 10 months ago
- The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…☆121Apr 29, 2023Updated 3 years ago
- BFloat16 Fused Adam Operator for PyTorch☆19Nov 16, 2024Updated last year
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- Reproduction study of Grassmann Flows for sequence modeling (arXiv 2512.19428). Shows 22.6% gap vs claimed 10-15%, includes CUDA kernels …☆30Dec 26, 2025Updated 4 months ago
- ☆13Apr 17, 2024Updated 2 years ago
- Python implementation of factorization based image segmentation algorithm☆16Aug 30, 2024Updated last year
- Self-contained Python lib with zero-dependencies that give you a unified device properties for gpu, cpu, and npu. No more calling separat…☆15Apr 23, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A console based graphics engine for simple Unicode games and animations.☆11Feb 10, 2016Updated 10 years ago
- C recursive descent parser based on Ian Piumarta's peg(1)☆20Feb 4, 2014Updated 12 years ago
- Machine translation with tinygrad☆19Apr 7, 2024Updated 2 years ago
- Codebase for SIGGRAPH 2023 Paper: Simulation and Retargeting of Complex Multi-Character Interactions☆28Jul 10, 2023Updated 2 years ago
- NICE: Neurogenesis Inspired Contextual Encoding for Replay-free Class Incremental Learning☆28Jul 28, 2024Updated last year
- SDXL GPU cluster scripts☆16Oct 28, 2023Updated 2 years ago
- ☆536Dec 1, 2023Updated 2 years ago
- A C++ fork/rewrite of the smhasher project to bring Murmurhash v.3 to the Linux shell and to the PHP scripting language.☆21Jul 25, 2011Updated 14 years ago
- Vinisto - Personal domotics made simple☆10Aug 9, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Inference code for LLaMA models with Gradio Interface and rolling generation like ChatGPT☆48Mar 15, 2023Updated 3 years ago
- Turn remote MCP servers into local command workflows.☆59Feb 28, 2026Updated 2 months ago
- Physical Backed Tokens (EIP-5791) for everyone using ESP32 and BLE☆12May 16, 2024Updated 2 years ago
- This project is distributed as a free Unreal Engine Plugin. It consists in a single c++ actor component that handles the playback of anim…☆12Mar 10, 2024Updated 2 years ago
- Mars craters detection and classification RAMP starting kit☆22Dec 10, 2018Updated 7 years ago
- Wifu is a wifi data analysis tool written in Python, it is based on the output of Kismet (https://www.kismetwireless.net/) files. Wifu pa…☆10Jun 11, 2015Updated 10 years ago
- Discussion Summarization is the process of condensing a text document which is a collection of discussion threads, using CBS (Cluster Bas…☆12Apr 10, 2014Updated 12 years ago
- List all network devices with hostname and vendor☆15Dec 7, 2022Updated 3 years ago
- Fast binary matrix product on CPU☆10Feb 11, 2016Updated 10 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The project page repository for Neural Categorical Priors for Physics-Based Character Control☆91Mar 14, 2024Updated 2 years ago
- ☆11Dec 14, 2016Updated 9 years ago
- ☆14Jun 15, 2022Updated 3 years ago
- BitNet a4.8 Implementation in one file of pytorch