a lightweight C++ LLaMA inference engine for mobile devices
☆15Oct 28, 2023Updated 2 years ago
Alternatives and similar repositories for mobilellama
Users that are interested in mobilellama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Codec 2 speech codec, compiled to WASM using Emscripten.☆13Apr 27, 2023Updated 3 years ago
- Run Gemini Nano locally on chrome☆24Jun 27, 2024Updated last year
- H264 encoder + MP4 output for the web☆15Dec 4, 2020Updated 5 years ago
- ☆17Jun 7, 2025Updated 11 months ago
- ☆14Oct 18, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- AMD 0.9B efficient text to video diffusion model☆46Apr 27, 2026Updated 2 weeks ago
- A Game Engine for J2ME Platform☆10Mar 13, 2015Updated 11 years ago
- ☆12Feb 8, 2025Updated last year
- A complete guide in generating text using bert and fine-tune☆11Feb 25, 2026Updated 2 months ago
- source code for project instinct website☆15Feb 4, 2025Updated last year
- Quantize transformers to any learned arbitrary 4-bit numeric format☆55Apr 13, 2026Updated 3 weeks ago
- A JavaScript and TypeScript port of PyTorch C++ library (libtorch) - Node.js N-API bindings for libtorch.☆17Jan 15, 2023Updated 3 years ago
- Node.js binding for PyTorch.☆17Apr 18, 2024Updated 2 years ago
- Cordova plugin to provide FirebaseUI Authentication☆18May 30, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Llama2 inference in one TypeScript file☆20May 29, 2025Updated 11 months ago
- [ICRA 2024]This is the official repo of paper "HR-APR: APR-agnostic Framework with Uncertainty Estimation and Hierarchical Refinement for…☆11Feb 10, 2025Updated last year
- A base64 encoder/decoder with gzip or deflate abilities.☆38Oct 30, 2024Updated last year
- ☆35Oct 9, 2025Updated 7 months ago
- ☆20Apr 8, 2023Updated 3 years ago
- LLM training in simple, C++/CUDA(with Eigen3)☆17Sep 1, 2024Updated last year
- [ICLR'25] "Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers"☆43Mar 31, 2025Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 10 months ago
- TensorflowTTS in Tensorflow.js☆18Aug 11, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Minimalist <video>/<audio> plugin for markdown-it, using image syntax☆26Oct 14, 2025Updated 6 months ago
- Cloud Native Distributed Nearest Neighbour Search☆15Jun 9, 2020Updated 5 years ago
- Unsupervised Word Discovery☆10Jul 26, 2019Updated 6 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- rabitq rust implementation☆10Apr 23, 2026Updated 2 weeks ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆32Oct 2, 2025Updated 7 months ago
- Package vecf32 provides common functions and methods for slices of float32☆13Jun 14, 2023Updated 2 years ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Full Javascript implementation of libvpx vp8 decoder.☆26Aug 10, 2020Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Gulp plugin that creates all the support needed in serviceworkers to make your web app run offline☆29May 18, 2017Updated 8 years ago
- mPLM-Sim: Better Cross-Lingual Similarity and Transfer in Multilingual Pretrained Language Models☆11Jan 19, 2024Updated 2 years ago
- Notebooks for docarray, Jina, Finetuner, and other products from Jina AI☆12Mar 31, 2022Updated 4 years ago
- TSDG: An efficient index graph for graph-based nearest neighbor search☆10Jul 14, 2022Updated 3 years ago
- ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs (ICCV 2023)☆23Sep 13, 2024Updated last year
- Official code for "DexRepNet++: Learning Dexterous Robotic Manipulation With Geometric and Spatial Hand-Object Representations" (T-RO 202…☆31Feb 25, 2026Updated 2 months ago
- A PyTorch implementation of Proxy Anchor Loss based on CVPR 2020 paper "Proxy Anchor Loss for Deep Metric Learning"☆11Jan 16, 2021Updated 5 years ago