Structured inference with Llama 2 in your browser
☆52Nov 1, 2024Updated last year
Alternatives and similar repositories for ad-llama
Users that are interested in ad-llama are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Plug n Play GBNF Compiler for llama.cpp☆32Nov 8, 2023Updated 2 years ago
- javascript multivariate data visualization☆14Jan 10, 2017Updated 9 years ago
- Just-in-time Dynamic Batching with MXNet Gluon.☆52May 18, 2020Updated 6 years ago
- System for automated integration of deep learning backends.☆47Aug 15, 2022Updated 3 years ago
- Home for OctoML PyTorch Profiler☆113Apr 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Vercel and web-llm template to run wasm models directly in the browser.☆174Apr 17, 2026Updated last month
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆18Oct 13, 2025Updated 7 months ago
- A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer☆96Feb 20, 2026Updated 3 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆18Dec 22, 2025Updated 5 months ago
- ☆192Mar 28, 2023Updated 3 years ago
- Host LLM via text-generation-inference☆16Dec 5, 2023Updated 2 years ago
- TensorFlow and TVM integration☆36Apr 27, 2020Updated 6 years ago
- Experimental Vega Dataflow Visualization☆21Jul 28, 2016Updated 9 years ago
- use solid-js Components inside react☆28Sep 2, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.☆11May 12, 2023Updated 3 years ago
- OmniByteFormer is a generalized Transformer model that can process any type of data by converting it into byte sequences, bypassing tradi…☆16Updated this week
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Apr 22, 2016Updated 10 years ago
- YourAICHAT☆13Aug 16, 2023Updated 2 years ago
- ☆175May 11, 2026Updated 2 weeks ago
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding edge performance.☆20Nov 11, 2024Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Sep 26, 2023Updated 2 years ago
- An experimental ahead of time compiler for Relay.☆49Apr 21, 2020Updated 6 years ago
- Typescript utilities for input validation, with emphasis on security☆19Jan 3, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆27Oct 13, 2025Updated 7 months ago
- ☆122Apr 22, 2024Updated 2 years ago
- python interface for mlc chat cli☆14May 7, 2023Updated 3 years ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆20Oct 13, 2025Updated 7 months ago
- Extracts static code features from opencl kernels to be used for machine learning.☆10Apr 30, 2021Updated 5 years ago
- Web service for image file/image URL classification without uploading.☆16May 27, 2022Updated 4 years ago
- Browser-compatible Client Version of Model Context Protocol implementation for TypeScript (Fork from official MCP)☆31Apr 20, 2025Updated last year
- [CVPR 2026] 3D-Fixer: Coarse-to-Fine In-place Completion for 3D Scenes from a Single Image☆72Apr 7, 2026Updated last month
- ☆14May 9, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- PET: Optimizing Tensor Programs with Partially Equivalent Transformations and Automated Corrections☆126Jun 23, 2022Updated 3 years ago
- ☆11Jun 7, 2023Updated 2 years ago
- A preliminary platform for up to 1 million reinforcement learning agents☆11Aug 27, 2017Updated 8 years ago
- ☆42Sep 8, 2023Updated 2 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆133Jul 6, 2023Updated 2 years ago
- A curated collection of papers and related projects on using LLMs for privacy.☆31Oct 8, 2025Updated 7 months ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago