unum-cloud / examples
Learning Unum's efficient data-processing tools one cool project at a time
☆12 · Updated 2 years ago
Alternatives and similar repositories for examples
Users who are interested in examples are comparing it to the libraries listed below.
- A list of awesome resources and blogs on topics related to Unum ☆40 · Updated 9 months ago
- Iterate quickly with llama.cpp hot reloading. Use the llama.cpp bindings with bun.sh ☆51 · Updated last year
- ☆11 · Updated 6 months ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp ☆45 · Updated last year
- Converts JSON-Schema to GBNF grammar to use with llama.cpp ☆55 · Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆26 · Updated last year
- A library for incremental loading of large PyTorch checkpoints ☆56 · Updated 2 years ago
- A cog implementation of Nvidia's Triton server ☆17 · Updated 9 months ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan… ☆22 · Updated 4 months ago
- Proof of concept for a generative AI application framework powered by WebAssembly and Extism ☆14 · Updated 2 years ago
- Python bindings to llama.cpp ☆27 · Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 · Updated last year
- A library for working with GBNF files ☆24 · Updated this week
- ☆15 · Updated last year
- ☆39 · Updated 3 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing. ☆42 · Updated 3 weeks ago
- The fastest ACID-transactional persisted Key-Value store designed as modified LSM-Tree for NVMe block-devices with GPU-acceleration and S… ☆74 · Updated 2 years ago
- A library for building software agents using behavior trees and language models. ☆83 · Updated 6 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆58 · Updated last year
- Using Large Language Models for Repo-wide Type Prediction ☆111 · Updated last year
- GGML implementation of BERT model with Python bindings and quantization. ☆57 · Updated last year
- ☆59 · Updated 4 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆13 · Updated last year
- Implementation of https://arxiv.org/pdf/2312.09299 ☆21 · Updated last year
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆164 · Updated this week
- GPT-2 inference engine written in Zig ☆39 · Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Updated 9 months ago
- LLM-powered autonomous agent with hierarchical task management ☆50 · Updated 2 years ago
- Create embeddings for LLM using the Nomic API ☆24 · Updated 8 months ago
- ☆12 · Updated 10 months ago