gsuuon / ad-llama
Structured inference with Llama 2 in your browser
☆52 · Updated 3 weeks ago
Related projects
Alternatives and complementary repositories for ad-llama
- LLMs as Collaboratively Edited Knowledge Bases ☆43 · Updated 9 months ago
- ☆31 · Updated last year
- ☆61 · Updated 9 months ago
- Compression for Foundation Models ☆19 · Updated 3 weeks ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 · Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆63 · Updated 11 months ago
- Generate & Stream Patches of Changes and Save Tokens and Shortens Response Time ☆13 · Updated 5 months ago
- ☆34 · Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 · Updated 10 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files. ☆45 · Updated 3 months ago
- GRDN.AI app for garden optimization ☆69 · Updated 9 months ago
- One Line To Build Zero-Data Classifiers in Minutes ☆33 · Updated last month
- Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents ☆110 · Updated 5 months ago
- Experiments on speculative sampling with Llama models ☆118 · Updated last year
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆37 · Updated last year
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs. ☆74 · Updated last month
- ☆43 · Updated 4 months ago
- A guidance compatibility layer for llama-cpp-python ☆34 · Updated last year
- Track the progress of LLM context utilisation ☆53 · Updated 4 months ago
- A simple library for working with Hugging Face models. ☆15 · Updated 2 months ago
- Implementation of nougat that focuses on processing pdf locally. ☆73 · Updated 6 months ago
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories ☆18 · Updated this week
- Repository for CPU Kernel Generation for LLM Inference ☆25 · Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆44 · Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh ☆46 · Updated last year
- Understanding the correlation between different LLM benchmarks ☆29 · Updated 10 months ago
- RWKV-7: Surpassing GPT ☆45 · Updated this week
- Generate High Quality textual or multi-modal datasets with Agents ☆17 · Updated last year
- Vector Database with support for late interaction and token level embeddings. ☆54 · Updated last month