gsuuon / ad-llama
Structured inference with Llama 2 in your browser
☆52 Updated 7 months ago
Alternatives and similar repositories for ad-llama
Users who are interested in ad-llama are comparing it to the libraries listed below
- LLMs as Collaboratively Edited Knowledge Bases ☆45 Updated last year
- Latent Large Language Models ☆18 Updated 9 months ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- ☆62 Updated last year
- ☆35 Updated 2 years ago
- ReLM is a Regular Expression engine for Language Models ☆105 Updated 2 years ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- Official Repository for Task-Circuit Quantization ☆20 Updated last week
- Using Large Language Models for Repo-wide Type Prediction ☆109 Updated last year
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptations ☆33 Updated 2 years ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 4 months ago
- Understanding the correlation between different LLM benchmarks ☆29 Updated last year
- A new way to generate large quantities of high quality synthetic data (on par with GPT-4), with better controllability, at a fraction of … ☆22 Updated 8 months ago
- Implementation of nougat that focuses on processing pdf locally. ☆81 Updated 4 months ago
- GRDN.AI app for garden optimization ☆70 Updated last year
- A simple library for working with Hugging Face models. ☆14 Updated 5 months ago
- utilities for loading and running text embeddings with onnx ☆44 Updated 10 months ago
- Advanced Reasoning Benchmark Dataset for LLMs ☆46 Updated last year
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆126 Updated 6 months ago
- Repository for CPU Kernel Generation for LLM Inference ☆26 Updated last year
- AskIt (for JavaScript/TypeScript): Unified programming interface for large language models (GPT-4, GPT-3.5) ☆34 Updated last year
- Training hybrid models for dummies. ☆21 Updated 4 months ago
- Vector Database with support for late interaction and token level embeddings. ☆54 Updated 8 months ago
- ☆54 Updated last year
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus. ☆59 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆137 Updated 10 months ago
- LMQL implementation of tree of thoughts ☆34 Updated last year
- assign color hues to a collection of text fragments based on embeddings ☆20 Updated 11 months ago
- implementation of https://arxiv.org/pdf/2312.09299 ☆20 Updated 11 months ago