gsuuon / ad-llama
Structured inference with Llama 2 in your browser
☆52 Updated 8 months ago
Alternatives and similar repositories for ad-llama
Users who are interested in ad-llama are comparing it to the libraries listed below.
- LLMs as Collaboratively Edited Knowledge Bases ☆45 Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- ☆62 Updated last year
- A re-implementation of Meta-Prompt in LangChain for building self-improving agents. ☆63 Updated 2 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, … ☆40 Updated last year
- Implementation of nougat that focuses on processing pdf locally. ☆81 Updated 6 months ago
- Website with current metrics on the fastest AI models. ☆41 Updated 8 months ago
- Latent Large Language Models ☆18 Updated 10 months ago
- Pivotal Token Search ☆109 Updated last week
- ☆49 Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆63 Updated last year
- OpenAI GPT hosted Agent Framework for Windows and MacOS ☆36 Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆55 Updated 5 months ago
- A library for building software agents using behavior trees and language models. ☆81 Updated 5 months ago
- ☆54 Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform ☆87 Updated 3 weeks ago
- Replace expensive LLM calls with finetunes automatically ☆65 Updated last year
- LLM plugin for models hosted by Anyscale Endpoints ☆33 Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- Benchmarking suite for popular AI APIs ☆87 Updated 5 months ago
- ☆87 Updated 5 months ago
- utilities for loading and running text embeddings with onnx ☆44 Updated 11 months ago
- Experiments on speculative sampling with Llama models ☆128 Updated 2 years ago
- ReLM is a Regular Expression engine for Language Models ☆106 Updated 2 years ago
- Fast approximate inference on a single GPU with sparsity aware offloading ☆38 Updated last year
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW. ☆144 Updated last year
- GRDN.AI app for garden optimization ☆70 Updated last year
- ☆89 Updated 9 months ago
- ☆75 Updated last year