grantslatton / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆45Updated last year
Related projects: ⓘ
- Use context-free grammars with an LLM☆162Updated 5 months ago
- Simple embedding -> text model trained on a small subset of Wikipedia sentences.☆152Updated last year
- Drive a browser with Cohere☆72Updated last year
- ☆61Updated last year
- [Added T5 support to TRLX] A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆47Updated last year
- Run GGML models with Kubernetes.☆172Updated 9 months ago
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆60Updated 4 months ago
- GPT-based Conversation Summarizer☆144Updated last year
- ☆171Updated last year
- Some of the scripts I use for scribepod @ https://scribepod.substack.com/, an automated AI podcast☆171Updated last year
- An HTTP serving framework by Banana☆97Updated 9 months ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆102Updated last year
- Factored Cognition Primer: How to write compositional language model programs☆48Updated last year
- ☆111Updated 7 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆63Updated 9 months ago
- automatic sentence highlights based on their significance to the document☆179Updated 9 months ago
- ☆128Updated last year
- A dictionary, but it shows you position in embedding space relative to some synonyms/antonyms instead of a definition.☆69Updated 2 months ago
- AI sends pull requests for features you request in natural language☆110Updated last year
- MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…☆143Updated this week
- Hands-free companionship on demand.☆78Updated last year
- Tiny inference-only implementation of LLaMA☆91Updated 5 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- ☆34Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆300Updated last year
- Grounding LLM mathematical reasoning with proof assistants.☆59Updated last year
- Implement recursion using English as the programming language and an LLM as the runtime.☆125Updated last year
- Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types☆384Updated 6 months ago
- ☆12Updated last year
- A production-grade framework for building AI agents.☆68Updated last year