recmo / criaLinks

Tiny inference-only implementation of LLaMA

☆93

Alternatives and similar repositories for cria

Users that are interested in cria are comparing it to the libraries listed below

Sorting:

MF-FOOM / wikivec2text
Simple embedding -> text model trained on a small subset of Wikipedia sentences.
☆156Updated 2 years ago
Narsil / fast_gpt2
☆156Updated 2 years ago
taylorai / onnx_embedding_models
utilities for loading and running text embeddings with onnx
☆44Updated last year
kolinko / effort
An implementation of bucketMul LLM inference
☆221Updated last year
kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
sdan / selfextend
an implementation of Self-Extend, to expand the context window via grouped attention
☆119Updated last year
teknium1 / RawTransform
A repository of prompts and Python scripts for intelligent transformation of raw text into diverse formats.
☆30Updated 2 years ago
danielgross / ggml-k8s
Run GGML models with Kubernetes.
☆173Updated last year
vikhyat / mixtral-inference
inference code for mixtral-8x7b-32kseqlen
☆101Updated last year
Dicklesworthstone / fast_vector_similarity
The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.
☆397Updated 5 months ago
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆74Updated last year
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
567-labs / fastllm
A collection of LLM services you can self host via docker or modal labs to support your applications development
☆192Updated last year
RobertRiachi / nanoPALM
☆143Updated 2 years ago
definitive-io / code-indexer-loop
Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo…
☆175Updated last year
thesephist / spectre
Sparse autoencoders for Contra text embedding models
☆25Updated last year
teknium1 / stanford_alpaca-replit
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆41Updated 2 years ago
kayvr / token-hawk
WebGPU LLM inference tuned by hand
☆151Updated 2 years ago
teknium1 / transformers-gptq-quant
☆47Updated last year
SpellcraftAI / turing
Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.
☆58Updated last year
FL33TW00D / embd
GPU accelerated client-side embeddings for vector search, RAG etc.
☆66Updated last year
abacaj / openhermes-function-calling
☆135Updated last year
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆100Updated 2 years ago
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆286Updated last month
jiggy-ai / hnsqlite
hnsqlite integrates hnswlib and sqlite for simple text embedding search
☆161Updated 2 years ago
BerriAI / instructprompt
☆107Updated 2 years ago
jayhack / llm.sh
GPT-3 on your command line
☆131Updated 2 years ago
r2d4 / parserllm
Use context-free grammars with an LLM
☆170Updated last year
gbt42 / redactAI
Private inference over your sensitive data with off-the-shelf models
☆35Updated 2 years ago