thekevinscott / vicuna-7b
Vicuna 7B is a large language model that runs in the browser. It exposes programmatic access with minimal configuration.
☆20 · Updated 2 years ago
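The description above mentions programmatic access from the browser; a minimal usage sketch might look like the following. The package name `vicuna-7b`, the `Vicuna7B` class, and the `load`/`generate`/`onToken` names are illustrative assumptions, not taken from the repository's documentation.

```typescript
// Hypothetical sketch: the package name 'vicuna-7b', the Vicuna7B class,
// and the load/generate/onToken names are assumptions for illustration,
// not the repo's documented API.
import { Vicuna7B } from 'vicuna-7b';

async function main(): Promise<void> {
  // Fetch and initialize the model weights in the browser (browser LLM
  // runtimes typically back this with WebGPU or WASM).
  const model = new Vicuna7B();
  await model.load();

  // Stream tokens as they are generated, then use the full completion.
  const reply = await model.generate('What is a 7B-parameter model?', {
    maxTokens: 128,
    onToken: (token: string) => console.log(token),
  });
  console.log('Full reply:', reply);
}

main().catch(console.error);
```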
Alternatives and similar repositories for vicuna-7b
Users interested in vicuna-7b are comparing it to the libraries listed below.
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub ☆160 · Updated 2 years ago
- Advanced Ultra-Low Bitrate Compression Techniques for the LLaMA Family of LLMs ☆110 · Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆129 · Updated 2 years ago
- PB-LLM: Partially Binarized Large Language Models ☆156 · Updated 2 years ago
- AI Assistant running within your browser. ☆76 · Updated 11 months ago
- HuggingChat-like UI in Gradio ☆70 · Updated 2 years ago
- [ICLR 2024] Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation ☆180 · Updated last year
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ ☆102 · Updated 2 years ago
- LLM Chat is an open-source serverless alternative to ChatGPT. ☆35 · Updated last year
- Experiments with BitNet inference on CPU ☆54 · Updated last year
- ☆62 · Updated 10 months ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fast ☆150 · Updated last year
- ☆52 · Updated last year
- Repo hosting codes and materials related to speeding LLMs' inference using token merging. ☆37 · Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆53 · Updated last year
- Train your own small bitnet model ☆74 · Updated last year
- Data preparation code for Amber 7B LLM ☆93 · Updated last year
- Zero-trust AI APIs for easy and private consumption of open-source LLMs ☆40 · Updated last year
- ☆67 · Updated last year
- ☆36 · Updated last year
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily. ☆182 · Updated 7 months ago
- ☆53 · Updated last year
- Merge Transformers language models by use of gradient parameters. ☆208 · Updated last year
- Ultra Fast Multi-Modality Vector Database ☆17 · Updated last year
- GGUF Quantization of any LLM. ☆41 · Updated last year
- LLM finetuning ☆41 · Updated 2 years ago
- A repository dedicated to evaluating the performance of quantized LLaMA3 using various quantization methods. ☆196 · Updated 10 months ago
- A simplified version of Google's Gemma model to be used for learning ☆26 · Updated last year
- Host the GPTQ model using AutoGPTQ as an API that is compatible with text generation UI API. ☆90 · Updated 2 years ago
- GPT-2 small trained on phi-like data ☆67 · Updated last year