namtranase / gemma-cpp-pythonLinks

A Python wrapper for gemma.cpp

☆51

Alternatives and similar repositories for gemma-cpp-python

Users that are interested in gemma-cpp-python are comparing it to the libraries listed below

Sorting:

NousResearch / Obsidian
Maybe the new state of the art vision model? we'll see 🤷‍♂️
☆163Updated last year
Locutusque / TPU-Alignment
Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free
☆231Updated 7 months ago
namtranase / terminalmind
Friendly Terminal Assistant for Developers
☆17Updated last year
trzy / llava-cpp-server
LLaVA server (llama.cpp).
☆179Updated last year
astramind-ai / BitMat
An efficent implementation of the method proposed in "The Era of 1-bit LLMs"
☆153Updated 7 months ago
abetlen / ggml-python
Python bindings for ggml
☆141Updated 9 months ago
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆67Updated 11 months ago
cognitivecomputations / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆238Updated last year
teknium1 / ShareGPT-Builder
☆114Updated 5 months ago
sshh12 / multi_token
Embed arbitrary modalities (images, audio, documents, etc) into large language models.
☆184Updated last year
rmihaylov / mpttune
Tune MPTs
☆84Updated last year
huggingface / optimum-tpu
Google TPU optimizations for transformers models
☆112Updated 4 months ago
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆71Updated 7 months ago
cognitivecomputations / kraken
☆66Updated last year
migtissera / Sensei
Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI
☆222Updated last year
cognitivecomputations / grokadamw
☆130Updated 9 months ago
adithya-s-k / YoloGemma
Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…
☆80Updated last year
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆78Updated last year
VITA-Group / Q-GaLore
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
☆199Updated 10 months ago
sumo43 / loopvlm
run paligemma in real time
☆131Updated last year
edgarGracia / gradio_image_annotator
A Gradio component that can be used to annotate images with bounding boxes.
☆52Updated 3 months ago
cognitivecomputations / OpenChatML
☆157Updated 10 months ago
euclaise / supertrainer2000
☆49Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆171Updated last year
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆147Updated last year
yvrjsharma / HugginFace_Gradio
☆70Updated last month
jllllll / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆63Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆93Updated last year
jquesnelle / transformers-openai-api
An OpenAI Completions API compatible server for NLP transformers models
☆65Updated last year
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 9 months ago