namtranase / gemma-cpp-python
A Python wrapper for gemma.cpp
β47Updated 9 months ago
Alternatives and similar repositories for gemma-cpp-python:
Users that are interested in gemma-cpp-python are comparing it to the libraries listed below
- Friendly Terminal Assistant for Developersβ15Updated 10 months ago
- Maybe the new state of the art vision model? we'll see π€·ββοΈβ159Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for freeβ225Updated 3 months ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.β46Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'β232Updated 8 months ago
- Python bindings for ggmlβ136Updated 4 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRAβ123Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMsβ77Updated 9 months ago
- Framework agnostic computer vision inference. Run 1000+ models by changing only one line of code. Supports models from transformers, timmβ¦β130Updated 2 months ago
- Google TPU optimizations for transformers modelsβ90Updated last week
- Low-Rank adapter extraction for fine-tuned transformers modelsβ167Updated 8 months ago
- Full finetuning of large language models without large memory requirementsβ93Updated last year
- An OpenAI Completions API compatible server for NLP transformers modelsβ63Updated last year
- Set of scripts to finetune LLMsβ36Updated 10 months ago
- 1.58-bit LLaMa modelβ80Updated 9 months ago
- β196Updated 8 months ago
- Embed arbitrary modalities (images, audio, documents, etc) into large language models.β177Updated 10 months ago
- whisper.cpp bindings for pythonβ85Updated last year
- β54Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAIβ222Updated 9 months ago
- LLaVA server (llama.cpp).β176Updated last year
- β65Updated 8 months ago
- Tune MPTsβ84Updated last year
- Let's create synthetic textbooks together :)β73Updated last year
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.β82Updated last year
- An efficent implementation of the method proposed in "The Era of 1-bit LLMs"β154Updated 3 months ago
- experiments with inference on llamaβ104Updated 7 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β79Updated 8 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β200Updated 3 months ago
- inference code for mixtral-8x7b-32kseqlenβ99Updated last year