avocardio / Zicklein
Finetuning instruct-LLaMA on German datasets.
☆34 Updated last year
Alternatives and similar repositories for Zicklein
Users who are interested in Zicklein are comparing it to the libraries listed below.
- A guidance compatibility layer for llama-cpp-python ☆35 Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto… ☆43 Updated last year
- Simple, fast, parallel Hugging Face GGML model downloader written in Python ☆24 Updated last year
- ☆55 Updated 2 years ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API ☆45 Updated 9 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆147 Updated last year
- ☆38 Updated last year
- Run embedding models using ONNX ☆34 Updated last year
- ☆25 Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆35 Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated 2 years ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate code ☆44 Updated 2 years ago
- Command-line script for inferencing from models such as falcon-7b-instruct ☆75 Updated 2 years ago
- Experimental LLM Inference UX to aid in creative writing ☆114 Updated 6 months ago
- Real-time Fallacy Detection using OpenAI whisper and ChatGPT/LLaMA/Mistral ☆115 Updated last year
- ☆73 Updated last year
- PanML is a high level generative AI/ML development and analysis library designed for ease of use and fast experimentation. ☆116 Updated last year
- Local LLM inference & management server with built-in OpenAI API ☆31 Updated last year
- Full finetuning of large language models without large memory requirements ☆94 Updated last year
- ☆31 Updated last year
- Experimental sampler to make LLMs more creative ☆31 Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆66 Updated 7 months ago
- Patch for MPT-7B which allows using and training a LoRA ☆58 Updated 2 years ago
- ☆29 Updated 8 months ago
- LLM plugin for clustering embeddings ☆76 Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT ☆27 Updated last year
- create workflows with LLMs ☆54 Updated 10 months ago
- ☆22 Updated last year
- Draft42 - Streamlit chatbot with function calling ☆32 Updated last year
- A web-app to explore topics using LLM (less typing and more clicks) ☆67 Updated last year