lucyknada / detective-needle-llm
☆10 Updated last month
Related projects:
- ☆101 Updated 6 months ago
- Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆36 Updated 7 months ago
- Embed anything. ☆30 Updated 3 months ago
- Scripts to create your own moe models using mlx ☆86 Updated 6 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated 3 months ago
- idea: https://github.com/nyxkrage/ebook-groupchat/ ☆77 Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours ☆53 Updated last month
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retrieve message formatting, add more mo… ☆11 Updated 8 months ago
- ☆37 Updated 2 months ago
- Very basic framework for parameterized large language model (Q)LoRa fine-tuning using mlx, mlx_lm, and OgbujiPT. Architecture for system… ☆32 Updated last month
- an implementation of Self-Extend, to expand the context window via grouped attention ☆117 Updated 8 months ago
- PyGPTPrompt: A CLI tool that manages context windows for AI models, facilitating user interaction and data ingestion for optimized long-t… ☆28 Updated 4 months ago
- A simple experiment on letting two local LLMs have a conversation about anything! ☆81 Updated 2 months ago
- ☆29 Updated 4 months ago
- Low-Rank adapter extraction for fine-tuned transformers model ☆154 Updated 4 months ago
- run ollama & gguf easily with a single command ☆46 Updated 4 months ago
- automatically quant GGUF models ☆119 Updated this week
- ☆50 Updated 3 months ago
- ☆64 Updated 3 months ago
- Experimental sampler to make LLMs more creative ☆29 Updated last year
- This repo is for handling Question Answering, especially for Multi-hop Question Answering ☆59 Updated 9 months ago
- A super simple web interface to perform blind tests on LLM outputs. ☆24 Updated 6 months ago
- auto fine tune of models with synthetic data ☆71 Updated 7 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆25 Updated 11 months ago
- Just a bunch of benchmark logs for different LLMs ☆112 Updated last month
- StructuredRAG Benchmarker ☆85 Updated last week
- Local LLM inference & management server with built-in OpenAI API ☆30 Updated 5 months ago
- Self-hosted LLM chatbot arena, with yourself as the only judge ☆36 Updated 7 months ago
- ☆31 Updated 2 months ago
- ☆21 Updated this week