lucyknada / detective-needle-llm
☆12Updated 7 months ago
Alternatives and similar repositories for detective-needle-llm
Users that are interested in detective-needle-llm are comparing it to the libraries listed below
Sorting:
- ☆27Updated 8 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated 11 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.☆30Updated 4 months ago
- ☆66Updated 11 months ago
- Web Interface for Vision Language Models Including InternVLM2☆21Updated 9 months ago
- Embed anything.☆28Updated 11 months ago
- TLS & API keys for your LLM APIs☆16Updated 5 months ago
- A high performance batching router optimises max throughput for text inference workload☆16Updated last year
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆22Updated 2 weeks ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆61Updated 8 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated last month
- Scripts to create your own moe models using mlx☆89Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 5 months ago
- Modified Beam Search with periodical restart☆12Updated 8 months ago
- Experimental sampler to make LLMs more creative☆31Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- Self-hosted LLM chatbot arena, with yourself as the only judge☆40Updated last year
- ☆114Updated 4 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- 5X faster 60% less memory QLoRA finetuning☆21Updated 11 months ago
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆40Updated 2 months ago
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 4 months ago
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆51Updated last week
- The DPAB-α Benchmark☆21Updated 4 months ago
- Simple examples using Argilla tools to build AI☆52Updated 5 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- GRDN.AI app for garden optimization☆70Updated last year
- Lego for GRPO☆28Updated last month