lucyknada / detective-needle-llmLinks
☆12Updated last year
Alternatives and similar repositories for detective-needle-llm
Users that are interested in detective-needle-llm are comparing it to the libraries listed below
Sorting:
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆63Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours☆65Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- ☆30Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆67Updated last year
- ☆102Updated last year
- Scripts to create your own moe models using mlx☆90Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 11 months ago
- ☆51Updated last year
- ☆116Updated 10 months ago
- TLS & API keys for your LLM APIs☆18Updated 10 months ago
- The DPAB-α Benchmark☆30Updated 9 months ago
- ☆62Updated 3 months ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago
- Modified Beam Search with periodical restart☆12Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆42Updated 4 months ago
- A high performance batching router optimises max throughput for text inference workload☆16Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- ☆67Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆81Updated this week
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆102Updated 10 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆118Updated last year
- ☆49Updated last year
- Zero-trust AI APIs for easy and private consumption of open-source LLMs☆39Updated last year
- A Python library to orchestrate LLMs in a neural network-inspired structure☆51Updated last year
- Pivotal Token Search☆130Updated 3 months ago
- Simple examples using Argilla tools to build AI☆56Updated 11 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆95Updated 5 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 11 months ago
- Distributed Inference for mlx LLm☆97Updated last year