lucyknada / detective-needle-llm
☆12Updated last year
Alternatives and similar repositories for detective-needle-llm
Users that are interested in detective-needle-llm are comparing it to the libraries listed below
- Scripts to create your own moe models using mlx☆90Updated last year
- Modified Beam Search with periodical restart☆12Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything!☆112Updated last year
- ☆119Updated last year
- ☆30Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- TLS & API keys for your LLM APIs☆18Updated 3 weeks ago
- ☆32Updated last year
- Ongoing research training transformer models at scale☆38Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.☆43Updated 6 months ago
- Thematic Generalization Benchmark: measures how effectively various LLMs can infer a narrow or specific "theme" (category/rule) from a sm…☆63Updated 3 months ago
- ☆20Updated last year
- ☆50Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- An OpenAI API compatible LLM inference server based on ExLlamaV2.☆25Updated last year
- ☆101Updated last year
- GPT-4 Level Conversational QA Trained In a Few Hours☆66Updated last year
- ☆68Updated last year
- run ollama & gguf easily with a single command☆52Updated last year
- ☆62Updated 6 months ago
- Example implementation of Iteration of Thought - Give a star if you like the project☆41Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.☆32Updated last year
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated 2 years ago
- Simple examples using Argilla tools to build AI☆57Updated last year
- The DPAB-α Benchmark☆32Updated 11 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆86Updated this week
- ☆38Updated last year
- Embedding models from Jina AI☆65Updated last year
- Pivotal Token Search☆142Updated 3 weeks ago
- Experimental sampler to make LLMs more creative☆31Updated 2 years ago