lucyknada / detective-needle-llm
★12 Updated 6 months ago
Alternatives and similar repositories for detective-needle-llm:
Users who are interested in detective-needle-llm are comparing it to the libraries listed below
- Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ★37 Updated last year
- ★53 Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks. ★19 Updated 3 months ago
- ★27 Updated 6 months ago
- Modified Beam Search with periodical restart ★12 Updated 6 months ago
- ★111 Updated 3 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ★30 Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM ★44 Updated 10 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ★86 Updated 3 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ★59 Updated 7 months ago
- run ollama & gguf easily with a single command ★49 Updated 10 months ago
- Scripts to create your own moe models using mlx ★89 Updated last year
- Embed anything. ★29 Updated 10 months ago
- ★20 Updated last month
- The DPAB-α Benchmark ★19 Updated 2 months ago
- A framework for high-fidelity retrieval augmented generation in industrial knowledge bases. Integrates jargon identification, context rec… ★29 Updated 7 months ago
- Simple examples using Argilla tools to build AI ★53 Updated 4 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ★25 Updated 4 months ago
- Using modal.com to process FineWeb-edu data ★20 Updated 2 weeks ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ★31 Updated 10 months ago
- Yet Another (LLM) Web UI, made with Gemini ★11 Updated 2 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ★67 Updated 4 months ago
- A high performance batching router optimises max throughput for text inference workload ★16 Updated last year
- Web Interface for Vision Language Models Including InternVLM2 ★19 Updated 7 months ago
- entropix style sampling + GUI ★25 Updated 4 months ago