llmonpy / needle-in-a-needlestack
☆112Updated 2 months ago
Alternatives and similar repositories for needle-in-a-needlestack:
Users that are interested in needle-in-a-needlestack are comparing it to the libraries listed below
- a curated list of data for reasoning ai☆134Updated 8 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- Synthetic Data for LLM Fine-Tuning☆114Updated last year
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆87Updated this week
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆223Updated 11 months ago
- ☆153Updated 9 months ago
- Just a bunch of benchmark logs for different LLMs☆119Updated 8 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆139Updated 2 months ago
- Hallucinations (Confabulations) Document-Based Benchmark for RAG. Includes human-verified questions and answers.☆124Updated last week
- Visualize the intermediate output of Mistral 7B☆357Updated 3 months ago
- Train your own SOTA deductive reasoning model☆88Updated last month
- Function Calling Benchmark & Testing☆87Updated 9 months ago
- Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.☆82Updated last year
- ReLM is a Regular Expression engine for Language Models☆103Updated last year
- look how they massacred my boy☆63Updated 6 months ago
- Mistral7B playing DOOM☆131Updated 9 months ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆233Updated 2 months ago
- ☆67Updated 5 months ago
- ☆150Updated 4 months ago
- Enforce structured output from LLMs 100% of the time☆249Updated 9 months ago
- A comprehensive deep dive into the world of tokens☆222Updated 10 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆220Updated 4 months ago
- A subset of jailbreaks automatically discovered by the Haize Labs haizing suite.☆89Updated last week
- Action library for AI Agent☆214Updated 3 weeks ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- Your buddy in the (L)LM space.☆64Updated 7 months ago
- run embeddings in MLX☆86Updated 6 months ago
- Extend the original llama.cpp repo to support redpajama model.☆117Updated 7 months ago
- ☆118Updated 8 months ago
- Generate ideal question-answers for testing RAG☆126Updated 2 months ago