rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lines of modular code.
☆279 Updated last month
Alternatives and similar repositories for bocoel:
Users who are interested in bocoel are comparing it to the libraries listed below.
- Dead Simple LLM Abliteration ☆211 Updated last month
- A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full… ☆601 Updated 3 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines). ☆250 Updated last year
- a curated list of data for reasoning ai ☆131 Updated 7 months ago
- LLM Analytics ☆646 Updated 5 months ago
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors. ☆384 Updated 3 weeks ago
- ☆163 Updated 9 months ago
- ai for jq ☆238 Updated 6 months ago
- Visualize the intermediate output of Mistral 7B ☆344 Updated 2 months ago
- Lightweight Nearest Neighbors with Flexible Backends ☆260 Updated 2 weeks ago
- Agent accuracy measurements for LLMs ☆205 Updated 9 months ago
- Enforce structured output from LLMs 100% of the time ☆248 Updated 8 months ago
- Tiny inference-only implementation of LLaMA ☆92 Updated 11 months ago
- OpenAI's Structured Outputs with Logprobs ☆154 Updated last month
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient … ☆220 Updated 3 months ago
- An implementation of bucketMul LLM inference ☆215 Updated 8 months ago
- Docker-based inference engine for AMD GPUs ☆230 Updated 5 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more. ☆274 Updated last week
- Implement recursion using English as the programming language and an LLM as the runtime. ☆137 Updated last year
- Revealing example of self-attention, the building block of transformer AI models ☆130 Updated last year
- Generate ideal question-answers for testing RAG ☆126 Updated 3 weeks ago
- Visualize text embeddings ☆35 Updated last year
- Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping". ☆365 Updated 9 months ago
- ☆253 Updated last year
- Run and explore Llama models locally with minimal dependencies on CPU ☆191 Updated 5 months ago
- ☆742 Updated 11 months ago
- LLM verified with Monte Carlo Tree Search ☆270 Updated last month
- Examples and guides for using the VLM Run API ☆265 Updated last week