rentruewang / bocoelLinks

Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lines of modular code.

☆286

Alternatives and similar repositories for bocoel

Users that are interested in bocoel are comparing it to the libraries listed below

Sorting:

labmlai / inspectus
LLM Analytics
☆690Updated last year
andyk / recursive_llm
Implement recursion using English as the programming language and an LLM as the runtime.
☆236Updated 2 years ago
PaulPauls / llama3_interpretability_sae
A complete end-to-end pipeline for LLM interpretability with sparse autoencoders (SAEs) using Llama 3.2, written in pure PyTorch and full…
☆625Updated 6 months ago
Tsadoq / ErisForge
Dead Simple LLM Abliteration
☆232Updated 8 months ago
joennlae / tensorli
Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).
☆254Updated last year
samvher / bert-for-laptops
A BERT that you can train on a (gaming) laptop.
☆207Updated 2 years ago
Futrell / ziplm
☆254Updated 2 years ago
fzliu / radient
Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.
☆280Updated last month
jostmey / NakedAttention
Revealing example of self-attention, the building block of transformer AI models
☆130Updated 2 years ago
Dicklesworthstone / fast_vector_similarity
The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.
☆404Updated 7 months ago
neurallambda / awesome-reasoning
a curated list of data for reasoning ai
☆140Updated last year
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆375Updated 9 months ago
arc53 / llm-price-compass
This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …
☆220Updated 10 months ago
DebarghaG / proofofthought
Proof of thought : LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL)
☆342Updated this week
facebookresearch / searchformer
Official codebase for the paper "Beyond A* Better Planning with Transformers via Search Dynamics Bootstrapping".
☆373Updated last year
anordin95 / run-llama-locally
Run and explore Llama models locally with minimal dependencies on CPU
☆189Updated last year
taylorai / aiq
ai for jq
☆244Updated last year
bananaml / fructose
☆746Updated last year
ask-fini / paramount
Agent accuracy measurements for LLMs
☆203Updated last year
MinishLab / vicinity
Lightweight Nearest Neighbors with Flexible Backends
☆311Updated 2 weeks ago
kolinko / effort
An implementation of bucketMul LLM inference
☆223Updated last year
M4THYOU / TokenDagger
High-Performance Implementation of OpenAI's TikToken.
☆458Updated 3 months ago
slashml / amd_inference
Docker-based inference engine for AMD GPUs
☆230Updated last year
em-llm / EM-LLM-model
☆234Updated 7 months ago
arena-ai / structured-logprobs
OpenAI's Structured Outputs with Logprobs
☆190Updated 4 months ago
idoh / mamba.np
A pure NumPy implementation of Mamba.
☆223Updated last year
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆124Updated 6 months ago
klara-research / klarity
See Through Your Models
☆400Updated 3 months ago
lechmazur / elimination_game
A multi-player tournament benchmark that tests LLMs in social reasoning, strategy, and deception. Players engage in public and private co…
☆290Updated 2 months ago
okuvshynov / slowllama
Finetune llama2-70b and codellama on MacBook Air without quantization
☆449Updated last year