ezyang / ai-blindspotsLinks
Blindspots in LLMs I've noticed while AI coding. Sonnet family emphasis.
☆13Updated 3 months ago
Alternatives and similar repositories for ai-blindspots
Users that are interested in ai-blindspots are comparing it to the libraries listed below
Sorting:
- Pivotal Token Search☆109Updated this week
- Editor with LLM generation tree exploration☆71Updated 5 months ago
- Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.☆32Updated 4 months ago
- ☆28Updated 10 months ago
- Approximating the joint distribution of language models via MCTS☆21Updated 8 months ago
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆54Updated 4 months ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth…☆31Updated 3 months ago
- Simple LLM inference server☆20Updated last year
- Benchmark that evaluates LLMs using 651 NYT Connections puzzles extended with extra trick words☆130Updated this week
- Lego for GRPO☆28Updated last month
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆78Updated 9 months ago
- The developper starter pack for document processing☆16Updated this week
- look how they massacred my boy☆63Updated 9 months ago
- Access the Cohere Command R family of models☆37Updated 3 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 8 months ago
- ☆38Updated 4 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆64Updated 8 months ago
- a socketteer/loom reimplementation in obsidian☆17Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆25Updated 7 months ago
- The code repository for the CURLoRA research paper. Stable LLM continual fine-tuning and catastrophic forgetting mitigation.☆47Updated 10 months ago
- Because it's there.☆16Updated 9 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31Updated last year
- explore token trajectory trees on instruct and base models☆134Updated last month
- Turing machines, Rule 110, and A::B reversal using Claude 3 Opus.☆58Updated last year
- ☆14Updated 10 months ago
- ☆49Updated last week
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆126Updated 2 months ago
- ☆38Updated 11 months ago
- Samples of good AI generated CUDA kernels☆84Updated last month
- new optimizer☆20Updated 11 months ago