labmlai / inspectus
LLM Analytics
☆615Updated last month
Related projects ⓘ
Alternatives and complementary repositories for inspectus
- Felafax is building AI infra for non-NVIDIA GPUs☆509Updated this week
- Visualize the intermediate output of Mistral 7B☆313Updated 9 months ago
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆271Updated this week
- Stateful load balancer custom-tailored for llama.cpp☆563Updated this week
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆210Updated last month
- Tech Stack for Building, Evaluating, and Deploying your LLM Application☆328Updated this week
- Fine-tune LLM agents with online reinforcement learning☆995Updated 8 months ago
- ☆379Updated 3 months ago
- clean & curate your data with LLMs.☆470Updated 4 months ago
- Action library for AI Agent☆191Updated 2 weeks ago
- Things you can do with the token embeddings of an LLM☆1,376Updated last week
- ai for jq☆234Updated 2 months ago
- ☆727Updated 7 months ago
- Finetune llama2-70b and codellama on MacBook Air without quantization☆447Updated 7 months ago
- ☆777Updated 2 weeks ago
- The Open Source Memory Layer For Autonomous Agents☆1,483Updated 3 weeks ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆498Updated 3 weeks ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,095Updated last week
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆448Updated 8 months ago
- A library for making RepE control vectors☆481Updated last month
- ☆448Updated 7 months ago
- 🦛 CHONK your texts with Chonkie ✨ - The no-nonsense RAG chunking library☆1,415Updated this week
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆268Updated 2 months ago
- ☆416Updated 2 months ago
- Implementing the 4 agentic patterns from scratch☆751Updated 3 weeks ago
- Agents Capable of Self-Editing Their Prompts / Python Code☆745Updated 8 months ago
- Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.☆687Updated 2 months ago
- Agent accuracy measurements for LLMs☆203Updated 5 months ago
- Ask GPT to run a command☆193Updated 2 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆836Updated 10 months ago