alvin-r / databonsai
clean & curate your data with LLMs.
☆489Updated 10 months ago
Alternatives and similar repositories for databonsai:
Users that are interested in databonsai are comparing it to the libraries listed below
- 90% of what you need for LLM app development. Nothing you don't.☆258Updated 2 weeks ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆221Updated 4 months ago
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆222Updated 4 months ago
- Fully neural approach for text chunking☆343Updated 2 weeks ago
- Action library for AI Agent☆214Updated last month
- A hub for various industry-specific schemas to be used with VLMs.☆506Updated last week
- ☆438Updated 7 months ago
- ☆401Updated 8 months ago
- Radient turns many data types (not just text) into vectors for similarity search, RAG, regression analysis, and more.☆275Updated 2 months ago
- Agents Capable of Self-Editing Their Prompts / Python Code☆764Updated last year
- An SDK for working with LLMs and AI Agents from Apache Airflow, based on Pydantic AI☆369Updated 2 weeks ago
- Things you can do with the token embeddings of an LLM☆1,440Updated last month
- LLM Analytics☆659Updated 6 months ago
- Packages whisper.cpp into pre-built, pip-installable wheels, for macOS and Linux.☆172Updated 11 months ago
- Structured information extraction from documents☆314Updated 7 months ago
- Build, Improve Performance, and Productionize your LLM Application with an Integrated Framework☆339Updated 5 months ago
- Browser-LLM Auto-Scaling Technology☆490Updated this week
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆284Updated last week
- Lightweight Nearest Neighbors with Flexible Backends☆272Updated 2 months ago
- See Through Your Models☆390Updated 2 months ago
- ai for jq☆240Updated 7 months ago
- data cleaning and curation for unstructured text☆329Updated 9 months ago
- Dead Simple LLM Abliteration☆214Updated 2 months ago
- Prompt engineering for developers☆686Updated last year
- OCR Benchmark☆477Updated 3 weeks ago
- The Fast Vector Similarity Library is designed to provide efficient computation of various similarity measures between vectors.☆385Updated 2 months ago
- A scientific instrument for investigating latent spaces☆696Updated 3 weeks ago
- Turn docstrings into LLM-functions☆480Updated last month
- Prompt engineering, automated.☆308Updated 2 weeks ago
- High-performance retrieval engine for unstructured data☆1,373Updated this week