flowaicom / flow-judgeView external linksLinks
Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafted for accuracy, speed, and customization.
☆84Oct 29, 2024Updated last year
Alternatives and similar repositories for flow-judge
Users that are interested in flow-judge are comparing it to the libraries listed below
Sorting:
- C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…☆23Mar 4, 2024Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆77Updated this week
- Tooling for exact and MinHash deduplication of large-scale text datasets☆68Feb 4, 2026Updated last week
- ☆28Aug 21, 2025Updated 5 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆47Sep 26, 2024Updated last year
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- Python package for extractive NLP using the OpenAI API☆17Aug 28, 2024Updated last year
- ☆53Oct 16, 2024Updated last year
- A text analysis library for relevance and subtheme detection☆16Sep 22, 2025Updated 4 months ago
- ☆13May 30, 2024Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated 11 months ago
- ☆210Jun 26, 2025Updated 7 months ago
- Chunk Dedupe Estimation☆20Nov 5, 2024Updated last year
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Feb 9, 2024Updated 2 years ago
- Simple examples using Argilla tools to build AI☆57Nov 18, 2024Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆69Nov 17, 2025Updated 2 months ago
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- LlamaWorksDB is a Retrieval Augmented Generation (RAG) product designed to interact with the documentation of various products such as Ll…☆17May 3, 2024Updated last year
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆201Jul 17, 2024Updated last year
- Evaluating LLMs with fewer examples☆169Apr 12, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- A modular framework for building massively parallel agentic systems☆29Sep 8, 2025Updated 5 months ago
- ☆120Aug 28, 2024Updated last year
- private-machine is an AI companion system with emotion, needs and goals simulation. Very silly, not based on real science.☆28Nov 13, 2025Updated 3 months ago
- A version of BabyAGI with numpy instead of pinecone and an evaluation agent to check success criteria☆15Apr 18, 2023Updated 2 years ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50May 19, 2025Updated 8 months ago
- ☆20Jan 27, 2024Updated 2 years ago
- ☆18Sep 5, 2024Updated last year
- Lightweight tools for quick and easy LLM demo's☆28Sep 22, 2024Updated last year
- Code generator using LlamaIndexTS workflows with OpenAI o1 model☆52Feb 4, 2025Updated last year
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Jul 25, 2024Updated last year
- Generic rag framework to apply the power of LLMs on any given dataset☆668Dec 16, 2025Updated 2 months ago
- Python library to use Pleias-RAG models☆68May 1, 2025Updated 9 months ago
- Shared personal notes created while working with the Apple MLX machine learning framework☆24Dec 12, 2025Updated 2 months ago
- ☆28Apr 14, 2025Updated 10 months ago
- Loader extension for tabbyAPI in SillyTavern☆26Jun 30, 2025Updated 7 months ago
- ☆30Mar 18, 2024Updated last year
- ☆29Nov 9, 2025Updated 3 months ago