phospho-app / fastassertLinks

Dockerized LLM inference server with constrained output (JSON mode), built on top of vLLM and outlines. Faster, cheaper and without rate limits. Compare the quality and latency to your current LLM API provider.

☆27

Alternatives and similar repositories for fastassert

Users that are interested in fastassert are comparing it to the libraries listed below

Sorting:

FanaHOVA / openai-o1-code-interpreter
Code interpreter support for o1
☆32Updated 9 months ago
2sunflower33 / homeai
AI real estate agent
☆35Updated last year
yoheinakajima / captainfunction
A Python package to dynamically load functions for OpenAI Assistant
☆54Updated last year
JeezAI / DSPy_matchmaking
A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…
☆59Updated last year
andrewnguonly / ChatAbstractions
LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more!
☆81Updated last year
parea-ai / parea-sdk-py
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
☆78Updated 4 months ago
WaitThatShouldntWork / Infer
Your local personalised AI agent
☆44Updated 7 months ago
superagent-ai / gpt-researcher
GPT based autonomous agent that does online comprehensive research on any given topic
☆11Updated last year
pavanjava / llama_workflow_and_agents
This repository is a combination of llama workflows and agents together which is a powerful concept.
☆17Updated 10 months ago
Attunewise / GPT
OpenAI GPT hosted Agent Framework for Windows and MacOS
☆36Updated 11 months ago
mendableai / gen-ui-firecrawl
☆46Updated last year
swyxio / openlangmem
☆47Updated last year
zhudotexe / redel
ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)
☆80Updated 3 months ago
sagentic-ai / sagentic-af
😎 Sagentic.ai Agent Framework - Sagentic.ai is a unified platform for building, running and scaling autonomous agents.
☆71Updated last week
rsrohan99 / dynamic-few-shot-llamaindex-workflow
☆53Updated 8 months ago
StanGirard / starfinder
Extract valuable information from your project github Stars & Forks such as email, company, twitter and then explore it with streamlit🌟
☆21Updated last year
yoheinakajima / autofinetune
auto fine tune of models with synthetic data
☆75Updated last year
BoundaryML / baml-examples
☆88Updated 3 weeks ago
yoheinakajima / quick_email_scripts
A couple scripts to grab stats from email
☆43Updated 9 months ago
instructor-ai / cloud
Annoucing Instructor Cloud
☆36Updated 10 months ago
yoheinakajima / jsondr
converts url content into JSON with a simple prefix
☆70Updated last year
yasyf / anthropic-computer-use-modal
Anthropic Computer Use with Modal Sandboxes
☆36Updated 8 months ago
seanchatmangpt / rdddy
Reactive DDD with DSPy
☆22Updated last year
ruvnet / open-space
An open source code of the GitHub Copilot Workspace
☆11Updated last year
justrach / bhumi
⚡ Bhumi – The fastest AI inference client for Python, built with Rust for unmatched speed, efficiency, and scalability 🚀
☆56Updated 3 weeks ago
Bklieger / Semantic
SemanticPDF: Drag, Drop, Semantic Search - SemanticPDF is a simple, privacy-focused application that makes it easy to upload a PDF file a…
☆66Updated last year
mendableai / data-connectors
LLM-ready data connectors
☆85Updated last year
mourad-ghafiri / OpenMindedChatbot
OpenMindedChatbot is a Proof Of Concept that leverages the power of Open source Large Language Models (LLM) with Function Calling capabil…
☆29Updated last year
OpenPipe / pii-redaction
Detect and redact PII locally with SOTA performance
☆53Updated 3 months ago
marco-bertelli / medium-rag-frontend
Rag Chatbot React And Tyepscript base boilerplate
☆33Updated last year