dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model, or large language models using an OpenAI-compatible vLLM server.
☆20 Updated last week
Alternatives and similar repositories for llm-hosting:
Users who are interested in llm-hosting are comparing it to the libraries listed below.
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆14 Updated last month
- Chrome Extension for exploring Hugging Face datasets ☆49 Updated 5 months ago
- Tools for formatting large language model prompts. ☆12 Updated last year
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, … ☆26 Updated 6 months ago
- ☆30 Updated last year
- Structured outputs from DSPy and Jinja2 ☆23 Updated 2 months ago
- Web Interface for Vision Language Models Including InternVLM2 ☆17 Updated 7 months ago
- applications of https://github.com/PrefectHQ/marvin ☆12 Updated last year
- The official Python library for Formulaic ☆17 Updated 10 months ago
- examples and guides to using Nomic Atlas ☆27 Updated 2 weeks ago
- Python package for extractive NLP using the OpenAI API ☆17 Updated 6 months ago
- Using modal.com to process FineWeb-edu data ☆20 Updated last week
- LLM plugin for embeddings using sentence-transformers ☆52 Updated last month
- FalkorDB-Browser is a visualization UI for FalkorDB. ☆26 Updated this week
- A cog model for the all-mpnet-base-v2 sentence-transformers embedding model. ☆11 Updated last year
- Automatically pass your functions defined in Python to ChatGPT and have it call them back seamlessly. ☆13 Updated last year
- ☆20 Updated last year
- ☆20 Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆32 Updated last year
- a simple create-llama template using llama-index v0.10 and integrated with Ollama ☆10 Updated 9 months ago
- A QT GUI for large language models ☆31 Updated last year
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated 8 months ago
- Lightweight OpenAI wrapper using FastAPI. Add rate limits to OpenAI usage, optionally log and store all API calls, and share regulated Op… ☆13 Updated last year
- Paste Word, get Markdown ☆15 Updated 7 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆25 Updated 4 months ago
- ☆12 Updated 6 months ago