dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that either serve embeddings using the Infinity Embedding model or serve Large Language Models through an OpenAI-compatible vLLM server.
☆20 · Updated 3 months ago
Alternatives and similar repositories for llm-hosting:
Users who are interested in llm-hosting are comparing it to the repositories listed below.
- applications of https://github.com/PrefectHQ/marvin ☆12 · Updated last year
- LLM plugin for embeddings using sentence-transformers ☆48 · Updated last week
- ☆18 · Updated 4 months ago
- ☆30 · Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 · Updated 5 months ago
- Tools for formatting large language model prompts. ☆12 · Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 · Updated 7 months ago
- The official Python library for Formulaic ☆16 · Updated 9 months ago
- Inference examples ☆35 · Updated last month
- GraphRag vs Embeddings ☆13 · Updated 7 months ago
- Structured outputs from DSPy and Jinja2 ☆22 · Updated last month
- FalkorDB-Browser is a visualization UI for FalkorDB. ☆26 · Updated this week
- Nexusflow function call, tool use, and agent benchmarks. ☆19 · Updated 2 months ago
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, … ☆26 · Updated 6 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆25 · Updated 3 months ago
- Using modal.com to process FineWeb-edu data ☆20 · Updated 2 months ago
- ☆41 · Updated 2 months ago
- GitHub repo for storing LlamaDatasets ☆33 · Updated last year
- Chat Markup Language conversation library ☆55 · Updated last year
- Apps that run on modal.com ☆12 · Updated 8 months ago
- Web Interface for Vision Language Models Including InternVLM2 ☆17 · Updated 6 months ago
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl… ☆31 · Updated last week
- Repo to experiment with Graph RAG strategies using Kùzu ☆44 · Updated 2 months ago
- Streamlit app for recommending eval functions using prompt diffs ☆27 · Updated last year
- Uses a Gradio interface to stream coding-related responses from local and cloud-based large language models. Pulls context from GitHub Re… ☆18 · Updated 5 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebook ☆14 · Updated 2 months ago
- ☆20 · Updated last year
- ☆12 · Updated 5 months ago