dwarvesf / llm-hostingLinks
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β26Updated 9 months ago
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below
Sorting:
- Chrome Extension for exploring Hugging Face datasets πβ49Updated last year
- Embedding models from Jina AIβ65Updated last year
- A cookiecutter template for building plugins for LLMβ28Updated last week
- Paste Word, get Markdownβ17Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β26Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.β10Updated last year
- Structured outputs from DSPy and Jinja2β26Updated 5 months ago
- Handout for a talk I gave about LLM and CLI toolsβ62Updated last year
- The official Python library for Formulaicβ17Updated last year
- Tools for formatting large language model prompts.β13Updated last year
- A collection of tools for your LLMs that run on Modalβ22Updated 9 months ago
- Web Interface for Vision Language Models Including InternVLM2β25Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated 8 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.β89Updated last year
- Voyage AI Official Python Libraryβ83Updated this week
- Verbosity control for AI agentsβ64Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β43Updated last year
- examples and guides to using Nomic Atlasβ37Updated 7 months ago
- Code interpreter support for o1β31Updated last year
- An introduction to DSPyβ32Updated 3 months ago
- Code Interpreter Replicaβ25Updated 2 years ago
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UXβ43Updated 3 weeks ago
- Vanilla-Python ergonomics on top of DSPyβ38Updated 6 months ago
- β32Updated last month
- AI_Powered_Dev_Search_Engineβ12Updated last year
- Run embedding models using ONNXβ35Updated last year
- β20Updated last year
- LLM plugin for embeddings using sentence-transformersβ72Updated 7 months ago
- β21Updated last year
- LLM plugin for clustering embeddingsβ82Updated last year