dwarvesf / llm-hostingLinks
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β23Updated 7 months ago
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below
Sorting:
- Chrome Extension for exploring Hugging Face datasets πβ49Updated last year
- Embedding models from Jina AIβ65Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β25Updated 11 months ago
- Voyage AI Official Python Libraryβ80Updated last month
- A cookiecutter template for building plugins for LLMβ28Updated 3 weeks ago
- Structured outputs from DSPy and Jinja2β26Updated 4 months ago
- Paste Word, get Markdownβ17Updated last year
- Handout for a talk I gave about LLM and CLI toolsβ62Updated last year
- Tools for formatting large language model prompts.β13Updated last year
- Using modal.com to process FineWeb-edu dataβ20Updated 6 months ago
- A collection of tools for your LLMs that run on Modalβ22Updated 8 months ago
- Run embedding models using ONNXβ35Updated last year
- Chat Markup Language conversation libraryβ55Updated last year
- Use sync mode Playwright interactively, inside a Jupyter notebookβ15Updated 6 months ago
- FalkorDB-Browser is a visualization UI for FalkorDB.β56Updated last week
- Contains the model patches and the eval logs from the passing swe-bench-lite run.β10Updated last year
- LLM plugin for embeddings using sentence-transformersβ72Updated 6 months ago
- Public reports detailing responses to sets of prompts by Large Language Models.β31Updated 9 months ago
- β32Updated 2 years ago
- An introduction to DSPyβ32Updated 2 months ago
- Web Interface for Vision Language Models Including InternVLM2β23Updated last year
- LLM plugin for clustering embeddingsβ82Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β41Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.β87Updated 11 months ago
- Code interpreter support for o1β32Updated last year
- π Build knowledge bases for RAGβ29Updated 3 months ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzuβ72Updated last month
- The official Python library for Formulaicβ16Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crewβ¦β59Updated last year
- Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. MCP & ACP (Agent Client Protocol) sβ¦β29Updated this week