dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that serve embeddings using the Infinity Embedding model, or Large Language Models through an OpenAI-compatible vLLM server.
☆26 · Updated 10 months ago
Alternatives and similar repositories for llm-hosting
Users who are interested in llm-hosting are comparing it to the repositories listed below
- Chrome Extension for exploring Hugging Face datasets ☆48 · Updated last year
- Tools for formatting large language model prompts. ☆13 · Updated 2 years ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 · Updated last year
- Build knowledge bases for RAG ☆31 · Updated 6 months ago
- Embedding models from Jina AI ☆65 · Updated 2 years ago
- ☆21 · Updated last year
- An introduction to DSPy ☆32 · Updated 4 months ago
- Using modal.com to process FineWeb-edu data ☆20 · Updated 9 months ago
- Paste Word, get Markdown ☆17 · Updated last year
- Voyage AI Official Python Library ☆91 · Updated last month
- LLM plugin for clustering embeddings ☆82 · Updated last year
- Vanilla-Python ergonomics on top of DSPy ☆39 · Updated 7 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 · Updated last year
- Web Interface for Vision Language Models Including InternVLM2 ☆25 · Updated last year
- The official Python library for Formulaic ☆18 · Updated last year
- A cookiecutter template for building plugins for LLM ☆29 · Updated last month
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆43 · Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph ☆25 · Updated last year
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX ☆43 · Updated 3 weeks ago
- A collection of tools for your LLMs that run on Modal ☆23 · Updated 10 months ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu ☆73 · Updated 4 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ☆41 · Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models. ☆32 · Updated last year
- Chat Markup Language conversation library ☆55 · Updated 2 years ago
- ☆35 · Updated last year
- ☆17 · Updated 6 months ago
- Structured outputs from DSPy and Jinja2 ☆26 · Updated 6 months ago
- ☆33 · Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆46 · Updated 2 years ago
- Handout for a talk I gave about LLM and CLI tools ☆62 · Updated last year