dwarvesf / llm-hostingLinks
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β26Updated 11 months ago
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below
Sorting:
- Chrome Extension for exploring Hugging Face datasets πβ48Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β26Updated last year
- Embedding models from Jina AIβ65Updated 2 years ago
- Voyage AI Official Python Libraryβ91Updated last week
- β20Updated last year
- Chat Markup Language conversation libraryβ55Updated 2 years ago
- Create embeddings for LLM using the Nomic APIβ23Updated last year
- Tools for formatting large language model prompts.β13Updated 2 years ago
- β21Updated last year
- Vanilla-Python ergonomics on top of DSPyβ39Updated 8 months ago
- Web Interface for Vision Language Models Including InternVLM2β25Updated last year
- Structured outputs from DSPy and Jinja2β27Updated 7 months ago
- Verbosity control for AI agentsβ66Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graphβ25Updated last year
- A cookiecutter template for building plugins for LLMβ29Updated 2 months ago
- LLM plugin for embeddings using sentence-transformersβ74Updated 9 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.β10Updated last year
- Handout for a talk I gave about LLM and CLI toolsβ62Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.β90Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.β32Updated last year
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPIβ114Updated 2 years ago
- A FastAPI extension for integrating common AI agent frameworks.β47Updated last year
- β17Updated 7 months ago
- The official Python library for Formulaicβ18Updated last year
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UXβ44Updated this week
- Task management for AI agentsβ15Updated 7 months ago
- examples and guides to using Nomic Atlasβ37Updated 9 months ago
- Code Interpreter Replicaβ26Updated 2 years ago
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzuβ73Updated 4 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 10 months ago