dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or serve Large Language Models with an OpenAI-compatible vLLM server.
★26 · Updated 11 months ago
Alternatives and similar repositories for llm-hosting
Users who are interested in llm-hosting are comparing it to the libraries listed below.
- Chrome Extension for exploring Hugging Face datasets ★48 · Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ★26 · Updated last year
- Embedding models from Jina AI ★65 · Updated 2 years ago
- Vanilla-Python ergonomics on top of DSPy ★39 · Updated 8 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ★10 · Updated last year
- Voyage AI Official Python Library ★91 · Updated 2 weeks ago
- Tools for formatting large language model prompts. ★13 · Updated 2 years ago
- Using modal.com to process FineWeb-edu data ★20 · Updated 10 months ago
- Watch your screen while doing sales and fill your CRM automatically ★17 · Updated last year
- Paste Word, get Markdown ★17 · Updated last year
- ★21 · Updated last year
- A collection of tools for your LLMs that run on Modal ★23 · Updated 11 months ago
- Chat Markup Language conversation library ★55 · Updated 2 years ago
- Handout for a talk I gave about LLM and CLI tools ★62 · Updated last year
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard. ★90 · Updated last year
- Verbosity control for AI agents ★66 · Updated last year
- A framework for building large-scale, deterministic, interactive workflows with a fault-tolerant, conversational UX ★44 · Updated last week
- Public reports detailing responses to sets of prompts by Large Language Models. ★32 · Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ★59 · Updated last year
- Examples and guides to using Nomic Atlas ★37 · Updated 9 months ago
- An introduction to DSPy ★33 · Updated 5 months ago
- Python package for extractive NLP using the OpenAI API ★17 · Updated last year
- Apps that run on modal.com ★12 · Updated 4 months ago
- ★20 · Updated last year
- Code interpreter support for o1 ★31 · Updated last year
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu ★73 · Updated 4 months ago
- A cookiecutter template for building plugins for LLM ★29 · Updated 2 months ago
- Simple Graph Memory for AI applications ★90 · Updated 8 months ago
- Structured outputs from DSPy and Jinja2 ★27 · Updated 7 months ago
- Code Interpreter Replica ★26 · Updated 2 years ago