dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity embedding model, or Large Language Models using an OpenAI-compatible vLLM server.
★24 · Updated 7 months ago
Alternatives and similar repositories for llm-hosting
Users who are interested in llm-hosting are comparing it to the repositories listed below.
- Chrome Extension for exploring Hugging Face datasets ★48 · Updated last year
- Embedding models from Jina AI ★65 · Updated last year
- Paste Word, get Markdown ★16 · Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ★25 · Updated 10 months ago
- Voyage AI Official Python Library ★78 · Updated 3 weeks ago
- Handout for a talk I gave about LLM and CLI tools ★62 · Updated last year
- The official Python library for Formulaic ★16 · Updated last year
- A Chrome extension that saves conversations with Claude to GitHub Gists or your clipboard. ★87 · Updated 10 months ago
- A cookiecutter template for building plugins for LLM ★28 · Updated 6 months ago
- watch your screen while doing sales and fill your crm automatically ★16 · Updated last year
- ★20 · Updated 11 months ago
- A collection of tools for your LLMs that run on Modal ★22 · Updated 7 months ago
- LLM plugin for clustering embeddings ★82 · Updated last year
- Verbosity control for AI agents ★65 · Updated last year
- Demo of knowledge graph creation and Graph RAG with BAML and Kuzu ★72 · Updated 3 weeks ago
- Use sync mode Playwright interactively, inside a Jupyter notebook ★15 · Updated 6 months ago
- Web Interface for Vision Language Models Including InternVLM2 ★23 · Updated last year
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl… ★34 · Updated 5 months ago
- Tools for formatting large language model prompts. ★13 · Updated last year
- ★20 · Updated last year
- Chat Markup Language conversation library ★55 · Updated last year
- A curated collection of example marimo notebooks; use these as templates for your own experiments, workflows, and tools. ★52 · Updated this week
- Using modal.com to process FineWeb-edu data ★20 · Updated 6 months ago
- Leverage your LangChain trace data for fine tuning ★46 · Updated last year
- Build knowledge bases for RAG ★28 · Updated 3 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ★59 · Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ★10 · Updated last year
- An introduction to DSPy ★32 · Updated last month
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I… ★116 · Updated 2 months ago
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline ★41 · Updated last year