dwarvesf / llm-hostingLinks
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β24Updated 5 months ago
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below
Sorting:
- Chrome Extension for exploring Hugging Face datasets πβ49Updated 11 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β26Updated 9 months ago
- Tools for formatting large language model prompts.β13Updated last year
- Embedding models from Jina AIβ64Updated last year
- Chat Markup Language conversation libraryβ55Updated last year
- π Build knowledge bases for RAGβ24Updated last month
- β20Updated 10 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 4 months ago
- Handout for a talk I gave about LLM and CLI toolsβ63Updated last year
- Run embedding models using ONNXβ35Updated last year
- Voyage AI Official Python Libraryβ71Updated last month
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.β87Updated 9 months ago
- β20Updated last year
- A collection of tools for your LLMs that run on Modalβ22Updated 6 months ago
- Writing Blog Posts with Generative Feedback Loops!β50Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.β10Updated last year
- Web Interface for Vision Language Models Including InternVLM2β23Updated last year
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)β20Updated last year
- A cookiecutter template for building plugins for LLMβ28Updated 4 months ago
- LLM plugin for clustering embeddingsβ81Updated last year
- Verbosity control for AI agentsβ65Updated last year
- A collection of tools that can be used for LLM function callingβ34Updated 3 months ago
- Vanilla-Python ergonomics on top of DSPyβ33Updated 2 months ago
- β31Updated 7 months ago
- Use sync mode Playwright interactively, inside a Jupyter notebookβ15Updated 4 months ago
- Task management for AI agentsβ14Updated 2 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structureβ51Updated 10 months ago
- Leverage your LangChain trace data for fine tuningβ44Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models.β31Updated 7 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β37Updated last year