dwarvesf / llm-hostingLinks
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β22Updated 3 months ago
Alternatives and similar repositories for llm-hosting
Users that are interested in llm-hosting are comparing it to the libraries listed below
Sorting:
- Chrome Extension for exploring Hugging Face datasets πβ50Updated 8 months ago
- Python package for extractive NLP using the OpenAI APIβ17Updated 9 months ago
- A text-to-SQL prototype on the northwind sqlite datasetβ12Updated 8 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β26Updated 6 months ago
- The official Python library for Formulaicβ16Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.β35Updated last year
- Contains the model patches and the eval logs from the passing swe-bench-lite run.β10Updated 11 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 2 months ago
- applications of https://github.com/PrefectHQ/marvinβ12Updated last year
- GraphRag vs Embeddingsβ13Updated 10 months ago
- Create embeddings for LLM using the Nomic APIβ23Updated 6 months ago
- π€οΈ Pathik - High-Performance Web Crawler β‘β26Updated 2 months ago
- Tools for formatting large language model prompts.β13Updated last year
- β19Updated 7 months ago
- β20Updated last year
- convert natural language into technical diagramsβ14Updated 5 months ago
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injectionβ¦β12Updated 2 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!β17Updated last week
- An integration of Qdrant ANN vector database backend with txtaiβ24Updated 9 months ago
- Run evals using LLMβ25Updated last year
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unlβ¦β33Updated last month
- Writing Blog Posts with Generative Feedback Loops!β48Updated last year
- Benchmark study on LanceDB, an embedded vector DB, for full-text search and vector searchβ26Updated last year
- Adding Marimo to Datasetteβ20Updated 2 months ago
- Paste Word, get Markdownβ16Updated 10 months ago
- An AI character interaction system with emotional modeling and advanced memory managementβ16Updated 7 months ago
- Detect and redact PII locally with SOTA performanceβ50Updated 2 months ago
- Structured outputs from DSPy and Jinja2β23Updated 2 weeks ago
- A function to do allβ36Updated last year
- β1Updated 10 months ago