dwarvesf / llm-hosting
This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Large Language Models with an OpenAI compatible vLLM server.
β16Updated 3 weeks ago
Related projects β
Alternatives and complementary repositories for llm-hosting
- Python package for extractive NLP using the OpenAI APIβ14Updated 2 months ago
- Chrome Extension for exploring Hugging Face datasets πβ47Updated last month
- applications of https://github.com/PrefectHQ/marvinβ12Updated 9 months ago
- Benchmark structured generation librariesβ21Updated 2 weeks ago
- β18Updated last month
- Knowledge Graph Generator appβ31Updated 6 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and minβ¦β23Updated last week
- LLM-driven automated knowledge graph construction from text using DSPy and Neo4jβ14Updated 2 months ago
- NLP with Rust for Python π¦πβ59Updated 5 months ago
- Using modal.com to process FineWeb-edu dataβ19Updated 2 months ago
- GraphRag vs Embeddingsβ13Updated 3 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.β64Updated last week
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iβ¦β19Updated 2 years ago
- β20Updated 9 months ago
- Create embeddings for LLM using the Nomic APIβ16Updated 7 months ago
- An attribution library for LLMsβ34Updated last month
- Paste Word, get Markdownβ13Updated 3 months ago
- The official Python library for Formulaicβ14Updated 6 months ago
- Embedding models from Jina AIβ56Updated 9 months ago
- Chat Markup Language conversation libraryβ54Updated 10 months ago
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graphβ20Updated 8 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBreadβ19Updated 7 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM datasetβ13Updated 8 months ago
- scraping and querying documents for LLMsβ13Updated last week
- Connect to your customer data using any LLM and gain actionable insights. IdentityRAG creates a single comprehensive customer 360 view (gβ¦β21Updated this week
- Have UV deal with all your Jupyter deps.β18Updated 2 months ago
- Quick Notebook Tutorialsβ27Updated 3 weeks ago
- examples and guides to using Nomic Atlasβ27Updated 2 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.β10Updated 4 months ago
- LLM plugin for models hosted by Anyscale Endpointsβ32Updated 6 months ago