replicate / cog-vllmLinks

Run LLMs on Replicate with vLLM

☆20

Alternatives and similar repositories for cog-vllm

Users that are interested in cog-vllm are comparing it to the libraries listed below

Sorting:

davanstrien / data-for-fine-tuning-llms
☆79Updated last year
nateraw / modal-examples
Apps that run on modal.com
☆12Updated last month
hamelsmu / replicate-examples
☆22Updated last year
eugeneyan / visualizing-finetunes
☆78Updated last year
1rgs / tokenwiz
A clone of OpenAI's Tokenizer page for HuggingFace Models
☆45Updated last year
weaviate / structured-rag
Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models
☆111Updated 4 months ago
deployradiant / pychatml
Chat Markup Language conversation library
☆55Updated last year
yoheinakajima / autofinetune
auto fine tune of models with synthetic data
☆76Updated last year
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 4 months ago
parlance-labs / mcp-llms.txt
Minimal example of MCP for parsing llms.txt
☆40Updated 4 months ago
jxnl / instructor-classify
☆35Updated 3 months ago
AnswerDotAI / web2md-ext
Get a markdown version of any webpage with a keyboard shortcut.
☆65Updated 5 months ago
BBischof / yapping
Verbosity control for AI agents
☆64Updated last year
mzbac / mlx-lora
☆38Updated last year
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50Updated last year
matthelmer / DSPy-examples
Example code using the DSPy framework.
☆19Updated last year
teknium1 / transformers-gptq-quant
☆47Updated last year
Not-Diamond / RoRF
Routing on Random Forest (RoRF)
☆187Updated 10 months ago
hamelsmu / claudesave
A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.
☆87Updated 8 months ago
virevolai / logos-shift-client
Replace expensive LLM calls with finetunes automatically
☆65Updated last year
JeezAI / DSPy_matchmaking
A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…
☆59Updated last year
Technoculture / personal-graph
Simple Graph Memory for AI applications
☆89Updated 2 months ago
interstellarninja / function-calling-eval
A framework for evaluating function calls made by LLMs
☆37Updated last year
swyxio / openlangmem
☆47Updated last year
AnswerDotAI / web2md
Convert a web page to markdown
☆77Updated 11 months ago
QuixiAI / kraken
☆66Updated last year
interstellarninja / MeeseeksAI
A framework for orchestrating AI agents using a mermaid graph
☆77Updated last year
tom-doerr / dspy_nodes
WIP - Allows you to create DSPy pipelines using ComfyUI
☆193Updated 8 months ago
multimodalart / grog
Gradio UI for a Cog API
☆69Updated last year
taylorai / mlx_embedding_models
run embeddings in MLX
☆90Updated 10 months ago