replicate / cog-vllmLinks
Run LLMs on Replicate with vLLM
☆26Updated 6 months ago
Alternatives and similar repositories for cog-vllm
Users that are interested in cog-vllm are comparing it to the libraries listed below
Sorting:
- ☆21Updated last year
- Developer showcase of projects built on Cartesia☆20Updated last year
- Apps that run on modal.com☆12Updated 4 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Updated 2 years ago
- ☆47Updated last year
- auto fine tune of models with synthetic data☆78Updated last year
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 11 months ago
- ☆68Updated last year
- Retrieve the source code for any model made available on replicate.com!☆36Updated 2 years ago
- Gradio UI for a Cog API☆70Updated last year
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r…☆73Updated 2 months ago
- ☆41Updated last year
- ☆40Updated 8 months ago
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew…☆59Updated last year
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Updated last year
- Anthropic Computer Use with Modal Sandboxes☆43Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx framework☆77Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆219Updated last month
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆44Updated last year
- Safely push a Cog model version by making sure it works and is backwards-compatible with previous versions.☆16Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- Code interpreter support for o1☆31Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- AI-augmented, conversational information retrieval and data exploration☆37Updated last year
- [WIP] AI Try-On plugin for Chrome☆28Updated last year
- A curated list of amazingly awesome Modal applications, demos, and shiny things. Inspired by awesome-php.☆173Updated last month
- A framework for evaluating function calls made by LLMs☆40Updated last year
- Simple Graph Memory for AI applications☆90Updated 8 months ago
- Convert a web page to markdown☆80Updated last year