replicate / cog-triton
A cog implementation of Nvidia's Triton server
☆16Updated 3 months ago
Alternatives and similar repositories for cog-triton:
Users that are interested in cog-triton are comparing it to the libraries listed below
- ☆30Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 2 months ago
- SDK for the Tavily search API which is tailored for LLM agents.☆12Updated 8 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆14Updated last week
- Cog wrapper for collabora/WhisperSpeech☆25Updated 11 months ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆10Updated 9 months ago
- The Swarm Ecosystem☆19Updated 6 months ago
- Replit template for hosting LangChain runnables via LangServe☆39Updated last year
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 5 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 7 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1…☆14Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- An app for generating prompts☆24Updated 3 weeks ago
- Web page with political compass quiz results for open LLMs☆37Updated last year
- A daemon that makes a desktop OS accessible to AI agents☆20Updated this week
- 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor☆22Updated last year
- Convert an audio file to a waveform video☆10Updated last year
- ☆30Updated 7 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆44Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 3 months ago
- An Infr app that automates data collection from your PC, macOS or Linux client.☆11Updated last year
- [WIP] AI Try-On plugin for Chrome☆27Updated 11 months ago
- A Model Context Protocol (MCP) server that provides JSON-RPC functionality through OpenRPC.☆18Updated 2 weeks ago
- Auto-Video maker handling many AI's