replicate / cog-triton
A cog implementation of Nvidia's Triton server
☆17Updated 5 months ago
Alternatives and similar repositories for cog-triton:
Users that are interested in cog-triton are comparing it to the libraries listed below
- SDK for the Tavily search API which is tailored for LLM agents.☆12Updated 10 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 weeks ago
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆13Updated last week
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e…☆22Updated 11 months ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆22Updated last month
- The Swarm Ecosystem☆20Updated 8 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated 5 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 4 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 9 months ago
- ☆26Updated 3 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face.☆17Updated 7 months ago
- ☆18Updated 8 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code!☆15Updated this week
- ☆11Updated 7 months ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆16Updated last week
- ☆32Updated last year
- 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor☆22Updated last year
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection…☆11Updated last month
- Multi-agent workflows and complex Agent interactions, both via YAML manifest and programmatic usage. Pydantic-AI and LiteLLM backends. Hu…☆16Updated this week
- ☆15Updated last year
- A library to convert Pydantic models to TypedDict☆26Updated 8 months ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollama☆10Updated 11 months ago
- ☆19Updated last month
- Proof of concept for running moshi/hibiki using webrtc☆18Updated last month
- ☆11Updated 2 months ago
- Setup an MCP server in 60 seconds.☆12Updated 4 months ago
- ☆12Updated last year
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆11Updated last week
- MCP remote server for AI Engineer World's Fair 2025☆12Updated this week