replicate / cog-triton
A Cog implementation of NVIDIA's Triton server
☆17 Updated 9 months ago
Alternatives and similar repositories for cog-triton
Users who are interested in cog-triton are comparing it to the libraries listed below.
- Contains the model patches and the eval logs from the passing swe-bench-lite run. ☆10 Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 8 months ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call ☆19 Updated 11 months ago
- An app for generating prompts ☆27 Updated 6 months ago
- Task management for AI agents ☆15 Updated last month
- ☆19 Updated 11 months ago
- Code Interpreter Replica ☆24 Updated 2 years ago
- ☆11 Updated 11 months ago
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT) ☆18 Updated 2 months ago
- Proof of concept for running moshi/hibiki using webrtc ☆20 Updated 5 months ago
- ☆33 Updated 11 months ago
- 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor ☆24 Updated last year
- ☆38 Updated 6 months ago
- An Infr app that helps you replay & talk to everything you've ever seen. ☆16 Updated last year
- A library to convert Pydantic models to TypedDict ☆30 Updated 11 months ago
- ☆33 Updated last year
- Using modal.com to process FineWeb-edu data ☆20 Updated 4 months ago
- Agentic Workflow Platform to do Business-as-Code and deliver valuable Services-as-Software through simple APIs and SDKs ☆29 Updated this week
- The Swarm Ecosystem ☆22 Updated last year
- Simple orchestration for EC2 spot containers ☆19 Updated 10 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models ☆45 Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an… ☆14 Updated 2 weeks ago
- Chat AI (↓↓Scroll to see more↓↓) ☆28 Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆11 Updated last year
- Public reports detailing responses to sets of prompts by Large Language Models. ☆31 Updated 7 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package ☆27 Updated this week
- Run LLMs on Replicate with vLLM ☆20 Updated 3 weeks ago
- Generate visual podcasts about novels using open source models ☆25 Updated 2 years ago
- Setup an MCP server in 60 seconds. ☆12 Updated 7 months ago
- ☆27 Updated last year