replicate / cog-tritonLinks
A cog implementation of Nvidia's Triton server
☆17Updated last year
Alternatives and similar repositories for cog-triton
Users that are interested in cog-triton are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 6 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 11 months ago
- Code for training & inference with FLAN family of models☆17Updated 2 years ago
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Chat AI (↓↓Scroll to see more↓↓)☆27Updated last year
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆29Updated last week
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Cog wrapper for collabora/WhisperSpeech☆24Updated last year
- ☆32Updated 2 years ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- Convert an audio file to a waveform video☆11Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆16Updated 3 weeks ago
- An app for generating prompts☆27Updated 2 months ago
- A daemon that makes a desktop OS accessible to AI agents☆34Updated 5 months ago
- Various agents from all of the top agent frameworks to integrate into swarms! Langchain, Griptape, CrewAI, and more!☆15Updated this week
- Collections of Actions for Custom GPTs (some created by Captain Action)☆10Updated last year
- Voice agent using LiveKit (orchestration), Cartesia (TTS), OpenAI (LLM), and Deepgram (STT)☆17Updated this week
- AI Assistant that can get stock prices☆46Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆23Updated 7 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆47Updated 3 weeks ago
- ☆19Updated last year
- Code Interpreter Replica☆25Updated 2 years ago
- Fooocus App deployment using Modal.☆14Updated last year
- Apps that run on modal.com☆12Updated last month
- A feed of trending repos/models from GitHub, Replicate, HuggingFace, and Reddit.☆136Updated 4 months ago
- Embedding models from Jina AI☆65Updated last year
- A library to convert Pydantic models to TypedDict☆36Updated last year