replicate / cog-tritonLinks
A cog implementation of Nvidia's Triton server
☆17Updated last year
Alternatives and similar repositories for cog-triton
Users that are interested in cog-triton are comparing it to the libraries listed below
Sorting:
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- ☆11Updated last year
- ☆19Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- Apps that run on modal.com☆12Updated 2 months ago
- Chat AI (↓↓Scroll to see more↓↓)☆27Updated last year
- Code Interpreter Replica☆25Updated 2 years ago
- An Infr app that helps you replay & talk to everything you've ever seen.☆15Updated 2 years ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆29Updated this week
- ☆15Updated last year
- An app for generating prompts☆27Updated 3 months ago
- a suite of finetuned LLMs for atomically precise function calling 🧪☆17Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆11Updated 2 years ago
- The Swarm Ecosystem☆26Updated last year
- A library to convert Pydantic models to TypedDict☆36Updated last year
- Langchain Agent utilizing OpenAI Function Calls to execute Git commands using Natural Language☆44Updated 2 years ago
- Simple script to quiz LLMs☆27Updated last year
- Using modal.com to process FineWeb-edu data☆20Updated 7 months ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆48Updated last month
- AI Assistant that can get stock prices☆46Updated last year
- A fork of the fabulous BabyAGI-UI, allowing users to play with the Bard API in lieu of the OpenAI GPT 4 API.☆18Updated 2 years ago
- ☆29Updated this week
- ☆15Updated last year
- A function to do all☆35Updated last year
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 9 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated last year
- 🧬 [WIP] Lobe Flow - an open-source ai powered node flow editor☆22Updated last year
- A daemon that makes a desktop OS accessible to AI agents☆36Updated 6 months ago
- ☆34Updated last year