replicate / cog-tritonLinks
A cog implementation of Nvidia's Triton server
☆17Updated last year
Alternatives and similar repositories for cog-triton
Users that are interested in cog-triton are comparing it to the libraries listed below
Sorting:
- ☆11Updated last year
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆26Updated last year
- Langchain Agent utilizing OpenAI Function Calls to execute Git commands using Natural Language☆44Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Updated 9 months ago
- Apps that run on modal.com☆12Updated 4 months ago
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆46Updated 2 years ago
- An app for generating prompts☆28Updated 5 months ago
- Chat AI (↓↓Scroll to see more↓↓)☆27Updated last year
- Generate visual podcasts about novels using open source models☆25Updated 2 years ago
- Simple script to quiz LLMs☆28Updated last year
- A library to convert Pydantic models to TypedDict☆36Updated last year
- ☆33Updated 2 years ago
- Tool4AI: A model agnostic, LLM friendly router for tool/function call☆19Updated last year
- Run LLMs on Replicate with vLLM☆26Updated 6 months ago
- Code Interpreter Replica☆26Updated 2 years ago
- A fork of the fabulous BabyAGI-UI, allowing users to play with the Bard API in lieu of the OpenAI GPT 4 API.☆18Updated 2 years ago
- Deploy your autonomous agents to production grade environments with 99% Uptime Guarantee, Infinite Scalability, and self-healing.☆49Updated 3 months ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js package☆29Updated this week
- ☆40Updated 8 months ago
- ☆21Updated last year
- Proof of concept for running moshi/hibiki using webrtc☆19Updated 10 months ago
- ☆19Updated last year
- Cog wrapper for collabora/WhisperSpeech☆25Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc.☆65Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Updated 2 years ago
- ☆19Updated 2 years ago
- DSPY Experiments☆14Updated last year
- Create embeddings with infinity as serverless endpoint☆41Updated last month
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆51Updated last year