cloud-gpus / cloud-gpus.github.io
☆54 Updated 3 months ago
Alternatives and similar repositories for cloud-gpus.github.io
Users who are interested in cloud-gpus.github.io are comparing it to the repositories listed below
- Code generation with LLMs 🔗 ☆53 Updated 2 years ago
- Vast.ai python and cli api client ☆175 Updated last week
- ☆57 Updated 7 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆65 Updated last year
- Transformer GPU VRAM estimator ☆68 Updated last year
- ☆67 Updated 10 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 Updated last year
- Machine Learning Serving focused on GenAI with simplicity as the top priority. ☆59 Updated last month
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆38 Updated 2 years ago
- Mistral-7B finetuned for function calling ☆16 Updated 2 years ago
- Gradio UI for a Cog API ☆70 Updated last year
- ☆119 Updated last year
- Web Interface for Vision Language Models Including InternVLM2 ☆25 Updated last year
- Website with current metrics on the fastest AI models. ☆42 Updated last year
- A very simple cross-service LLM API for Python ☆23 Updated 2 years ago
- 👩🤝🤖 A curated list of datasets for large language models (LLMs), RLHF and related resources (continually updated) ☆24 Updated 2 years ago
- The backend behind the LLM-Perf Leaderboard ☆11 Updated last year
- parallel fetch ☆144 Updated 2 months ago
- Public reports detailing responses to sets of prompts by Large Language Models. ☆32 Updated last year
- ☆24 Updated last year
- ☆141 Updated 2 years ago
- Simple script to quiz LLMs ☆29 Updated 2 years ago
- ☆74 Updated 2 years ago
- Examples of models deployable with Truss ☆214 Updated last week
- Python client library for improving your LLM app accuracy ☆97 Updated 11 months ago
- Web page with political compass quiz results for open LLMs ☆38 Updated 2 years ago
- A high performance batching router optimises max throughput for text inference workload ☆16 Updated 2 years ago
- Pipeline is an open source python SDK for building AI/ML workflows ☆138 Updated last year
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. ☆64 Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform ☆38 Updated 2 years ago