inferless / smaug-72b
Smaug-72B topped the Hugging Face LLM leaderboard and it’s the first model with an average score of 80, making it the world’s best open-source foundation model. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>
☆17 Updated 3 months ago
Alternatives and similar repositories for smaug-72b
Users who are interested in smaug-72b are comparing it to the repositories listed below.
- Multi-threading, Concurrency, Asynchrony, and various Execution Methods implemented in a Rust backend for bleeding-edge performance. ☆12 Updated 9 months ago
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆18 Updated 3 weeks ago
- GPT4 based personalized ArXiv paper assistant bot ☆10 Updated last year
- A collection of pre-built wrappers over common RAG systems like ChromaDB, Weaviate, Pinecone, and others! ☆35 Updated this week
- ☆47 Updated last year
- The Swarm Ecosystem ☆22 Updated last year
- Probably one of the lightest native RAG + Agent apps out there, experience the power of Agent-powered models and Agent-driven knowledge ba… ☆28 Updated 2 months ago
- OpenAI compatible API for open source LLMs ☆15 Updated last year
- 🛤️ Pathik - High-Performance Web Crawler ⚡ ☆26 Updated 4 months ago
- AI-based search done right ☆20 Updated this week
- Co-Coder is a Python package that streamlines error debugging from OpenAI ChatGPT and Google Bard by providing hints, example code, and… ☆45 Updated 2 years ago
- ☆38 Updated 2 weeks ago
- ☆11 Updated last year
- Task management for AI agents ☆15 Updated last month
- CrewAI AgentOps: Monitor your AI Agents ☆18 Updated last year
- Multi AI Agents for Investment Risk Analysis ☆14 Updated last year
- Tool4AI: A model agnostic, LLM friendly router for tool/function call ☆19 Updated 11 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min… ☆26 Updated 9 months ago
- A lightweight Python API wrapper and CLI for Perplexity’s Sonar language models. ☆62 Updated 11 months ago
- ☆54 Updated 6 months ago
- A list of language models with permissive licenses such as MIT or Apache 2.0 ☆24 Updated 5 months ago
- Use this code to access pipeline to Gemini from inside notebookLM ☆31 Updated last year
- ☆28 Updated last year
- A QT GUI for large language models ☆38 Updated last year
- AirLLM 70B inference with single 4GB GPU ☆14 Updated last month
- LangChain + LiteLLM that works ☆46 Updated 2 months ago
- ☆40 Updated last week
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images ☆40 Updated last year
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI ☆48 Updated last month
- AI Search engine ☆12 Updated last month