irthomasthomas / llm-cerebras
llm plugin for Cerebras fast inference API
☆24Updated last month
Alternatives and similar repositories for llm-cerebras:
Users that are interested in llm-cerebras are comparing it to the libraries listed below
- Streamable multi-format serialization with schema☆22Updated 4 months ago
- Embedding models from Jina AI☆58Updated last year
- A text-to-SQL prototype on the northwind sqlite dataset☆12Updated 6 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated 11 months ago
- Back-of-the-envelope stuffs in Python☆20Updated last year
- Concatenated documentation for use with LLMs☆19Updated this week
- Load GitHub repository contents as LLM fragments☆12Updated this week
- ☆27Updated 7 months ago
- HTTP responses powered by AI☆16Updated 2 weeks ago
- ☆16Updated this week
- Modular, open source LLMOps stack that separates concerns: LiteLLM unifies LLM APIs, manages routing and cost controls, and ensures high-…☆92Updated last month
- A Python framework for building AI agent systems with robust task management in the form of a graph execution engine, inference capabilit…☆21Updated this week
- Slipstream provides a data-flow model to simplify development of stateful streaming applications.☆27Updated this week
- Scrape details about Code Interpreter to track any changes☆63Updated this week
- Tools for LLM agents.☆62Updated 3 months ago
- Create embeddings for LLM using the Nomic API☆23Updated 4 months ago
- Run Llama 2 using MLX on macOS☆33Updated last year
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆80Updated 4 months ago
- Analyzing hacker news in real-time with Bytewax and Proton☆39Updated last year
- Python client for accessing the turbopuffer API.☆41Updated last week
- Access the Cohere Command R family of models☆36Updated 2 weeks ago
- LLM plugin for pulling content from Hacker News☆75Updated this week
- Parallelism and preemptive concurrency for sporadic workloads☆46Updated 4 months ago
- Some tough questions to test new models.☆27Updated 11 months ago
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆19Updated 9 months ago
- Hierarchical topic segmentation of meeting transcripts using embeddings and divisive clustering.☆52Updated 8 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last week
- Multi-model transactional embedded database☆68Updated 4 months ago
- download and view the contents of a GitHub repository or a ZIP file as a single text file☆42Updated 8 months ago
- Using langchain, deeplake and openai to create a Q&A on the Mojo lang programming manual☆22Updated last year