tleers / serverless-llm-app-factory
Beginner-friendly serverless LLM deployment with Replicate & fly.io
☆13 Updated last year
Alternatives and similar repositories for serverless-llm-app-factory:
Users who are interested in serverless-llm-app-factory are comparing it to the repositories listed below.
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 Updated last year
- Don't bug your friends with articles they'll never read. AI's have infinite attention, leverage them instead! Use the curation buddy to e… ☆22 Updated 11 months ago
- DSPY Experiments ☆14 Updated 11 months ago
- WhisperAnywhere: Effortless speech-to-text everywhere on your Mac. Use a hotkey to dictate in any app, powered by Whisper AI and Groq API… ☆28 Updated 6 months ago
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, … ☆27 Updated 7 months ago
- ☆47 Updated last year
- Embed anything. ☆29 Updated 10 months ago
- Reactive DDD with DSPy ☆22 Updated last year
- A seamless matchmaking application that is programmed with Cohere Command R+, Stanford NLP DSPy framework, Weaviate Vector store and Crew… ☆59 Updated 11 months ago
- Anthropic Computer Use with Modal Sandboxes ☆31 Updated 5 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆46 Updated 6 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through r… ☆60 Updated 8 months ago
- AI-augmented, conversational information retrieval and data exploration ☆39 Updated last year
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆65 Updated 5 months ago
- Mine-tuning is a methodology for synchronizing human and AI attention. ☆17 Updated 9 months ago
- auto fine tune of models with synthetic data ☆75 Updated last year
- Welcome to ResearchAgent ! A personal research assistant powered by GPT-3.5/GPT-4. You can ask follow up questions. Get source details o… ☆32 Updated last year
- LangChain chat model abstractions for dynamic failover, load balancing, chaos engineering, and more! ☆80 Updated last year
- LLM-powered autonomous agent with hierarchical task management ☆47 Updated last year
- ✅ Pytest-style test runner for langchain projects ☆25 Updated 2 years ago
- RAG example using DSPy, Gradio, FastAPI ☆76 Updated 11 months ago
- ☆37 Updated last year
- A couple scripts to grab stats from email ☆42 Updated 7 months ago
- UI for testing prompts across various datasets locally ☆14 Updated 5 months ago
- SDK for the Tavily search API which is tailored for LLM agents. ☆12 Updated 10 months ago
- Code interpreter support for o1 ☆32 Updated 6 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 Updated last year
- Structured outputs from DSPy and Jinja2 ☆23 Updated 3 months ago
- ☆45 Updated last year
- A list of AI memory projects ☆90 Updated 3 months ago