beam-cloud / beta9
Run GPU Workloads Across Multiple Clouds
☆385Updated this week
Related projects: ⓘ
- LLM fine-tuning and eval☆340Updated 6 months ago
- Text analytics for LLM apps. Cluster messages to detect use cases, outliers, power users. Detect intents and run evals with LLM (OpenAI, …☆352Updated this week
- Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastr…☆1,123Updated this week
- Data-Driven Evaluation for LLM-Powered Applications☆432Updated 2 weeks ago
- Model Manager is a Python package that simplifies the process of deploying an open source AI model to your own cloud.☆282Updated 4 months ago
- Prompt engineering, automated.☆201Updated this week
- AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.☆150Updated this week
- The only Vector tooling you'll need. Star the repo and look out for an email to try out a brand new Vector Data Exploration demo! Use the…☆195Updated this week
- ☆720Updated 5 months ago
- 🦾 Take control of your AI agents☆550Updated this week
- Fine-tuning and serving LLMs on any cloud☆85Updated 9 months ago
- BAML is a language that helps you get structured data from LLMs, with the best DX possible. Check out the promptfiddle.com playground☆1,014Updated this week
- Felafax is building AI infra for non-NVIDIA GPUs☆302Updated this week
- Multi-node production AI stack. Run the best of open source AI easily on your own servers. Create your own AI by fine-tuning open source …☆319Updated this week
- Python client library for Modal☆268Updated this week
- A realtime and indexing and structured extraction engine for Unstructured Data to build Generative AI Applications☆842Updated this week
- VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y…☆667Updated 4 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale.☆821Updated 8 months ago
- ⚡️ A fast and flexible PyTorch inference server that runs locally, on any cloud or AI HW.☆125Updated 3 months ago
- Python SDK for running evaluations on LLM generated responses☆196Updated this week
- Curated collection of AI dev tools from YC companies, aiming to serve as a reliable starting point for LLM/ML developers☆172Updated last year
- Self-hardening firewall for large language models☆254Updated 6 months ago
- Private Open AI on Kubernetes☆298Updated this week
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆330Updated this week
- Agent accuracy measurements for LLMs☆201Updated 3 months ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆432Updated 4 months ago
- cluster/scheduler health monitoring for GPU jobs on k8s☆41Updated this week
- Cedana: Access and run on compute anywhere in the world, on any provider. Migrate seamlessly between providers, arbitraging price/perform…☆53Updated 5 months ago
- Infrastructure powering E2B - Secure Runtime for AI Agents & Apps☆177Updated last week
- Build and query dynamic, temporally-aware Knowledge Graphs☆767Updated this week