☆185Oct 13, 2023Updated 2 years ago
Alternatives and similar repositories for library-of-phi
Users that are interested in library-of-phi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate textbook-quality synthetic LLM pretraining data☆508Oct 19, 2023Updated 2 years ago
- A multi-purpose LLM framework for RAG and data creation.☆629Jan 13, 2024Updated 2 years ago
- ☆11Aug 26, 2024Updated last year
- ☆21Oct 6, 2023Updated 2 years ago
- ☆52Feb 5, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Score LLM pretraining data with classifiers☆55Nov 2, 2023Updated 2 years ago
- ☆22Aug 27, 2023Updated 2 years ago
- a pipeline for using api calls to agnostically convert unstructured data into structured training data☆32Sep 22, 2024Updated last year
- Package and scripts used to build a dataset of Wikipedia articles in Markdown.☆20Sep 11, 2023Updated 2 years ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- AgentSearch is a framework for powering search agents and enabling customizable local search.☆526Apr 22, 2024Updated 2 years ago
- Generate High Quality textual or multi-modal datasets with Agents☆18Jun 7, 2023Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆569Nov 20, 2024Updated last year
- Data and preprocessing scripts for SemEval 2022 Task 2: Multilingual Idiomaticity Detection and Sentence Embedding☆15Feb 3, 2022Updated 4 years ago
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 8 months ago
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA☆13Jul 22, 2019Updated 6 years ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆45Feb 15, 2024Updated 2 years ago
- ☆124Dec 18, 2024Updated last year
- Beginner-friendly serverless LLM deployment with Replicate & fly.io☆13Sep 3, 2023Updated 2 years ago
- Distributes tasks to a network of GPUs efficiently and uploads the result.☆15Aug 13, 2024Updated last year
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆643Mar 4, 2024Updated 2 years ago
- Customizable implementation of the self-instruct paper.☆1,052Mar 7, 2024Updated 2 years ago
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆76Feb 23, 2024Updated 2 years ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- ☆126Feb 10, 2024Updated 2 years ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Jun 15, 2024Updated last year
- A minimalist Docker project to help people getting started with Node, WizardCoder, CTransformers, Python, Express and TypeScript. Ready t…☆14Jun 23, 2023Updated 2 years ago
- ☆21Jun 4, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Plug in and play implementation of " Textbooks Are All You Need", ready for training, inference, and dataset generation☆73Sep 18, 2023Updated 2 years ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆60Dec 1, 2024Updated last year
- This is a survey of research on AI scientists, AI researchers, AI engineers, and a series of AI-driven research studies☆240Oct 31, 2025Updated 6 months ago
- 100% Private & Simple. OSS 🐍 Code Interpreter for LLMs 🦙☆34Aug 29, 2023Updated 2 years ago
- ☆17Jun 20, 2023Updated 2 years ago
- The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.☆12Dec 15, 2021Updated 4 years ago
- ☆345Mar 5, 2026Updated 2 months ago