code-kern-ai / refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
☆1,436Updated 4 months ago
Alternatives and similar repositories for refinery:
Users that are interested in refinery are comparing it to the libraries listed below
- An easy way to extract information from documents☆1,753Updated 2 years ago
- Open-source natural language enrichments at your fingertips.☆458Updated 3 months ago
- The simplest way to serve AI/ML models in production☆981Updated this week
- 🦘 Explore multimedia datasets at scale☆1,057Updated 4 months ago
- Labelling platform for text using weak supervision.☆262Updated 2 years ago
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagem…☆2,122Updated 3 months ago
- Build data pipelines, the easy way 🛠️☆4,118Updated last year
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store.☆1,888Updated this week
- 🦙 Integrating LLMs into structured NLP pipelines☆1,240Updated 3 months ago
- Lightwood is Legos for Machine Learning.☆465Updated last week
- Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand …☆1,267Updated 6 months ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆922Updated 8 months ago
- Multi-angle c(q)uestion answering☆458Updated 2 years ago
- Efficient few-shot learning with Sentence Transformers☆2,477Updated 3 weeks ago
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API.☆747Updated this week
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton☆861Updated last year
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️☆3,563Updated 7 months ago
- An open-source ML pipeline development platform☆990Updated 3 months ago
- Interactively explore unstructured datasets from your dataframe.☆1,170Updated 2 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,474Updated last week
- nannyml: post-deployment data science in python☆2,057Updated last week
- With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier.☆22Updated 2 years ago
- Blazing fast framework for fine-tuning similarity learning models☆657Updated 3 weeks ago
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞☆720Updated last year
- ML pipeline orchestration and model deployments on Kubernetes.☆435Updated last year
- Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML va…☆3,784Updated 2 months ago
- The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to s…☆698Updated last month
- A web-based document annotation tool, powered by GPT-4☆260Updated last year
- Fuzzy string matching, grouping, and evaluation.☆761Updated 2 months ago
- Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications,…☆285Updated 2 months ago