code-kern-ai / refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
☆1,447 Updated 6 months ago
Alternatives and similar repositories for refinery
Users who are interested in refinery are comparing it to the libraries listed below.
- Open-source natural language enrichments at your fingertips. ☆458 Updated 5 months ago
- The simplest way to serve AI/ML models in production ☆1,012 Updated this week
- Blazing fast framework for fine-tuning similarity learning models ☆656 Updated 2 months ago
- A low code Machine Learning personalized ranking service for articles, listings, search results, recommendations that boosts user engagem… ☆2,137 Updated 5 months ago
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store. ☆1,916 Updated last month
- A scalable general purpose micro-framework for defining dataflows. THIS REPOSITORY HAS BEEN MOVED TO www.github.com/dagworks-inc/hamilton ☆860 Updated last year
- A Simple Bulk Labelling Tool ☆586 Updated 5 months ago
- An easy way to extract information from documents ☆1,764 Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,552 Updated last week
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞 ☆720 Updated last year
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 Updated 9 months ago
- A Repo For Document AI ☆2,851 Updated last week
- Labelling platform for text using weak supervision. ☆262 Updated 2 years ago
- Natural Intelligence is still a pretty good idea. ☆815 Updated 11 months ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,586 Updated 3 weeks ago
- Neural Search ☆332 Updated last year
- 🦘 Explore multimedia datasets at scale ☆1,060 Updated 6 months ago
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API. ☆760 Updated this week
- A web-based document annotation tool, powered by GPT-4 ☆261 Updated last year
- OCR, Archive, Index and Search: Implementation agnostic OCR framework. ☆222 Updated last year
- Fuzzy string matching, grouping, and evaluation. ☆765 Updated last month
- With sequence-learn, you can build models for named entity recognition as quickly as if you were building a sklearn classifier. ☆22 Updated 2 years ago
- Official Python SDK for Kern AI refinery. ☆19 Updated 7 months ago
- 📄 ⚙️ ETL processes for medical and scientific papers ☆385 Updated last week
- Build and share data reports in 100% Python ☆1,395 Updated last year
- Transforms PDF, Documents and Images into Enriched Structured Data ☆5,971 Updated last year
- Doubt your data, find bad labels. ☆513 Updated 11 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,505 Updated 2 months ago
- Software that makes labeling PDFs easy. ☆415 Updated last year
- LLM(😽) ☆1,675 Updated 4 months ago