code-kern-ai / refinery
The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
☆1,470 Updated last year
Alternatives and similar repositories for refinery
Users who are interested in refinery are comparing it to the libraries listed below.
- The simplest way to serve AI/ML models in production ☆1,113 Updated this week
- Open-source natural language enrichments at your fingertips. ☆462 Updated last year
- An open-source ML pipeline development platform ☆998 Updated last year
- 🦘 Explore multimedia datasets at scale ☆1,061 Updated last year
- The Virtual Feature Store. Turn your existing data infrastructure into a feature store. ☆1,963 Updated 7 months ago
- The fastest ⚡️ way to build data pipelines. Develop iteratively, deploy anywhere. ☁️ ☆3,621 Updated 8 months ago
- Blazing fast framework for fine-tuning similarity learning models ☆662 Updated last month
- An easy way to extract information from documents ☆1,787 Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks ☆926 Updated last year
- 1 line for thousands of State of The Art NLP models in hundreds of languages. The fastest and most accurate way to solve text problems. ☆958 Updated last year
- Scalable identity resolution, entity resolution, data mastering and deduplication using ML ☆1,146 Updated last week
- Backend that powers the dataset viewer on Hugging Face dataset pages through a public API. ☆844 Updated this week
- Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand … ☆1,374 Updated 2 months ago
- What's in your data? Extract schema, statistics and entities from datasets ☆1,541 Updated 4 months ago
- AI code-writing assistant that understands data content ☆2,290 Updated 2 years ago
- Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax. ☆968 Updated 11 months ago
- Curated list of open source tooling for data-centric AI on unstructured data. ☆734 Updated 2 years ago
- 📊 Semantic search for headlines and story text ☆359 Updated 2 years ago
- A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rew… ☆2,135 Updated last week
- Open Source Data Annotation & Labeling Tools ☆671 Updated 3 months ago
- Labelling platform for text using weak supervision. ☆260 Updated 3 years ago
- A Simple Bulk Labelling Tool ☆598 Updated 6 months ago
- Efficient few-shot learning with Sentence Transformers ☆2,678 Updated last month
- Visualise your Kedro data and machine-learning pipelines and track your experiments. ☆739 Updated last week
- Neural Search ☆334 Updated last year
- 🐶 A tool to package, serve, and deploy any ML model on any platform. Archived to be resurrected one day🤞 ☆719 Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,847 Updated last week
- Explore and understand your training and validation data. ☆852 Updated last year
- Kuwala is the no-code data platform for BI analysts and engineers enabling you to build powerful analytics workflows. We are set out to b… ☆805 Updated 3 years ago
- Interactively explore unstructured datasets from your dataframe. ☆1,245 Updated this week