lmcinnes / datamapplot_examples
Hosting examples of interactive datamapplot output
☆19Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for datamapplot_examples
- a graph definition and execution library for python☆16Updated last year
- A utility for labeling clusters of text data.☆28Updated 3 years ago
- Scripts supporting the development and serving the Roots Search Tool - https://hf.co/spaces/bigscience-data/roots-search☆10Updated last year
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models☆12Updated 2 years ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread☆19Updated 7 months ago
- ☆14Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆20Updated 8 months ago
- Efficient BM25 with DuckDB 🦆☆29Updated 3 weeks ago
- GraphRag vs Embeddings☆13Updated 3 months ago
- ☆29Updated last year
- 🔎 A Prodigy plugin for evaluating spaCy pipelines☆12Updated 7 months ago
- Set-oriented Operations in Pandas☆24Updated 4 years ago
- framework for making streamcorpus data☆11Updated 7 years ago
- spaCy entry points for Curated Transformers☆24Updated last month
- stemgraphic python package for visualization of data and text☆17Updated 3 years ago
- Efficiently computing & storing token n-grams from large corpora☆15Updated last month
- Repository to allow collaboration between Cycle Labs Cloud community in support of the community.☆9Updated 2 years ago
- o1lama: Use Ollama with Llama 3.2 3B and other models locally to create reasoning chains that are similar in appearance to OpenAI's o1.☆12Updated last month
- Visualization Tool for Mapping Out Researchers using Natural Language Processing☆50Updated 6 months ago
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models.☆21Updated 2 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated last year
- A Python library for creating adversarial splits☆13Updated 2 years ago
- An open, comprehensive catalog of scholarship, connecting papers, authors, institutions, and journals.☆10Updated last year
- A visual labeling system implemented in Jupyter widgets.☆11Updated 10 months ago
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions.☆20Updated 4 years ago
- Datasette enrichment for analyzing row data using OpenAI's GPT models☆19Updated 5 months ago
- A curated list of awesome open source tools and commercial products to catalog, version, and manage data 🚀☆26Updated 2 years ago
- Model explanation provides the ability to interpret the effect of the predictors on the composition of an individual score.☆13Updated 3 years ago
- Quickly match many regexes against a string☆30Updated last week