jina-ai / example-wikipedia-recommendation
An example of graph embeddings for wikipedia page recommendations
☆11Updated 4 years ago
Alternatives and similar repositories for example-wikipedia-recommendation
Users who are interested in example-wikipedia-recommendation are comparing it to the libraries listed below.
- Streamlit demo app to demonstrate the features of transformers interpret with multiple models.☆25Updated 4 years ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- Meme search engine built with Jina neural search framework. Search with captions or image files to find matching memes.☆24Updated 3 years ago
- NeatText: a simple NLP package for cleaning textual data and text preprocessing☆75Updated 2 years ago
- From Dataset Labeling, Entity Extraction to production Knowledge Graph Deployment: The Power of NLP and LLMs Combined.☆12Updated last year
- [WIP] Behold, semantic-search, built over sentence-transformers to make it easy for search engineers to evaluate, optimise and deploy mod…☆15Updated 2 years ago
- Fast fuzzy text search☆11Updated 2 years ago
- Notebooks on using transformers for sequential recommendation tasks☆17Updated 3 years ago
- Code for "Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking" (https://arxiv.org/abs/2…☆14Updated 2 years ago
- Search PDFs using Jina, DocArray and Jina Hub☆57Updated 3 years ago
- A collection of utilities for writing labeling functions, transformation functions, and slicing functions.☆22Updated 5 years ago
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆21Updated 2 years ago
- ☆25Updated 3 years ago
- ☆10Updated 3 years ago
- An open-source NLP library: fast text cleaning and preprocessing☆23Updated 4 years ago
- A set of NLP tools created during my medium NLP Explanation series.☆31Updated last year
- TextGraphs + LLMs + graph ML for entity extraction, linking, ranking, and constructing a lemma graph☆25Updated last year
- Automatically transform all categorical, date-time, NLP variables to numeric in a single line of code for any data set any size.☆65Updated last year
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. I…☆25Updated 3 years ago
- ☆28Updated last year
- Interactive tree-maps with SBERT & Hierarchical Clustering (HAC)☆30Updated last year
- Various Jupyter notebooks about Common Crawl data☆62Updated 2 months ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- Transforming textual descriptions into process models using deep learning☆15Updated 6 years ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆40Updated 6 years ago
- semantically distinct key phrase extraction using hilbert hashes.☆50Updated 3 years ago
- Code that accompanies the PyData New York (2022) talk: Addressing the sensitivity of Large language models☆13Updated 3 years ago
- Convert english sentences to cypher☆31Updated 5 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 3 years ago