elenabarry / emojionalLinks
Emoji embeddings trained using their emotional content from their online dictionary meanings.
☆16Updated 3 years ago
Alternatives and similar repositories for emojional
Users that are interested in emojional are comparing it to the libraries listed below
Sorting:
- utilities for loading and running text embeddings with onnx☆44Updated 10 months ago
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- LLM plugin for models hosted by Anyscale Endpoints☆33Updated last year
- A clone of OpenAI's Tokenizer page for HuggingFace Models☆45Updated last year
- Using short models to classify long texts☆21Updated 2 years ago
- LLMs sitting on a council together to decide, by consensus, who among them is the best.☆15Updated last week
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- Dutch abusive language data☆11Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆50Updated 9 months ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆26Updated 4 months ago
- Adapter / facade for language models (OpenAI, Anthropic, Cohere, local transformers, etc)☆20Updated last year
- Create embeddings for LLM using the Nomic API☆23Updated 7 months ago
- 🦖 X—LLM: Simple & Cutting Edge LLM Finetuning☆11Updated last year
- LLM plugin for embeddings using sentence-transformers☆66Updated 2 months ago
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 3 years ago
- assign color hues to a collection of text fragments based on embeddings☆20Updated last year
- Efficiently computing & storing token n-grams from large corpora☆24Updated 8 months ago
- Jim is a simple, beautiful Jupyter notebook editor for macOS☆35Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆15Updated last year
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆13Updated 10 months ago
- Fastlaw's purpose is to replace generic word embeddings for work on supervised machine learning NLP-tasks with legal texts.☆38Updated 6 years ago
- Answer questions against collections stored in LLM using Retrieval Augmented Generation☆27Updated last year
- Demos of ChatGPT's function calling/structured data support.☆24Updated last year
- FAMIE: A Fast Active Learning Framework for Multilingual Information Extraction☆24Updated 3 years ago
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the…☆23Updated 7 months ago
- Experiments with Hugging Face 🔬 🤗☆44Updated 10 months ago
- A visual tool to interpret and understand PyTorch machine learning models☆16Updated last year
- Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.☆19Updated last year
- Tokenization across languages. Useful as preprocessing for subword tokenization.☆22Updated 2 years ago
- Using modal.com to process FineWeb-edu data☆20Updated 2 months ago