afriedman412 / sayswho
Quote identification, attribution and resolution.
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for sayswho
- Search through Facebook Research's PyTorch BigGraph Wikidata-dataset with the Weaviate vector search engine☆31Updated 2 years ago
- Extract networks of entities from journalistic reporting☆47Updated last year
- 📜 Dehyphenation of broken text (mainly German), i.e., extracted from a PDF☆38Updated 2 years ago
- Small python package to measure OCR quality and other related metrics.☆21Updated 9 months ago
- Highly concurrent and fast content processing for Mighty Inference Server☆10Updated last year
- assign color hues to a collection of text fragments based on embeddings☆20Updated 5 months ago
- ☆68Updated 8 months ago
- Local emulator for Hugging Face Inference Endpoints customer handlers☆24Updated last year
- ☆20Updated 9 months ago
- Experiments with Hugging Face 🔬 🤗☆45Updated 3 months ago
- A raspberry pi 64bit image with spacy and neuralcoref pre-installed☆21Updated 5 years ago
- A BERT-based application for reusable text classification at scale☆37Updated last year
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆13Updated last year
- Metadata Extractor & Loader (MEL) ■ The NLP-NER Toolkit (TNNT)☆22Updated last year
- ☆11Updated 7 months ago
- A Datasette plugin providing an MLOps platform to train, eval and predict machine learning models☆16Updated this week
- ☆29Updated last year
- Experiments with generating GPT-2 fanfiction on specified topics.☆11Updated 5 years ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆14Updated 10 months ago
- Run embedding models using ONNX☆23Updated 9 months ago
- Python based Wikidata framework for easy dataframe extraction☆39Updated last year
- Libraries, Archives and Museums (LAM)☆82Updated 2 years ago
- 🌸 Train floret vectors☆18Updated last year
- Using embeddings compressed by Product Quantization, in Javascript☆30Updated last year
- Efficient few-shot learning with cross-encoders.☆40Updated 9 months ago
- Implementation for WikiCheck API, an open-source Wikipedia-based fact-checking API. The project is done in cooperation with Wikimedia Fou…☆22Updated 5 months ago
- Interactive Visualization Interface for Multidimensional Datasets☆52Updated 2 weeks ago
- Domain-specific language for extracting structured data from HTML documents☆52Updated 3 weeks ago
- Add website scraping abilities to Datasette☆61Updated last year
- Dockerfile and web server for running GPT-J-6B on AWS GPU instances☆18Updated 3 years ago