stumpylog / tika-client
A modern Python REST client for Apache Tika server
☆13Updated this week
Alternatives and similar repositories for tika-client:
Users that are interested in tika-client are comparing it to the libraries listed below
- Remote web browser automation.☆20Updated 7 months ago
- A Model Context Protocol server for searching and analyzing arXiv papers☆50Updated last week
- FalkorDB-Browser is a visualization UI for FalkorDB.☆23Updated this week
- Spider ported to Python☆63Updated 3 months ago
- AirLLM 70B inference with single 4GB GPU☆12Updated 5 months ago
- An extremely configurable markdown reverser for Python3.☆15Updated 11 months ago
- A library for working with GBNF files☆20Updated last year
- A text analysis library for relevance and subtheme detection☆15Updated this week
- Voyage AI Official Python Library☆47Updated last month
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆17Updated 3 months ago
- Split code into semantic chunks☆11Updated 3 months ago
- Generate embeddings for images and text using CLIP with LLM☆64Updated 9 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆52Updated 11 months ago
- Paste Word, get Markdown☆14Updated 5 months ago
- ☆20Updated 11 months ago
- scraping and querying documents for LLMs☆17Updated 3 weeks ago
- ☆25Updated 4 months ago
- Tokun to can tokens☆15Updated 2 months ago
- Compatibility layer for pydantic v1/v2☆12Updated 2 weeks ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆43Updated 10 months ago
- The code that runs my blog: https://blog.gpt4.org/☆10Updated 3 years ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆39Updated last week
- Small python package to measure OCR quality and other related metrics.☆21Updated 11 months ago
- Happy Eyeballs for pre-resolved hosts☆16Updated last week
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts.☆24Updated last year
- Transform Unstructured Data into Synthetic Datasets☆24Updated 4 months ago
- Caching and distributed locks in your applications with just one or two lines. Easy to learn. Fast to code.☆27Updated this week
- Python package to access USPTO bulk data in rectangular format☆15Updated 2 years ago
- Simple, Fast, Parallel Huggingface GGML model downloader written in python☆24Updated last year
- 🚀 Kew - A Fast, Redis-backed Task Queue Manager for Python☆24Updated last week