Ingest PDFs into Weaviate
☆33Jun 14, 2024Updated last year
Alternatives and similar repositories for how-to-ingest-pdfs-with-unstructured
Users that are interested in how-to-ingest-pdfs-with-unstructured are comparing it to the libraries listed below.
- Associated code for the Quickstart tutorial☆17Aug 18, 2023Updated 2 years ago
- Uses conversation history to audit important decisions and changes.☆18Jul 13, 2025Updated 9 months ago
- MoodCat😼 classifies the mood of English sentences.☆14Jun 19, 2022Updated 3 years ago
- 🧠 Workshop Notebook and assets for the Anthropic Hackathon☆12Nov 4, 2023Updated 2 years ago
- A sample pattern for running CI tests on Modal☆19Apr 12, 2025Updated last year
- Math evaluations of llama models.☆10Jan 3, 2024Updated 2 years ago
- Real-time OLTP system for credit card fraud detection using AWS API Gateway, Kinesis, and RDS PostgreSQL. Features a scalable, serverless…☆24Dec 16, 2024Updated last year
- MGnify documentation and Jupyter Lab notebooks to support downstream analysis of MGnify data (EMBL-EBI's metagenomics platform)☆15Mar 6, 2026Updated 2 months ago
- A Python library for efficient and flexible cycle-consistency training of transformer models via iterative back-translation. Memory and co…☆11Jan 13, 2025Updated last year
- ☆19Apr 4, 2023Updated 3 years ago
- Stand-alone implementation of UCD's IIIF image re-formatting tool + plugin to integrate with Mirador IIIF-compliant image viewer☆18Jul 31, 2017Updated 8 years ago
- Hebrew oriented NER spaCy pipeline☆20Aug 8, 2024Updated last year
- ☆13Mar 11, 2023Updated 3 years ago
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Feb 27, 2023Updated 3 years ago
- Miscellaneous data analysis tools and scripts for the EHRI project☆16Jan 25, 2024Updated 2 years ago
- Visual Embeddings with OpenAI and Nomic☆13Aug 7, 2023Updated 2 years ago
- Official Weaviate TypeScript Client☆99Apr 30, 2026Updated last week
- The open source implementation of "NeVA: NeMo Vision and Language Assistant"☆17Aug 26, 2023Updated 2 years ago
- A Streamlit wrapper component on react-smooth-dnd☆22Feb 15, 2024Updated 2 years ago
- The official evaluation suite and dynamic data release for MixEval.☆11Sep 23, 2024Updated last year
- Analyze a real-time IPv4 packet stream and export metrics about the data flows☆14Jan 29, 2020Updated 6 years ago
- geemap with streamlit☆19Mar 29, 2022Updated 4 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆25Jul 12, 2025Updated 9 months ago
- ☆15Nov 11, 2025Updated 5 months ago
- Mixture of Experts (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Feb 11, 2024Updated 2 years ago
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu…☆39Dec 18, 2023Updated 2 years ago
- Data sets to support the Expert Data Analysis with R workshop☆20Apr 9, 2018Updated 8 years ago
- this is a Hugo continuous delivery site☆17Feb 15, 2021Updated 5 years ago
- Improved ESIM event camera simulator☆17Oct 4, 2024Updated last year
- Work relating to the OCR wish-list item "figure out an algorithm that would separate images into sets with no handwriting, little handwri…☆20Feb 22, 2013Updated 13 years ago
- 3D Gaussian Splatting SAM☆21Mar 19, 2024Updated 2 years ago
- ☆11Sep 22, 2019Updated 6 years ago
- A sample Next.js website to showcase how you can make invite-only SPAs with Next.js, AirTable & Vercel☆19Aug 18, 2024Updated last year
- A git-style way of managing LLM chats☆31Jan 26, 2026Updated 3 months ago
- Pytest plugin that runs PyStack on slow or hanging tests.☆20Apr 8, 2026Updated last month
- ☆15Jan 7, 2023Updated 3 years ago
- Official repo of dataset-decomposition paper [NeurIPS 2024]☆21Jan 8, 2025Updated last year
- Multi-GPU Stable diffusion pipeline over webrtc☆25Apr 27, 2024Updated 2 years ago
- GGML bindings that aim to be idiomatic Rust rather than directly corresponding to the C/C++ interface☆20Sep 25, 2023Updated 2 years ago