papercast-dev / papercast
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, or PDF with GROBID and LangChain, and listen to them as a podcast. Customize your own pipelines.
☆52 Updated 7 months ago
Alternatives and similar repositories for papercast
Users who are interested in papercast are comparing it to the libraries listed below.
- Solve Geometric & Graph Problems with Large Language Models ☆33 Updated 2 years ago
- Complex data extraction and orchestration framework designed for processing unstructured documents. It integrates AI-powered document pip… ☆72 Updated last week
- Tutorial and template for a semantic search app powered by the Atlas Embedding Database, Langchain, OpenAI and FastAPI ☆113 Updated 2 years ago
- Built with Fast Dash, this app uses Embedchain, which abstracts the entire process of loading and chunking datasets, creating embeddings,… ☆65 Updated last year
- Repository for deepdoctection tutorial notebooks ☆45 Updated 4 months ago
- Use AI to personify books, so that you can talk to them 🙊 ☆18 Updated 2 years ago
- examples and guides to using Nomic Atlas ☆38 Updated 6 months ago
- Integrated LLM-based document and data Q&A with knowledge graph visualization ☆22 Updated last year
- 🖍️ Highlight text in documents ☆109 Updated 6 months ago
- MindMapper is an innovative program that empowers intelligent agents to navigate complex thought landscapes and collaboratively map their… ☆29 Updated last year
- Open-source, knowledge-grounded conversational assistant ☆14 Updated 3 months ago
- A simple tool that serves as a knowledge graph explorer utilizing the GPT 3.5 turbo model to help users explore information in an organiz… ☆59 Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆63 Updated last year
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the… ☆23 Updated 11 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and images ☆41 Updated last year
- AI assistant, based on the GPT-3.5 model by OpenAI, designed to enhance your proficiency in writing research papers. Allows you to adapt … ☆31 Updated 11 months ago
- Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines. ☆29 Updated 2 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 Updated 2 years ago
- Effortlessly extract information from unstructured data with this library, utilizing advanced AI techniques. Compose AI in customizable p… ☆85 Updated last year
- Analyzing and scoring reasoning traces of LLMs ☆46 Updated last year
- ☆14 Updated last year
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDFs and QA on… ☆47 Updated last year
- This repository serves as a collection of scrapers procuring and structuring various legal datasets ☆17 Updated 2 years ago
- ☆101 Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆13 Updated 4 years ago
- A tutorial for building autonomous agents: with LangChain and from scratch ☆31 Updated 2 years ago
- Building a Chain of Thought RAG Model with DSPy, Qdrant and Ollama ☆32 Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎 ☆49 Updated last year
- Record and replay LLM interactions for langchain ☆82 Updated last year
- Lightweight and Flexible Library for Creating Agents and Multi-Agent Conversations 🤖 ☆25 Updated 3 weeks ago