papercast-dev / papercast
A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, Semantic Scholar, or PDF with GROBID and LangChain, and listen to them as a podcast. Customize your own pipelines.
☆50 · Updated 3 months ago
Alternatives and similar repositories for papercast
Users that are interested in papercast are comparing it to the libraries listed below
- Solve Geometric & Graph Problems with Large Language Models ☆29 · Updated 2 years ago
- Explore the use of DSPy for extracting features from PDFs ☆40 · Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆35 · Updated last year
- Nougat is Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format. ☆23 · Updated last year
- Use AI to personify books, so that you can talk to them ☆18 · Updated 2 years ago
- ☆13 · Updated 10 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆47 · Updated 9 months ago
- Pytest-style test runner for langchain projects ☆25 · Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents ☆13 · Updated 10 months ago
- Median is an open-source flashcard application that leverages the power of spaced repetition and artificial intelligence to transform the… ☆22 · Updated 7 months ago
- Pandas-LLM ☆46 · Updated last year
- examples and guides to using Nomic Atlas ☆37 · Updated 2 months ago
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDF and QA on… ☆45 · Updated last year
- This repository serves as a collection of scrapers procuring and structuring various legal datasets ☆17 · Updated 2 years ago
- A tutorial on DSPy and whether automated prompt engineering lives up to the hype ☆23 · Updated last year
- Trained BERT and Word2Vec legal clause classifiers for SPACY using the Atticus Project's Open Source Contract Label Corpus ☆14 · Updated 4 years ago
- Docutron Toolkit: detection and segmentation analysis for legal data extraction over documents. ☆26 · Updated last year
- Chrome Extension for exploring Hugging Face datasets ☆50 · Updated 9 months ago
- Example implementation of Iteration of Thought - Give a star if you like the project ☆41 · Updated 6 months ago
- YouTube Transcript Cleaner is a simple web-based application that improves the readability of YouTube transcripts. ☆26 · Updated 3 months ago
- ☆1 · Updated 11 months ago
- An advanced distributed knowledge fabric for intelligent document processing, featuring multi-document agents, optimized query handling, … ☆35 · Updated 10 months ago
- A personal knowledge base that I can dump information to and help me learn ☆24 · Updated 3 weeks ago
- Structured outputs from DSPy and Jinja2 ☆23 · Updated last month
- Building a Chain of Thought RAG Model with DSPy, Qdrant and Ollama ☆32 · Updated last year
- Applying Evaluation Driven Development (EDD) to aid in the design decision of RAG pipelines ☆31 · Updated last year
- Enhance your knowledge in medical research with the help of LLM and RAG. ☆35 · Updated 8 months ago
- Web application that allows you to interact with biomedical knowledge graphs and query biomedical questions. ☆32 · Updated last year
- Python client for PromptWatch.io - LLM tracking platform ☆28 · Updated last year
- Network Analysis through LLMs for Knowledge Extraction ☆30 · Updated last year