CambioML / uniflow-llm-based-pdf-extraction-text-cleaning-data-clustering
LLM-based text extraction from unstructured data like PDFs, Words and HTMLs. Transform and cluster the text into your desired format. Less information loss, more interpretation, and faster R&D!
☆187Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for uniflow-llm-based-pdf-extraction-text-cleaning-data-clustering
- Accurate, private and configurable document retrieval LLM☆130Updated this week
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024)☆170Updated this week
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)☆306Updated last month
- Pytorch Library for Relational Table Learning with LLMs.☆286Updated this week
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a…☆355Updated this week
- An AI agent powered by LLMs that streamlines the entire process of data analysis. 🚀☆358Updated 3 months ago
- pykoi: Active learning in one unified interface☆410Updated 9 months ago
- The official implementation of Self-Play Preference Optimization (SPPO)☆499Updated this week
- The repository for the paper titled "Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks"☆184Updated 3 weeks ago
- Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators"☆225Updated 4 months ago
- TxBKG - Knowledge Graph Generation for Any PDFs☆223Updated this week
- "AnyGraph: Graph Foundation Model in the Wild"☆187Updated 2 months ago
- A deployment, monitoring and autoscaling service towards serverless LLM serving.☆162Updated this week
- Your Automatic Prompt Engineering Assistant for GenAI Applications☆2,658Updated 7 months ago
- The official implementation of our pre-print paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs".☆153Updated last week
- 使用deepspeed从头开始训练一个LLM,经过pretrain和sft阶段,验证llm学习知识、理解语言、回答问题的能力☆155Updated 4 months ago
- DrugAssist: A Large Language Model for Molecule Optimization☆129Updated 3 weeks ago
- A recipe for online RLHF and online iterative DPO.☆436Updated 2 weeks ago
- The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models"☆93Updated this week
- A curated list of awesome leaderboard-oriented resources for foundation models☆194Updated this week
- Awesome LLMs on Device: A Comprehensive Survey☆935Updated last month
- Grimoire is All You Need for Enhancing Large Language Models☆117Updated 8 months ago
- Engy is an AI-powered development tool that generates fully functional web applications from natural language, streamlining the process f…☆257Updated 2 weeks ago
- Multilingual Corpus of Web Fiction☆216Updated 4 months ago
- Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023.☆278Updated last year
- An opensource legal prompts☆420Updated last year
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour…☆208Updated last year
- Chatbot Portal with Agent: Streamlined Workflow for Building Agent-Based Applications☆265Updated this week
- Unified KV Cache Compression Methods for Auto-Regressive Models☆805Updated this week
- Simple python WebUI for fine-tuning ChatGPT (gpt-3.5-turbo)☆206Updated last year