CambioML / uniflow-llm-based-pdf-extraction-text-cleaning-data-clustering
LLM-based text extraction from unstructured data such as PDFs, Word documents, and HTML pages. Transform and cluster the text into your desired format. Less information loss, more interpretation, and faster R&D!
☆168 · Updated 3 months ago
Related projects:
- Awesome LLMs on Device: A Comprehensive Survey ☆613 · Updated this week
- An AI agent powered by LLMs that streamlines the entire process of data analysis. 🚀 ☆319 · Updated last month
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (https://stark.stanford.edu/) ☆282 · Updated last month
- Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language m… ☆634 · Updated this week
- The official implementation of Self-Play Preference Optimization (SPPO) ☆461 · Updated last month
- The Official Repo of ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code (https://a… ☆350 · Updated last week
- AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval (https://arxiv.org/abs/2406.11200) ☆140 · Updated last month
- PyTorch library for Relational Table Learning with LLMs. ☆270 · Updated last week
- Accurate, private, and configurable document retrieval LLM ☆118 · Updated 3 weeks ago
- Code for paper "GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators" ☆192 · Updated last month
- A deployment, monitoring, and autoscaling service for serverless LLM serving. ☆152 · Updated last week
- TxBKG - Knowledge Graph Generation for Any PDFs ☆224 · Updated 9 months ago
- Open-source legal prompts ☆413 · Updated last year
- A curated list of awesome leaderboard-oriented resources for foundation models ☆183 · Updated this week
- A multimodal agent framework for solving complex tasks ☆505 · Updated last week
- Simple Python WebUI for fine-tuning ChatGPT (gpt-3.5-turbo) ☆206 · Updated last year
- pykoi: Active learning in one unified interface ☆407 · Updated 7 months ago
- Your Automatic Prompt Engineering Assistant for GenAI Applications ☆2,623 · Updated 4 months ago
- Train an LLM from scratch with DeepSpeed, through the pretraining and SFT stages, to verify the LLM's ability to learn knowledge, understand language, and answer questions ☆145 · Updated 2 months ago
- A recipe for online RLHF. ☆376 · Updated 3 weeks ago
- Easiest and laziest way to build multi-agent LLM applications. ☆821 · Updated this week
- The Official Implementation of PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling ☆480 · Updated last month
- Multilingual Corpus of Web Fiction ☆211 · Updated 2 months ago
- Grimoire is All You Need for Enhancing Large Language Models ☆115 · Updated 6 months ago
- Code for "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts" ☆754 · Updated 2 weeks ago
- LLM-And-More is a professional, plug-and-play LLM trainer and application builder that guides you through the complete LLM workflow from… ☆452 · Updated 2 months ago
- Code and Checkpoints for "Generate rather than Retrieve: Large Language Models are Strong Context Generators" in ICLR 2023. ☆276 · Updated last year
- Harnessing the Power of AI to Navigate the Information Age – Uncovering Truth, Promoting Transparency, and Championing Fact-Based Discour… ☆210 · Updated last year
- Improve Llama-2's proficiency in comprehension, generation, and translation of Chinese. ☆532 · Updated 5 months ago
- ☆356 · Updated 4 months ago