This script is designed to convert bodies of text into a question and answer JSON format using the GPT-4 language model. The process involves extracting text from PDF files, tokenizing the text, generating questions and answers, and then saving the results in a JSON file.
☆24Aug 22, 2023Updated 2 years ago
Alternatives and similar repositories for synthetic_data_generator
Users that are interested in synthetic_data_generator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l…☆23Dec 6, 2023Updated 2 years ago
- 🦄 Use GPT to generate and label data☆25Apr 30, 2024Updated 2 years ago
- GPT4MAX is a free AI chatbot app built with Next.js, the Vercel AI SDK, and OpenAI GPT-4 Turbo.☆18May 10, 2024Updated 2 years ago
- The code used to evaluate embedding models on the Massive Legal Embedding Benchmark (MLEB).☆38Feb 24, 2026Updated 3 months ago
- ☆37Feb 5, 2026Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 基于langchain和chatglm6b构建的智能问答系统,支持自定义语料☆10Jun 25, 2023Updated 2 years ago
- Developing a legal research tool leveraging ChatGPT / GPT-4☆14Mar 10, 2024Updated 2 years ago
- Tool to apply Legal Matter Specification Standard (LMSS) to documents☆12Aug 15, 2024Updated last year
- Vector search with Pinecone and Openai to search through contract law textbook. If downloaded, remeber to install all dependencies. Refer…☆11Mar 30, 2023Updated 3 years ago
- Simple example of autonomous research ran in parallel from my Aetherius Ai Assistant project. Uses Openai's GPT-3.5, GPT-4, and Microsof…☆15May 11, 2023Updated 3 years ago
- Probe how GPT-n performs on statutory reasoning☆10Sep 17, 2024Updated last year
- A question answering AI tool for the content from the PDF files of the Civil Code, Criminal Code, Code of Criminal Procedure, Labor Stand…☆11May 14, 2023Updated 3 years ago
- AI Pull-Request Reviewer Companion (in the command line)☆13Apr 11, 2024Updated 2 years ago
- Synthetic QA generation for long documents.☆16Jul 22, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆18Oct 16, 2024Updated last year
- 基于pytorch_rnn的古诗词生成☆11Oct 24, 2021Updated 4 years ago
- Chatbot_CN项目的知识图谱模块☆12Mar 27, 2020Updated 6 years ago
- An RAG (retrieval augmented generation) app which iterates through a PDF document and can answer user's questions based on the document u…☆16Mar 23, 2025Updated last year
- ☆17Jul 16, 2024Updated last year
- ☆10Aug 28, 2018Updated 7 years ago
- ☆15Aug 3, 2024Updated last year
- Experiments codes for COLING '22 paper "Augmenting Legal Judgment Prediction with Contrastive Case Relations"☆11Apr 25, 2024Updated 2 years ago
- Graph QABot Demo| 图谱问答案例☆15Apr 11, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Changes in this fork has been merged to upstream.☆16Jun 10, 2025Updated 11 months ago
- ☆19Jan 15, 2026Updated 4 months ago
- AI医生☆10May 27, 2020Updated 5 years ago
- 爬取去哪网热门景点信息,抽取三元组信息,构建中文知识图谱☆13Apr 27, 2021Updated 5 years ago
- ☆29Sep 10, 2025Updated 8 months ago
- ☆49Jun 13, 2024Updated last year
- A OpenAI GPT3 based QnA agent for documents and links☆12Jul 11, 2023Updated 2 years ago
- A simple NextJS app that streams Langserve (python) streamings on NextJS frontend, using a hook to make it clean on components, and api c…☆10Mar 12, 2024Updated 2 years ago
- Waste Segregation @HackBash2021 : ML based deployed waste segregation web app☆12Apr 8, 2021Updated 5 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Simple online editor of math formulas based on LaTeX syntax. Contains table of popular equations and chars for easy work with it to help …☆10Sep 13, 2019Updated 6 years ago
- ES5 - Javascript design pattern examples☆10Mar 28, 2017Updated 9 years ago
- Get up and running with Llama 2 and other large language models locally☆15Updated this week
- ☆17Mar 21, 2024Updated 2 years ago
- ☆13Oct 10, 2024Updated last year
- JIRA Automation Using GPT☆25May 15, 2023Updated 3 years ago
- ☆19Aug 1, 2024Updated last year