This script is designed to convert bodies of text into a question and answer JSON format using the GPT-4 language model. The process involves extracting text from PDF files, tokenizing the text, generating questions and answers, and then saving the results in a JSON file.
☆24Aug 22, 2023Updated 2 years ago
Alternatives and similar repositories for synthetic_data_generator
Users that are interested in synthetic_data_generator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LAWLIA is an open-source computational legal framework designed to revolutionize legal reasoning and analysis. It combines the power of l…☆22Dec 6, 2023Updated 2 years ago
- 🦄 Use GPT to generate and label data☆25Apr 30, 2024Updated 2 years ago
- GPT4MAX is a free AI chatbot app built with Next.js, the Vercel AI SDK, and OpenAI GPT-4 Turbo.☆18May 10, 2024Updated last year
- A simple GPT-3 interface to automate core legal writing tasks☆13Mar 8, 2023Updated 3 years ago
- Private-AI is an innovative AI project designed for asking questions about your documents using powerful Large Language Models (LLMs). Th…☆24Feb 26, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Your own GPT-powered Personal Assistant to whom you can ORDER or INSTRUCT to do some task or search for something using your VOICE comman…☆20Jul 23, 2023Updated 2 years ago
- 基于langchain和chatglm6b构建的智能问答系统,支持自定义语料☆10Jun 25, 2023Updated 2 years ago
- Developing a legal research tool leveraging ChatGPT / GPT-4☆14Mar 10, 2024Updated 2 years ago
- Vector search with Pinecone and Openai to search through contract law textbook. If downloaded, remeber to install all dependencies. Refer…☆11Mar 30, 2023Updated 3 years ago
- a QA bot on contents of given docs 用所给文档进行问答的聊天机器人☆12Apr 20, 2023Updated 3 years ago
- Simple example of autonomous research ran in parallel from my Aetherius Ai Assistant project. Uses Openai's GPT-3.5, GPT-4, and Microsof…☆15May 11, 2023Updated 2 years ago
- Probe how GPT-n performs on statutory reasoning☆10Sep 17, 2024Updated last year
- ☆16Jun 18, 2024Updated last year
- A question answering AI tool for the content from the PDF files of the Civil Code, Criminal Code, Code of Criminal Procedure, Labor Stand…☆11May 14, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- AI Pull-Request Reviewer Companion (in the command line)☆13Apr 11, 2024Updated 2 years ago
- Build a Full stack Q&A Chatbot with Langchain, and LLM Models on Amazon Sagemaker☆12Nov 10, 2023Updated 2 years ago
- An automated E2E natural language test runner built on Claude Code☆23Aug 19, 2025Updated 8 months ago
- Synthetic QA generation for long documents.☆16Jul 22, 2022Updated 3 years ago
- pytorch+bert实现的意图识别与槽位填充☆11May 30, 2023Updated 2 years ago
- 基于pytorch_rnn的古诗词生成☆11Oct 24, 2021Updated 4 years ago
- A zero-configuration (no registry.json required), shadcn add / open in v0 compatible registry builder. With amazing visual feedback like …☆27Updated this week
- Nano Bots for Obsidian: small, AI-powered bots that can be easily shared as a single file, designed to support multiple providers such as…☆15Jan 13, 2024Updated 2 years ago
- ☆64Jan 28, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Chatbot_CN项目的知识图谱模块☆12Mar 27, 2020Updated 6 years ago
- An RAG (retrieval augmented generation) app which iterates through a PDF document and can answer user's questions based on the document u…☆16Mar 23, 2025Updated last year
- ☆17Jul 16, 2024Updated last year
- ☆15Aug 3, 2024Updated last year
- Experiments codes for COLING '22 paper "Augmenting Legal Judgment Prediction with Contrastive Case Relations"☆11Apr 25, 2024Updated 2 years ago
- Graph QABot Demo| 图谱问答案例☆15Apr 11, 2023Updated 3 years ago
- Changes in this fork has been merged to upstream.☆16Jun 10, 2025Updated 10 months ago
- ☆17Jan 15, 2026Updated 3 months ago
- WebRTC-HTTP Ingestion Protocol (WHIP) in Rust☆14Dec 17, 2025Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 本项目主要研究大模型在单独的法律数据集上的效果,现在支持belle和chatglm相关的模型训练,预测,验证和在线部署, 另外增加爬虫代码,langchain,结合数据库预测等功能。☆12Jul 16, 2023Updated 2 years ago
- AI医生☆11May 27, 2020Updated 5 years ago
- 爬取去哪网热门景点信息,抽取三元组信息,构建中文知识图谱☆13Apr 27, 2021Updated 5 years ago
- ☆29Sep 10, 2025Updated 7 months ago
- KL3M training data collection and preprocessing☆21Apr 14, 2025Updated last year
- ☆49Jun 13, 2024Updated last year
- Waste Segregation @HackBash2021 : ML based deployed waste segregation web app☆12Apr 8, 2021Updated 5 years ago