jehumtine / synthetic_data_generator

This script is designed to convert bodies of text into a question and answer JSON format using the GPT-4 language model. The process involves extracting text from PDF files, tokenizing the text, generating questions and answers, and then saving the results in a JSON file.
20Updated last year

Alternatives and similar repositories for synthetic_data_generator:

Users that are interested in synthetic_data_generator are comparing it to the libraries listed below