jehumtine / synthetic_data_generator
This script is designed to convert bodies of text into a question and answer JSON format using the GPT-4 language model. The process involves extracting text from PDF files, tokenizing the text, generating questions and answers, and then saving the results in a JSON file.
☆22Updated last year
Alternatives and similar repositories for synthetic_data_generator:
Users that are interested in synthetic_data_generator are comparing it to the libraries listed below
- An open-source Discord bot, created using LlamaIndex, that - Listens to your server conversations, continuously learns from them & answe…☆75Updated last year
- A starter app to build AI powered chat bots with Astra DB and LlamaIndex☆73Updated last year
- ☆45Updated 10 months ago
- A CrewAI script that uses 4 agents and 4 tools to research your next steps for your personal brand☆35Updated last year
- ☆37Updated last year
- RAG example using DSPy, Gradio, FastAPI☆78Updated last year
- A RAG powered web search with Tavily, LangChain, Mistral AI ( leveraging groq LPU) . The full stack web app build in Databutton.☆36Updated last year
- Function Calling Mistral 7B. Learn how to make functions call for open source LLMs.☆48Updated last year
- Powerful Auto Research powered by LangChain, and Anthropic.☆31Updated 9 months ago
- DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search …☆115Updated last year
- This repository demonstrates how a simple memory file can be incorporated into an Agents routine using the AutoGen framework.☆48Updated last year
- RAG Tool using Haystack, Mistral, and Chainlit. All open source stack on CPU.☆23Updated last year
- Question Answer Generation App using Mistral 7B, Langchain, and FastAPI.☆65Updated last year
- ☆57Updated last year
- This is a User Interface built for Autogen using ChainLit.☆115Updated 8 months ago
- ☆38Updated last year
- Tutorial for DSPy☆23Updated 11 months ago
- Python code which creates a semantic search bot over any available corpus☆17Updated last year
- Virtual focus group with custom personas, product details, and final analysis created with AutoGen, Ollama/Llama3, and Streamlit.☆45Updated 9 months ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆34Updated last year
- This repository holds enhanced Agents, built for the Microsoft AutoGen Framework. Debuting with a MemoryEnabledAgent with improvements in…☆112Updated last year
- Democratizing Function Calling Capabilities for Open-Source Language Models☆40Updated 11 months ago
- Use langchain to create a model that returns answers based on online PDFs that have been read.☆28Updated 9 months ago
- A collection of example AI programs built using DSPy and maitained by the Langtrace AI team.☆26Updated 5 months ago
- ☆37Updated last year
- ☆45Updated last year
- ☆59Updated last year
- An intellligent AI assistant that can do anything!☆53Updated 11 months ago
- Simple example of autonomous research ran in parallel from my Aetherius Ai Assistant project. Uses Openai's GPT-3.5, GPT-4, and Microsof…☆17Updated last year
- A library that allows interacting with Replit's code-exec API☆24Updated 4 months ago