alfredodeza / synthetic-datasets
Augment datasets using Large Language Models
β16Updated 8 months ago
Related projects β
Alternatives and complementary repositories for synthetic-datasets
- ππ§ A minimalistic tool to fine-tune your LLMsβ17Updated last year
- Projects completed under LinuxWorld Informatics Ltd. - MLOps Training.β12Updated 4 years ago
- This repository implements DSPy programs to tasks in Indian Languagesβ11Updated 10 months ago
- Convert any image into a Region Adjacency Graph (RAG)β12Updated 4 years ago
- Speech to Speech conversation using the OpenAI RealTime API in Python πβ19Updated this week
- Taipy Demo of a Realtime Dashboard of Air Pollution around a Factoryβ15Updated 2 months ago
- Example Code to Supplement the Label Studio Blogβ16Updated last week
- Visual Embeddings with OpenAI and Nomicβ12Updated last year
- A chatbot made using the Chatterbot library in Python and locally hosted using Streamlit. Dataset used were collected during ConvAI2 compβ¦β15Updated 3 years ago
- MirrorDataGenerator is a python tool that generates synthetic data based on user-specified causal relations among features in the data. Iβ¦β19Updated 2 years ago
- Using ChatGPT to build a Kedro ML pipeline and Streamlit frontendβ30Updated last year
- A few end to end examples that use data-describeβ16Updated last year
- β18Updated 2 months ago
- Unstract's interface to LLMs, Embeddings and VectorDBs.β16Updated 3 months ago
- Finetune Llama 2 on Colab for free on your own data: step-by-step tutorialβ20Updated 6 months ago
- Causal Agent based on Large Language Modelβ30Updated 3 months ago
- Sample demonstrating deployment of Pytorch models through ONNX within Azure Functionsβ12Updated 7 months ago
- LLM plugin for models hosted by Anyscale Endpointsβ32Updated 7 months ago
- Entity resolution, also known as Data Matching or Record linkage is the task of finding a data set that refer to the same or similar realβ¦β22Updated last month
- This repository contains different algorithms that are used to build taxonomy from text corpus.β8Updated 3 years ago
- A library to create and load tfrecord files as tf.data.Datasetβ9Updated 6 months ago
- This repository is part of a blog post that guides users through creating a visual search application using Amazon SageMaker and Amazon Eβ¦β10Updated last year
- β41Updated last month
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scrollβ¦β26Updated 6 months ago
- The goal is to pilot Microsoft Cognitive Services to unlock the strategic value of UN unstructured content by building on AI and semanticβ¦β12Updated last year
- β12Updated last year
- Azure Machine Learning - MLOps Python SDKv2β10Updated last year
- Data extraction from documents with ML (research and experimental code repo)β16Updated last year
- The official repo of our research work "Interactive Editing for Text Summarization".β22Updated last year
- Operations Research Algorithmsβ17Updated 8 months ago