MoritzLaurer / synthetic-data-blogView external linksLinks
This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data
☆68Feb 18, 2024Updated last year
Alternatives and similar repositories for synthetic-data-blog
Users that are interested in synthetic-data-blog are comparing it to the libraries listed below
Sorting:
- Knowledge Graph Generator app☆34Apr 18, 2024Updated last year
- My Gen AI research☆11Jun 3, 2024Updated last year
- A multilingual DeBERTa model fine-tuned on political communication to classify discrete emotions☆14Nov 10, 2023Updated 2 years ago
- Format and Complete Few-Shot LLM Prompts☆19Jan 14, 2025Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation☆29Feb 5, 2025Updated last year
- Chrome Extension for exploring Hugging Face datasets 🔎☆48Sep 18, 2024Updated last year
- Serving hugging face guidance behind a server☆13Jun 14, 2023Updated 2 years ago
- 🏠🔍 Auto check for new apartments in Hamburg from various real estate provides☆16Jun 2, 2024Updated last year
- A scikit-learn compliant implementation of Monroe et al.'s Fightin' Words analysis method.☆11Mar 10, 2019Updated 6 years ago
- Workshop and collection of howtos for dealing with git / github☆16Feb 17, 2021Updated 5 years ago
- Using modal.com to process FineWeb-edu data☆20Apr 5, 2025Updated 10 months ago
- Use Hermes-2-Pro-Mistral-7B function calling with your OpenAI API compatible code.☆18May 7, 2024Updated last year
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- Online materials for Social Media Data Analysis at the University of Konstanz☆10Oct 13, 2025Updated 4 months ago
- Literature 📄 and datasets 📚 on automatic populism detection☆19Mar 15, 2025Updated 11 months ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Jul 9, 2019Updated 6 years ago
- ☆20Jan 27, 2024Updated 2 years ago
- This is the official implementation of RGNet: A Unified Retrieval and Grounding Network for Long Videos☆19Mar 3, 2025Updated 11 months ago
- A library for working with prompt templates locally or on the Hugging Face Hub.☆55Mar 5, 2025Updated 11 months ago
- Textual statistics for quanteda☆18Jul 9, 2025Updated 7 months ago
- The course introduces the use of open-source large language models (LLMs) from the Hugging Face ecosystem for research in the behavioral …☆52Jun 12, 2024Updated last year
- awesome synthetic (text) datasets☆323Jan 8, 2026Updated last month
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX☆59Feb 9, 2024Updated 2 years ago
- ☆30Mar 10, 2024Updated last year
- Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024☆2,806Updated this week
- ☆68May 26, 2024Updated last year
- ☆11Sep 16, 2024Updated last year
- ☆35May 30, 2022Updated 3 years ago
- MIT iQuHACK 2022 x Microsoft x IonQ Challenge☆10Jan 30, 2022Updated 4 years ago
- Repository related to Cranfield's AAI MSCs GDP☆11Apr 8, 2023Updated 2 years ago
- benchmarks for LLM tokenizers☆16Jan 15, 2026Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196May 6, 2024Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆136Mar 14, 2024Updated last year
- Framework for Self-Organizing Python Agents☆29Feb 4, 2024Updated 2 years ago
- ☆80Jun 5, 2024Updated last year
- ☆13Feb 8, 2019Updated 7 years ago
- Introduction to statistics using R and Rstudio☆10Mar 18, 2021Updated 4 years ago
- [ACM Multimedia 2025] "Multi-Agent System for Comprehensive Soccer Understanding"☆66Oct 31, 2025Updated 3 months ago
- Teaching materials for BDACA and related courses, after overhaul☆13Dec 9, 2025Updated 2 months ago