Documenting large text datasets πΌοΈ π
β14Dec 17, 2024Updated last year
Alternatives and similar repositories for data-portraits
Users that are interested in data-portraits are comparing it to the libraries listed below
Sorting:
- β13Oct 20, 2022Updated 3 years ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Modelsβ17Jul 17, 2024Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from eβ¦β28May 23, 2024Updated last year
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in β¦β30Nov 25, 2021Updated 4 years ago
- Planβ is a platform for creating and publishing digital planning servicesβ17Updated this week
- State-of-the-art paired encoder and decoder models (17M-1B params)β59Aug 6, 2025Updated 7 months ago
- This repository contains the Parasol processor, which enables next-generation privacy preserving applications. Users can run arbitrary coβ¦β11Feb 25, 2026Updated last week
- An unofficial Python 3 version of jemdoc.β11Feb 8, 2026Updated 3 weeks ago
- An Educational Framework Based on PyTorch for Deep Learning Education and Explorationβ10Dec 24, 2023Updated 2 years ago
- https://icml.cc/virtual/2023/poster/24354β10Aug 15, 2023Updated 2 years ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generatiβ¦β10Sep 23, 2023Updated 2 years ago
- [EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agentsβ16Sep 16, 2025Updated 5 months ago
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.β25Oct 20, 2025Updated 4 months ago
- A Model Agnostic function to directly remove specified layers from the LLMβ10May 23, 2024Updated last year
- Python platform for parallel Surrogate-Based Optimizationβ12Nov 27, 2024Updated last year
- β10Oct 2, 2024Updated last year
- Reading comprehension based question-answering model for news articles.β11Jun 22, 2022Updated 3 years ago
- Guide to interviewing for industry machine learning roles (data/applied/research scientist, ML engineer, etc).β11Dec 28, 2022Updated 3 years ago
- Machine Learning for Mathematical Formalizationβ11Jul 20, 2024Updated last year
- β15Sep 7, 2025Updated 6 months ago
- Direct transcription of an optimal control problem and resolutionβ12Updated this week
- β24Feb 18, 2026Updated 2 weeks ago
- A RAG that can scale π§π»βπ»β11May 28, 2024Updated last year
- Python package for measuring memorization in LLMs.β183Jul 16, 2025Updated 7 months ago
- Whispers in the Machine: Confidentiality in Agentic Systemsβ41Dec 11, 2025Updated 2 months ago
- β14Jun 24, 2024Updated last year
- Find bottlenecks in your test suitesβ17Updated this week
- Implementation of Reinforce for educational purposes.β12Jun 12, 2023Updated 2 years ago
- Driver for coupled AMR-Wind/Nalu-Wind simulationsβ13Nov 10, 2025Updated 3 months ago
- A conda-smithy repository for ollama.β10Updated this week
- See https://github.com/cuda-mode/triton-index/ instead!β11May 8, 2024Updated last year
- β15Jun 30, 2025Updated 8 months ago
- Given a Substack newsletter, save the contents into an sqlite db and format it as an epubβ13Jan 11, 2024Updated 2 years ago
- β16Jan 29, 2026Updated last month
- β14Dec 12, 2023Updated 2 years ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.β12Nov 4, 2025Updated 4 months ago
- PlantDetector provides easy development (training and prediction) for object detection. DETR (End-to-End Object Detection with Transformeβ¦β11Aug 1, 2022Updated 3 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Modelsβ10Oct 27, 2023Updated 2 years ago
- A list where most values will be None (or default)β11Jul 19, 2023Updated 2 years ago