Main repository for all code and data related to Language Analytics (F24).
☆27 Mar 11, 2025 Updated last year
Alternatives and similar repositories for cds-language
Users who are interested in cds-language are comparing it to the libraries listed below.
- Main repository for all code and data related to Visual Analytics (F25) ☆22 Mar 21, 2025 Updated last year
- Primary repository for the NLP course as part of the CogSci masters program at Aarhus University. ☆23 Sep 16, 2025 Updated 6 months ago
- Converting irregularly spaced time series, such as electronic health records, into dataframes for tabular classification. ☆19 Jun 17, 2025 Updated 9 months ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪 ☆79 Sep 20, 2021 Updated 4 years ago
- Sentida ☆22 Dec 14, 2021 Updated 4 years ago
- Driver for LIWC2015 analysis. LIWC2015 dictionary not included. ☆16 Nov 24, 2022 Updated 3 years ago
- This repository contains code for the paper "RP-DNN: A Tweet level propagation context based deep neural networks for early rumor detecti… ☆34 Apr 29, 2025 Updated 10 months ago
- The Prism Alignment Project ☆90 Apr 25, 2024 Updated last year
- API Documentation ☆128 Aug 14, 2024 Updated last year
- Comprehensive NLP Evaluation System ☆188 Aug 8, 2024 Updated last year
- Statistical Rethinking Course Winter 2020/2021 ☆655 Mar 3, 2021 Updated 5 years ago
- Linguistic Inquiry and Word Count (LIWC) analyzer ☆235 Dec 20, 2021 Updated 4 years ago
- Robustness Gym is an evaluation toolkit for machine learning. ☆445 Jun 28, 2022 Updated 3 years ago
- Vector Hub - Library for easy discovery, and consumption of State-of-the-art models to turn data into vectors. (text2vec, image2vec, vide… ☆561 Aug 20, 2024 Updated last year
- Facepager was made for fetching publicly available data from YouTube, Twitter and other websites on the basis of APIs and webscraping. ☆536 Nov 26, 2025 Updated 3 months ago
- Interactive data tables for R ☆670 Feb 24, 2026 Updated 3 weeks ago
- Statistical Rethinking course winter 2022 ☆4,100 Mar 15, 2022 Updated 4 years ago
- A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coher… ☆1,266 Jul 24, 2025 Updated 7 months ago
- Pushshift API ☆1,411 Apr 6, 2023 Updated 2 years ago
- Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the o… ☆2,878 Jun 24, 2024 Updated last year
- Perform data science on data that remains in someone else's server ☆9,865 Jul 15, 2025 Updated 8 months ago
- MTEB: Massive Text Embedding Benchmark ☆3,166 Mar 15, 2026 Updated last week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. Published in Nature. ☆3,424 Jul 25, 2025 Updated 7 months ago
- RAG (Retrieval Augmented Generation) Framework for building modular, open source applications for production by TrueFoundry ☆4,329 Mar 13, 2026 Updated last week
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conve… ☆4,235 Aug 25, 2025 Updated 6 months ago
- Notebooks using the Hugging Face libraries 🤗 ☆4,487 Mar 12, 2026 Updated last week
- Data augmentation for NLP ☆4,652 Jun 24, 2024 Updated last year
- Collection of useful data science topics along with articles, videos, and code ☆4,181 Dec 2, 2025 Updated 3 months ago
- The "Python Machine Learning (3rd edition)" book code repository ☆5,000 Apr 19, 2023 Updated 2 years ago
- LLM-powered multiagent persona simulation for imagination enhancement and business insights. ☆7,327 Feb 27, 2026 Updated 3 weeks ago
- This repository contains demos I made with the Transformers library by HuggingFace. ☆11,531 Mar 9, 2026 Updated 2 weeks ago
- High accuracy RAG for answering questions from scientific documents with citations ☆8,298 Updated this week
- A collection of projects designed to help developers quickly get started with building deployable applications using the Claude API ☆15,441 Feb 5, 2026 Updated last month
- Open source annotation tool for machine learning practitioners. ☆10,583 Updated this week
- A logical, reasonably standardized, but flexible project structure for doing and sharing data science work. ☆9,744 Mar 3, 2026 Updated 2 weeks ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,774 Mar 11, 2026 Updated last week
- High-speed Large Language Model Serving for Local Deployment ☆8,834 Jan 24, 2026 Updated last month
- Running large language models on a single GPU for throughput-oriented scenarios. ☆9,379 Oct 28, 2024 Updated last year
- Official inference library for Mistral models ☆10,730 Feb 26, 2026 Updated 3 weeks ago