DigitalHarborFoundation / FlexEval
☆12 · Updated this week
Alternatives and similar repositories for FlexEval:
Users who are interested in FlexEval are comparing it to the libraries listed below.
- ☆33 · Updated 2 years ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆92 · Updated this week
- NAACL 2024. Code & Dataset for "Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistake… ☆36 · Updated 9 months ago
- Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions" ☆24 · Updated 2 years ago
- Bayesian IRT models in Python ☆138 · Updated last week
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆88 · Updated last year
- ☆53 · Updated last year
- ☆42 · Updated last year
- 🧮 MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023 ☆51 · Updated last month
- This is the data associated with the PERSUADE Corpus 2.0 version ☆41 · Updated 5 months ago
- Dataset used to evaluate Skill Extraction systems based on the ESCO skills taxonomy. ☆13 · Updated 9 months ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering" ☆83 · Updated 8 months ago
- An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors ☆9 · Updated 2 weeks ago
- [Data + code] ExpertQA: Expert-Curated Questions and Attributed Answers ☆128 · Updated last year
- Clustering sentence embeddings to extract message intent ☆173 · Updated 3 years ago
- ☆68 · Updated last year
- ☆21 · Updated 3 years ago
- ☆34 · Updated 6 months ago
- Robust and fast topic models with sentence-transformers. ☆48 · Updated this week
- A Python package for benchmarking interpretability techniques on Transformers. ☆212 · Updated 6 months ago
- 🗺️ Data Cleaning and Textual Data Visualization 🗺️ ☆168 · Updated 10 months ago
- ☆158 · Updated 10 months ago
- Sample notebooks and prompts for LLM evaluation ☆124 · Updated this week
- ☆42 · Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆101 · Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks ☆124 · Updated 3 months ago
- Inquisitive Parrots for Search ☆190 · Updated last year
- Repository for research in the field of Responsible NLP at Meta. ☆199 · Updated 4 months ago
- Efficient Attention for Long Sequence Processing ☆93 · Updated last year
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: … ☆333 · Updated last year