allenai / figura11y
AI Assistance for Writing Scientific Alt Text
☆10 Updated last year
Alternatives and similar repositories for figura11y:
Users who are interested in figura11y are comparing it to the repositories listed below.
- Aligned, Review-Informed Edits of Scientific Papers ☆50 Updated last year
- ☆10 Updated 4 years ago
- A Python library for the Semantic Scholar (S2) API with typed pydantic objects and various nifty functionalities. ☆21 Updated 3 years ago
- ☆10 Updated 3 months ago
- ☆34 Updated last year
- Dataset of scientific abstracts for the purpose of sentence classification ☆10 Updated 5 years ago
- SciRepEval benchmark training and evaluation scripts ☆73 Updated 10 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆47 Updated last year
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering) ☆16 Updated last year
- GHOSTS dataset ☆38 Updated last year
- Open Access PDF harvester, metadata aggregator and full-text ingester ☆61 Updated 10 months ago
- Forecasting high-impact research topics via machine learning on evolving knowledge graphs ☆26 Updated last month
- ☆20 Updated 3 weeks ago
- human_detectors hosts the data released from the paper "People who frequently use ChatGPT for writing tasks are accurate and robust detec… ☆23 Updated last month
- CORWA: A Citation-Oriented Related Work Annotation Dataset, NAACL 2022 ☆16 Updated 4 months ago
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data ☆85 Updated 7 months ago
- ☆21 Updated last month
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award … ☆39 Updated 4 months ago
- Fine tuning Mistral-7b with PEFT (Parameter Efficient Fine-Tuning) and LoRA (Low-Rank Adaptation) on Puffin Dataset (multi-turn conversation… ☆12 Updated last year
- ☆19 Updated 4 months ago
- Finding semantically meaningful and accurate prompts. ☆46 Updated last year
- Official dataset repository for "SciReviewGen: A Large-scale Dataset for Automatic Literature Review Generation." ☆17 Updated last year
- Dataset and evaluation suite enabling LLM instruction-following for scientific literature understanding. ☆37 Updated last week
- ☆85 Updated 10 months ago
- ☆91 Updated 10 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆18 Updated last month
- A framework for pitting LLMs against each other in an evolving library of games ⚔ ☆32 Updated this week
- Training hybrid models for dummies. ☆20 Updated 2 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆31 Updated last year
- ☆31 Updated last year