vigilant-umbrella / wikiHowUnofficialAPILinks
API to extract data from wikiHow
☆17Updated 3 years ago
Alternatives and similar repositories for wikiHowUnofficialAPI
Users that are interested in wikiHowUnofficialAPI are comparing it to the libraries listed below
Sorting:
- Self-hosted GPT-4V api☆29Updated last year
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wildrounded Conversation☆47Updated last year
- ☆50Updated 3 weeks ago
- A curated list of resources about long-context in large-language models and video understanding.☆31Updated last year
- ☆50Updated last year
- ☆17Updated last year
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Updated last year
- ☆71Updated 6 months ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆28Updated 11 months ago
- ☆17Updated last year
- ☆69Updated 3 weeks ago
- IRFL: Image Recognition of Figurative Language☆11Updated last year
- Empirical Study Towards Building An Effective Multi-Modal Large Language Model☆22Updated last year
- An official codebase for paper " CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated last year
- Source code for EMNLP2022 long paper: Parameter-Efficient Tuning Makes a Good Classification Head☆14Updated 2 years ago
- Official Github Repo for the Findings of EMNLP 2021 paper "An animated picture says at least a thousand words: Selecting Gif-based Replie…☆32Updated 3 years ago
- Multimodal-Procedural-Planning☆92Updated 2 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch☆30Updated this week
- The Conceptual Coverage Across Languages Benchmark for Text-to-Image Models☆12Updated 8 months ago
- Code repository for CVPR 2022 paper "VALHALLA: Visual Hallucination for Machine Translation"☆28Updated 2 years ago
- List of papers on Self-Correction of LLMs.☆73Updated 6 months ago
- Implementation of LREC-COLING 2024 paper A Frustratingly Simple Decoding Method for Neural Text Generation☆19Updated last year
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…☆19Updated last year
- ☆29Updated 10 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated last year
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆44Updated last year
- MaXM is a suite of test-only benchmarks for multilingual visual question answering in 7 languages: English (en), French (fr), Hindi (hi),…☆13Updated last year
- Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models☆44Updated last year
- [ACL 2021] mTVR: Multilingual Video Moment Retrieval☆27Updated 2 years ago