vigilant-umbrella / wikiHowUnofficialAPI
API to extract data from wikiHow
☆17Updated 4 years ago
Alternatives and similar repositories for wikiHowUnofficialAPI
Users that are interested in wikiHowUnofficialAPI are comparing it to the libraries listed below
- An official codebase for paper "CHAMPAGNE: Learning Real-world Conversation from Large-Scale Web Videos (ICCV 23)"☆52Updated 2 years ago
- [2024-ACL]: TextBind: Multi-turn Interleaved Multimodal Instruction-following in the Wild☆47Updated 2 years ago
- ☆50Updated 8 months ago
- ☆50Updated 2 years ago
- Vision Large Language Models trained on M3IT instruction tuning dataset☆17Updated 2 years ago
- ☆17Updated 2 years ago
- Official code for infimm-hd☆16Updated last year
- ☆74Updated last year
- Self-hosted GPT-4V api☆27Updated 2 years ago
- ☆18Updated last year
- This repo contains code and data for ICLR 2025 paper MIA-Bench: Towards Better Instruction Following Evaluation of Multimodal LLMs☆36Updated 11 months ago
- A curated list of the papers, repositories, tutorials, and anything related to the large language models for tools☆68Updated 2 years ago
- [EMNLP 2023] Context Compression for Auto-regressive Transformers with Sentinel Tokens☆25Updated 2 years ago
- SummScreen: A Dataset for Abstractive Screenplay Summarization (ACL 2022)☆41Updated 3 years ago
- [TMLR 2024] Official implementation of "Sight Beyond Text: Multi-Modal Training Enhances LLMs in Truthfulness and Ethics"☆20Updated 2 years ago
- A curated list of resources about long-context in large-language models and video understanding.☆31Updated 2 years ago
- Multimodal-Procedural-Planning☆93Updated 2 years ago
- [ACL 2023] Code for paper “Tailoring Instructions to Student’s Learning Levels Boosts Knowledge Distillation”(https://arxiv.org/abs/2305.…☆38Updated 2 years ago
- Touchstone: Evaluating Vision-Language Models by Language Models☆83Updated 2 years ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated last year
- Official implementation of our paper "Finetuned Multimodal Language Models are High-Quality Image-Text Data Filters".☆69Updated 9 months ago
- FocusLLM: Scaling LLM’s Context by Parallel Decoding☆44Updated last year
- ☆66Updated 2 years ago
- Reproduction of LLaVA-v1.5 based on Llama-3-8b LLM backbone.☆65Updated last year
- ☆65Updated 2 years ago
- PyTorch implementation of StableMask (ICML'24)☆15Updated last year
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆48Updated 11 months ago
- Source code for paper: "AltDiffusion: A multilingual Text-to-Image diffusion model"☆44Updated last year
- Video dataset dedicated to portrait-mode video recognition.☆55Updated 3 months ago
- Official implementation for the paper "Transferring Visual Knowledge with Pre-trained Models for Multimodal Machine Translation", publish…☆20Updated last year