epfl-dlab/homepage2vec

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/epfl-dlab/homepage2vec)

epfl-dlab / homepage2vec

Language-Agnostic Website Embedding and Classification

☆48

Alternatives and similar repositories for homepage2vec

Users that are interested in homepage2vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

X-LANCE / weblm
View on GitHub
[WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
☆18Mar 6, 2024Updated 2 years ago
X-LANCE / TIE
View on GitHub
[NAACL 2022] TIE: Topological Information Enhanced Structural Reading Comprehension on Web Pages
☆22Jun 3, 2022Updated 4 years ago
epfl-dlab / pairformance
View on GitHub
Tool to perform paired evaluation of automatic systems
☆13Oct 20, 2021Updated 4 years ago
epfl-dlab / forc
View on GitHub
Framework for Cost-Effective Language Model Choice
☆16Dec 12, 2023Updated 2 years ago
epfl-dlab / Quotebank
View on GitHub
Code and data for the WSDM '21 paper "Quotebank: A Corpus of Quotations from a Decade of News"
☆22Jul 23, 2021Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
X-LANCE / WebSRC-Baseline
View on GitHub
[EMNLP 2021] The baseline code for WebSRC dataset.
☆51Apr 2, 2025Updated last year
jlevin2 / twitch-scraper
View on GitHub
Python script that pulls from twitch API and sends email alerts
☆10Oct 20, 2017Updated 8 years ago
iliassarbout / CityOfLight
View on GitHub
City of Light (COL) is a geospatially faithful, Unity-based digital twin of Paris enabling high-performance embodied simulation for AI an…
☆48Mar 31, 2026Updated 3 months ago
sz128 / pretrained_word_embeddings
View on GitHub
It is about how to load and aggregate pretrained word embeddings in pytorch, e.g., ELMo\BERT\XLNET.
☆12Mar 2, 2020Updated 6 years ago
lyyf2002 / ASGEA
View on GitHub
Code for ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment
☆12Feb 28, 2024Updated 2 years ago
tarekziade / mwcat
View on GitHub
MediaWiki Categories Model
☆13Feb 14, 2024Updated 2 years ago
roife / BUAACalendarHelper
View on GitHub
A tiny iOS app for fetching classes from BUAA. (course assignment for BUAA-Swift)
☆10May 8, 2021Updated 5 years ago
louisbarclay / awesome-web-research-tools
View on GitHub
A curated list of awesome tools, frameworks, and resources for web research data collection, including tools that support observational m…
☆23Dec 16, 2025Updated 6 months ago
microsoft / multimodal-aligned-recipe-corpus
View on GitHub
☆18Jun 5, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
wantedly / intern-info
View on GitHub
Wantedlyのインターン情報や新卒採用についてのインフォメーションです
☆11Apr 5, 2022Updated 4 years ago
shibing624 / text2vec-service
View on GitHub
Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务，支持GPU多卡、多worker、多客户端调用，开箱即用。
☆12May 24, 2022Updated 4 years ago
StanfordHCI / FeedMonitor
View on GitHub
☆35May 6, 2026Updated last month
liyown / get_bibtex
View on GitHub
A Python tool for fetching citations from multiple sources.
☆15Jun 11, 2026Updated 3 weeks ago
OekoJ / softwarefootprint
View on GitHub
How much is the footprint of a piece of software? This script scans the process statistics for the appearance of a given command name and…
☆11Nov 16, 2023Updated 2 years ago
X-LANCE / META-GUI-baseline
View on GitHub
[EMNLP 2022] The baseline code for META-GUI dataset
☆16Jul 9, 2024Updated last year
jerrylin0809 / pac-bayesian-dendrogram-cut
View on GitHub
☆12May 10, 2021Updated 5 years ago
DFKI-NLP / LLMCheckup
View on GitHub
Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…
☆13Mar 24, 2024Updated 2 years ago
iwiwi / epochraft-hf-fsdp
View on GitHub
Example of using Epochraft to train HuggingFace transformers models with PyTorch FSDP
☆11Jan 29, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ben-aaron188 / textwash
View on GitHub
☆36Feb 22, 2026Updated 4 months ago
hlt-mt / TranscRater
View on GitHub
An open-source tool for automatic speech recognition ASR quality estimation.
☆24Dec 12, 2019Updated 6 years ago
SapienzaNLP / usea
View on GitHub
Universal Semantic Annotator (LREC 2022)
☆17Jan 29, 2025Updated last year
lionelclement / Elvex
View on GitHub
A Natural Language Generation System
☆14Updated this week
hardik-vala / FreNetic
View on GitHub
API for WOLF, a free French WordNet
☆14May 4, 2018Updated 8 years ago
Totobarjo / bsqli_en_GO
View on GitHub
BSQLi de coffinxp réécrie en GO, son repos a été reporté en masse, il a donc été fermé.
☆13Jul 26, 2024Updated last year
tibor / movetodon
View on GitHub
☆10Dec 19, 2022Updated 3 years ago
SapienzaNLP / srl4e
View on GitHub
☆11Aug 2, 2022Updated 3 years ago
cnap / smt-for-gec
View on GitHub
☆12Sep 8, 2017Updated 8 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
LoicGrobol / decofre
View on GitHub
Neural coreference resolution
☆12Sep 3, 2024Updated last year
Richar-Du / Virgo
View on GitHub
Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*
☆20May 27, 2025Updated last year
ChunhuaLiu596 / WAX
View on GitHub
The respository describing a novel datasets for word association explanations
☆13Sep 21, 2023Updated 2 years ago
htoyryla / minidiffusion
View on GitHub
Implementation of Denoising Diffusion Probabilistic Model in Pytorch
☆18Jun 20, 2023Updated 3 years ago
nazem-Aldroubi / emotion-cause-extraction-dl-final-proj
View on GitHub
Emotion-cause pair extraction
☆13May 4, 2021Updated 5 years ago
QianRuan / histruct
View on GitHub
ACL 2022 Findings paper
☆16Jul 4, 2022Updated 4 years ago
Heidelberg-NLP / LMs4Implicit-Knowledge-Generation
View on GitHub
Code for equipping pretrained language models (BART, GPT-2, XLNet) with commonsense knowledge for generating implicit knowledge statement…
☆15Jul 27, 2021Updated 4 years ago