DBP15K_ZH_EN

class DBP15K_ZH_EN(root: str | Path, download: bool | None = False, train_ratio: float | None = 0.2, dtype: dtype = torch.float32, seed: int | None = 0)

Bases: Dataset

A pair of networks built from the Chinese and English versions of the multilingual DBpedia knowledge base. The dataset was proposed in the “Cross-lingual Entity Alignment via Joint Attribute-Preserving Embedding” paper, and the node attributes are pre-trained, aligned monolingual word embeddings from the “Cross-lingual Knowledge Graph Alignment via Graph Matching Neural Network” paper. DBP15K (Chinese to English) contains 15,000 pairs of aligned entities.

Graph   #nodes   #edges   #node attrs   #edge attrs
ZH      19,388   70,414   300           0
EN      19,572   95,142   300           0
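
The signature pairs a train_ratio default of 0.2 with a seed default of 0, which suggests a seeded split of the 15,000 aligned entity pairs into training and test anchors. The sketch below illustrates that idea with the standard library only. It assumes train_ratio is the fraction of aligned pairs kept for training, which the text above does not state; split_aligned_pairs is a hypothetical helper and not part of the documented class, and the class's actual shuffling procedure may differ.

```python
import random

# Number of aligned entity pairs, as stated in the dataset description.
NUM_ALIGNED_PAIRS = 15_000


def split_aligned_pairs(num_pairs, train_ratio=0.2, seed=0):
    """Hypothetical helper: shuffle pair indices with a fixed seed and split them.

    Assumption: train_ratio is the fraction of aligned pairs used for training.
    """
    indices = list(range(num_pairs))
    # A dedicated Random instance keeps the split reproducible
    # without touching the global random state.
    random.Random(seed).shuffle(indices)
    num_train = round(num_pairs * train_ratio)
    return indices[:num_train], indices[num_train:]


train_idx, test_idx = split_aligned_pairs(NUM_ALIGNED_PAIRS)
print(len(train_idx), len(test_idx))  # prints "3000 12000"
```

Under this assumption, the defaults would leave 3,000 pairs for training and 12,000 for evaluation, and calling the helper again with the same seed reproduces the same split.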