A knowledge graph is a collection of factual triples (h, r, t), each consisting of two entities (a head h and a tail t) and a relation r. Most knowledge graphs are incomplete: many relations between entities are missing. To predict these missing links, each relation is modeled by a binary classifier that predicts whether a link between two entities exists. Negative sampling is the task of drawing negative samples efficiently and effectively from the unobserved triples to train these classifiers. The quality and quantity of the negative samples strongly affect link-prediction performance.
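To make the setup concrete, here is a minimal sketch of a knowledge graph as a set of triples, with a helper that collects the positive (head, tail) pairs for one relation's binary classifier. The toy triples and the helper name `positives_for` are illustrative, not part of the described system.

```python
# Toy knowledge graph as a set of (head, relation, tail) triples.
kg = {
    ("Hulk", "movie_genre", "Science Fiction"),
    ("Titanic", "movie_genre", "Romance"),
    ("Ang Lee", "directed", "Hulk"),
}

def positives_for(relation):
    """Observed (head, tail) pairs for one relation; these serve as the
    positive examples for that relation's binary classifier."""
    return {(h, t) for h, r, t in kg if r == relation}
```

Everything outside this set is unobserved; negative sampling decides which of those unobserved pairs to present to the classifier as negatives.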

Naive negative sampling [1] generates negative samples by corrupting one of the entities in an observed triple, producing (h’, r, t) or (h, r, t’). Despite its simplicity, the generated negative samples carry little semantics. For example, given a positive triple (Hulk, movie_genre, Science Fiction), naive negative sampling might generate (Hulk, movie_genre, New York City), which can never be a valid triple in any real-world scenario. Instead, we are looking for negative examples, such as (Hulk, movie_genre, Romance), that provide more information to the classifiers. Based on this observation, we draw corrupted entities only from the set of observed entities that have been linked by the given relation, also known as the ‘range’ of the relation. A drawback, however, is that the chance of drawing false negatives is high. We therefore further filter the drawn corrupted entities based on entity-entity co-occurrence. For example, we are unlikely to generate the negative sample (Hulk, movie_genre, Adventure), because the dataset shows that the genres ‘Science Fiction’ and ‘Adventure’ frequently co-occur. The first figure shows how positive and negative samples can be derived from a given knowledge graph.
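The range constraint and co-occurrence filter above can be sketched as follows. This is a simplified illustration under assumed data structures (the toy triples, the `range_of` and `cooccur` tables, and the function `sample_negative` are all hypothetical names, not the actual implementation); here co-occurrence is approximated by tails that share a head under the same relation.

```python
import random
from collections import defaultdict

triples = {
    ("Hulk", "movie_genre", "Science Fiction"),
    ("Hulk", "movie_genre", "Adventure"),
    ("Titanic", "movie_genre", "Romance"),
    ("Avatar", "movie_genre", "Science Fiction"),
    ("Avatar", "movie_genre", "Adventure"),
}

# Range of each relation: the tail entities observed with it.
range_of = defaultdict(set)
for h, r, t in triples:
    range_of[r].add(t)

# Entity-entity co-occurrence: tails that appear together under the
# same (head, relation), e.g. 'Science Fiction' and 'Adventure'.
by_head_rel = defaultdict(set)
for h, r, t in triples:
    by_head_rel[(h, r)].add(t)
cooccur = defaultdict(set)
for tails in by_head_rel.values():
    for a in tails:
        cooccur[a] |= tails - {a}

def sample_negative(h, r, t, rng=random):
    """Corrupt the tail within the relation's range, skipping entities
    that co-occur with the true tail (likely false negatives) and
    entities that already form an observed triple."""
    candidates = [
        e for e in range_of[r]
        if e != t and e not in cooccur[t] and (h, r, e) not in triples
    ]
    return (h, r, rng.choice(candidates)) if candidates else None
```

On this toy graph, corrupting (Hulk, movie_genre, Science Fiction) can only yield ‘Romance’: ‘Adventure’ is rejected both as a co-occurring genre and as an observed triple, while out-of-range entities such as ‘New York City’ are never considered.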

To generate more effective negative samples, previous work has used GAN models [2] and self-adversarial learning [3]. In our work, self-adversarial negative samples are drawn from the negative samples mis-classified in the previous iteration and are reused in later iterations. The second figure demonstrates the effectiveness of range-constrained and self-adversarial negative sampling compared to naive negative sampling.
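A minimal sketch of this self-adversarial loop, assuming a `classify` predicate and a `sample_negative` generator are available (both hypothetical names): negatives that the classifier mistakes for positives are kept and replayed as hard negatives in the next pass.

```python
def train_epoch(positives, sample_negative, classify, hard_negatives):
    """One training pass: pair each positive with a negative, preferring
    hard negatives carried over from the previous epoch. Returns the
    labeled batch and the mis-classified negatives to replay next time."""
    batch, new_hard = [], []
    for pos in positives:
        # Reuse a previously mis-classified negative if any remain.
        neg = hard_negatives.pop() if hard_negatives else sample_negative(*pos)
        if neg is None:
            continue
        batch.append((pos, 1))
        batch.append((neg, 0))
        # A negative scored as positive is kept for the next epoch.
        if classify(neg) == 1:
            new_hard.append(neg)
    return batch, new_hard
```

The design intuition is that negatives near the decision boundary carry the most gradient signal, so recycling them focuses training where the classifier is currently wrong.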



[1] Wang, Zhen, et al. “Knowledge graph embedding by translating on hyperplanes.” Proceedings of the AAAI Conference on Artificial Intelligence. Vol. 28. No. 1. 2014.

[2] Cai, Liwei, and William Yang Wang. “KBGAN: Adversarial learning for knowledge graph embeddings.” arXiv preprint arXiv:1711.04071 (2017).

[3] Sun, Zhiqing, et al. “RotatE: Knowledge graph embedding by relational rotation in complex space.” arXiv preprint arXiv:1902.10197 (2019).