MCL Research on Word Embedding

By Yijing Yang | June 2, 2019 | News

Word embeddings have been widely applied across many NLP tasks. The goal of word embedding is to map words to vector representations that capture both syntactic and semantic information. General-purpose word embeddings are usually trained on a large corpus, such as the full text of Wikipedia.

Our first work focuses on improving already-trained word embedding models so that they become more representative. The motivations are: (1) Although current models are trained without regard to the order of the dimensions, the resulting word embeddings usually carry a large mean, and most of the variance lies in the first several principal components. This can lead to the hubness problem, so we analyze these statistics to make the whole space more isotropic. (2) The information in ordered input sequences is lost because of the context-based training scheme. Based on this analysis, we proposed two post-processing methods for word embeddings, called Post-processing via Variance Normalization (PVN) and Post-processing via Dynamic Embedding (PDE). The effectiveness of our methods is verified with both intrinsic and extrinsic evaluation methods. For details, please refer to [1].
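To make motivation (1) concrete, here is a minimal sketch of one plausible reading of variance normalization: subtract the mean, then shrink the top principal components so that their spread matches that of the next component. This is an illustration under stated assumptions, not the exact PVN procedure from [1]. The function name `variance_normalize`, the parameter `num_components`, and the synthetic data are all hypothetical.

```python
import numpy as np


def variance_normalize(embeddings, num_components):
    """Center the embeddings and shrink the top principal components.

    Assumption (not taken from the source): each of the top
    `num_components` directions is scaled so that its singular value
    matches that of the next component.
    """
    X = np.asarray(embeddings, dtype=float)
    X_centered = X - X.mean(axis=0)  # remove the large common mean

    # Rows of Vt are principal directions; singular values are sorted descending.
    _, singular_values, Vt = np.linalg.svd(X_centered, full_matrices=False)

    d = num_components
    if not 0 < d < len(singular_values):
        raise ValueError("num_components must be between 1 and rank - 1")

    target = singular_values[d]  # spread of the first component left untouched
    X_out = X_centered.copy()
    for i in range(d):
        if singular_values[i] == 0:
            continue  # nothing to shrink along a degenerate direction
        u = Vt[i]
        shrink = (singular_values[i] - target) / singular_values[i]
        projection = X_centered @ u  # coordinate of every word along u
        X_out -= shrink * np.outer(projection, u)
    return X_out


# Synthetic stand-in for an embedding matrix: 200 "words", 10 dimensions,
# a large mean, and two dominant directions (made-up data for illustration).
rng = np.random.default_rng(0)
scales = np.array([8.0, 4.0] + [1.0] * 8)
vectors = rng.normal(size=(200, 10)) * scales + 3.0

processed = variance_normalize(vectors, num_components=2)
```

After this step the processed vectors have zero mean, and the two dominant directions no longer carry more variance than the third, which is the kind of flattening that makes the space closer to isotropic.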

Word embedding has been very popular over the past several years, but evaluation has mainly been conducted with intrinsic evaluation methods because of their convenience. In the natural language processing community, we care more about the effectiveness of word embeddings on real NLP tasks such as translation, sentiment analysis, and question answering. Our second work focuses on word embedding quality and its relationship with evaluation methods. We discuss the criteria that a good word embedding, and a good evaluation method, should satisfy. We also discuss the properties of intrinsic evaluation methods, because different intrinsic evaluators test embeddings from different perspectives. Finally, we present a correlation study between intrinsic evaluation methods and real-world applications. For details, please refer to [2].
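A correlation study of this kind can be sketched as follows: score several embedding models with an intrinsic evaluator and on a downstream task, then measure how well the two sets of scores agree. The sketch uses the Pearson coefficient as one common choice; the text above does not say which coefficient was used. The function name `pearson` is hypothetical, and the scores are made-up numbers for illustration, not results from [2].

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 items")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for four embedding models (illustrative values only).
intrinsic_scores = [0.61, 0.65, 0.70, 0.72]  # e.g. a word-similarity benchmark
extrinsic_scores = [0.80, 0.82, 0.85, 0.84]  # e.g. accuracy on a downstream task

r = pearson(intrinsic_scores, extrinsic_scores)
print(f"correlation between intrinsic and extrinsic scores: {r:.3f}")
```

A coefficient close to 1 would suggest that the intrinsic evaluator ranks models much as the downstream task does, while a value near 0 would suggest that it says little about real-task performance.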

Recently, neural-network-based methods have become popular for contextualized word embedding and pre-trained networks. However, the original word vector representation is still of great interest because of its flexibility when applied to other tasks. One interesting direction for future research is how to measure the relationship between words based on their vector representations. In addition, popular word embedding models have been applied to graph embedding, where models such as DeepWalk and node2vec achieve state-of-the-art results.

–Author: Bin Wang

References

[1]: Wang, Bin, et al. “Post-Processing of Word Representations via Variance Normalization and Dynamic Embedding.” 2019 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2019.

[2]: Wang, Bin, et al. “Evaluating Word Embedding Models: Methods and Experimental Results.” arXiv preprint arXiv:1901.09785 (2019).

About the Author: Yijing Yang

Thesis Title: Advanced Techniques for Object Classification: Methodologies and Performance Evaluation, June 2022. Employment: Huawei Technologies Co. Ltd., Beijing, China. The 164th PhD from MCL.

