传播IB方法的研究

项目来源

国家自然科学基金(NSFC)

项目主持人

叶阳东

项目受资助机构

郑州大学

立项年度

2017

立项时间

未公开

项目编号

61772475

研究期限

未知 / 未知

项目级别

国家级

受资助金额

62.00万元

学科

信息科学-计算机科学-信息安全

学科代码

F-F02-F0206

基金类别

面上项目

关键词

传播IB ; 信息度量 ; 传播机制 ; 分层模型 ; 多源异构数据 ; Propagation Information Bottleneck ; Information Measurement ; Multiple Heterogeneous Data ; Propagation Mechanism ; Hierarchical Model

参与者

姬波；卢红星；朱真峰；娄铮铮；吴云鹏；闫小强；吴宾；胡世哲；时增林

参与机构

郑州大学

项目标书摘要：本项目针对现有IB方法处理多源异构数据存在的局限性，提出传播IB方法，拟解决相关模型确定、传播机制构建、综合平衡参数调整、深度度量函数确定及应用适用性等关键问题。基于多信息和交互信息度量，对传播IB方法相关模型中变量间关系进行建模，构建模式参数确定策略；以因子图结构为核心，构建信息传播机制，使传播IB方法充分考虑异构数据对象的关联性和模式结构的层次性；使用自适应LASSO求解传播IB方法中的综合平衡参数；基于传播IB方法度量复杂数据的层次模型，用K-近邻估计法计算复杂数据模型中各层间、各层与相关变量间的互信息，提高深度度量方法的鲁棒性；开展隐藏信息分析、多传感器监控以及信息推荐的应用研究，力图发现传播IB方法所适用问题的特征及规律。该项目在传播机制、复杂数据模型度量方面的研究是原创性的，对多源异构数据处理的研究将进一步拓展IB方法的应用范围。项目的相关研究力图将IB方法推向新的研究阶段。

Application Abstract: This project proposes a propagation Information Bottleneck(IB)method which aims at remedying the limitations of current solutions on multiple heterogenous data.It intends to solve important problems such as determining related model,generating propagation mechanism,adjusting a series of balance parameters,selecting deep measurement function,and practicality of application.Based on criteria such as multi-information and interactive-information,we model the relationship of the latent variables in propagation IB method and propose a framework for determining the pattern parameters.Propagation IB can make use of the correlation of hetergeneous data object and the hierarchy of pattern structure by constructing the information propagation mechanisms based on the factor graph structure.The adaptive LASSO method is used to get the values of a series of balance parameters in propagation IB.To measure the complex data hierarchical models by propagation IB,K-Nearest Neighbor estimation method is used to compute the mutual information of each layers and the mutual information between each layer and relevant variables in complex data models.As a result,the robustness of the deep measurement function can be guaranteed.We will apply propagation IB to various application fields,including the analysis of hidden information,the warning of multi-sensor surveillance and information recommendation,in order to find the common patterns in problems which are solvable by propagation IB.The original contributions include the research on propagation mechanism and the measurement of complex data model.The research therein on multiple heterogeneous data will extend the field of IB method application.The works in this project will fill the research gap in literature and further open a new page for IB method.

项目受资助省

河南省

项目结题报告(全文)

项目针对传播IB方法及相关算法进行了深入的研究，超额完成了申报书中的任务，取得了丰硕的研究成果。1在传播IB方法中信息传播模型及相关算法的研究方面，提出了基于关联关系传播IB模型、双层关联的传播 IB模型、融合异构特征的协作IB模型、联合个性和共性信息的传播IB模型、视觉上下文IB模型、多任务联合IB算法、面向高维共现数据的交互IB模型，并研究了相关的优化算法。2在传播IB方法的权重学习研究方面，引入了不同的权重学习机制，提出了簇加权多视角IB算法、动态自动加权多视角联合IB聚类算法、基于内容和上下文的加权多视角IB聚类算法、双重加权的多视角IB聚类算法，实现了自动赋权和算法优化互相促进，从而提高了传播IB方法的有效性和灵活性。3在传播IB方法中信息度量及互信息最大化研究方面，提出了深度互信息最大最小化方法、组约束信息最大化聚类方法、多任务图像聚类的深度相关性挖掘方法、异构双任务聚类方法、基于信息最大化的多任务视频聚类算法、聚类模式参数的确定算法。4在传播IB方法的应用适应性研究方面，进行了传播IB方法在推荐系统、人群计数、多模态数据分析等方面的应用拓展研究，提出了相应的模型和相关算法，充分验证了传播IB方法的有效性和适应性。项目取得的研究成果发表在国内外重要会议或期刊上，如CVPR 2018、AAAI2021、SDM 2020、IEEE ICASSP 2020、IEEE Transactions on Image Processing、IEEE Transactions on Knowledge and Data Engineering、IEEE Transactions on Cybernetics、IEEE Transaction on Industrial Informatics、IEEE Transactions on Multimedia、Information Fusion、Pattern Recognition、ACM Transactions on Knowledge Discovery from Data、Information Sciences、Expert Systems with Applications、Knowledge-Based Systems、Applied Soft Computing、中国科学：信息科学、计算机学报等。

排序方式：时间相关性
显示方式：列表摘要

1.Deep Mutual Information Maximin for Cross-Modal Clustering

关键词：
MULTIVIEW

Mao, Yiqiao;Yan, Xiaoqiang;Guo, Qiang;Ye, Yangdong
《THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE》
2021年
35卷
期
期刊

Cross-modal clustering (CMC) aims to enhance the clustering performance by exploring complementary information from multiple modalities. However, the performances of existing CMC algorithms are still unsatisfactory due to the conflict of heterogeneous modalities and the high-dimensional non-linear property of individual modality. In this paper, a novel deep mutual information maximin (DMIM) method for cross-modal clustering is proposed to maximally preserve the shared information of multiple modalities while eliminating the superfluous information of individual modalities in an end-to-end manner. Specifically, a multi-modal shared encoder is firstly built to align the latent feature distributions by sharing parameters across modalities. Then, DMIM formulates the complementarity of multi-modalities representations as a mutual information maximin objective function, in which the shared information of multiple modalities and the superfluous information of individual modalities are identified by mutual information maximization and minimization respectively. To solve the DMIM objective function, we propose a variational optimization method to ensure it converge to a local optimal solution. Moreover, an auxiliary overclustering mechanism is employed to optimize the clustering structure by introducing more detailed clustering classes. Extensive experimental results demonstrate the superiority of DMIM method over the state-of-the-art cross-modal clustering methods on IAPR-TC12, ESP-Game, MIRFlickr and NUSWide datasets.

...

2.细粒度建模用户兴趣的序列化推荐方法

关键词：
胶囊网络;序列化推荐;门单元机制;隐式反馈;推荐系统

张麒;吴宾;孙中川;叶阳东
《中国科学:信息科学》
2022年
卷
10期
期刊

序列化推荐因其实用性和较高推荐精度在近期受到了人们广泛关注.不同于传统推荐方法,序列化推荐的核心在于如何基于用户近期交互行为来捕获用户的短期兴趣.现有工作或者依次考虑用户交互序列中物品之间的成对关系,忽略了更为重要的多对

...

3.DMIB: Dual-Correlated Multivariate Information Bottleneck for Multiview Clustering

关键词：
Correlation; Clustering algorithms; Clustering methods; Convergence;Bayes methods; Cybernetics; Reliability; Multivariate informationbottleneck (MIB); multiview clustering (MVC); unsupervised learning;FEATURES

Hu, Shizhe;Shi, Zenglin;Ye, Yangdong
《IEEE TRANSACTIONS ON CYBERNETICS》
2022年
52卷
6期
期刊

Multiview clustering (MVC) has recently been the focus of much attention due to its ability to partition data from multiple views via view correlations. However, most MVC methods only learn either interfeature correlations or intercluster correlations, which may lead to unsatisfactory clustering performance. To address this issue, we propose a novel dual-correlated multivariate information bottleneck (DMIB) method for MVC. DMIB is able to explore both interfeature correlations (the relationship among multiple distinct feature representations from different views) and intercluster correlations (the close agreement among clustering results obtained from individual views). For the former, we integrate both view-shared feature correlations discovered by learning a shared discriminative feature subspace and view-specific feature information to fully explore the interfeature correlation. This allows us to attain multiple reliable local clustering results of different views. Following this, we explore the intercluster correlations by learning the shared mutual information over different local clusterings for an improved global partition. By integrating both correlations, we formulate the problem as a unified information maximization function and further design a two-step method for optimization. Moreover, we theoretically prove the convergence of the proposed algorithm, and discuss the relationships between our method and several existing clustering paradigms. The experimental results on multiple datasets demonstrate the superiority of DMIB compared to several state-of-the-art clustering methods.

...

4.View-wise VS Cluster-wise Weight:Which Is Better for Multi-view Clustering?

胡世哲；娄铮铮；叶阳东；
《》
0年
卷
期
期刊

5.Incremental Multiview Clustering With Continual Information Bottleneck Method

关键词：
Consistency mining; deep clustering; incremental learning; informationbottleneck (IB); multiview clustering (MVC)

Yan, Xiaoqiang;Mao, Yiqiao;Ye, Yangdong;Yu, Hui
《IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS》
2024年
卷
期
期刊

Multiview clustering (MVC) provides a natural formulation to generate clusters for multiview data, which is fundamental to lots of industrial tasks like autonomous driving, defect detection, and multisensor information fusion, as part of the foundation models. Most existing MVC methods suppose that the data of multiple views are available during the clustering process. However, that is a very strong assumption and is impractical when the views are incremental over time. In addition, if directly applying existing MVC approaches to the clustering setting with incremental views, the massive redundant information in each view might limit the knowledge sharing between historical and newly arrived views. To solve these problems, a continual information bottleneck (CIB) method is presented in this article, which addresses the incremental MVC issue by maximally preserving the consistency of a sequence of views and removing the redundant information in each view. In particular, to facilitate the knowledge transfer from historical views to incoming one, we build a knowledge library to store the representative samples in historical views. When adding a new view, we first construct a view-specific encoder with information-theoretic constraints to learn a compact and discriminative representation, in which redundant information in the new view is eliminated. Then, to capture the consistency information between historical views and the new view, a shared encoder is devised after retrieving the global neighbors in the library for the new view, which is performed by contrasting the cluster assignment and feature representation simultaneously. Finally, a unified objective function is devised to simultaneously optimize the knowledge library and clustering process, in which the knowledge library is updated by maximizing the mutual information between the new view and all historical ones to keep tracking knowledge about the earlier views. Extensive experiment on nine multiview benchmarks has verified the superiority of the CIB method over 19 baselines.

...

6.Graph-Augmented Social Translation Model for Next-Item Recommendation

关键词：
Graph neural networks; next-item recommendation; social network;translation mechanism;NEURAL-NETWORK

Wu, Bin;Zhong, Lihong;Ye, Yangdong
《IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS》
2023年
19卷
11期
期刊

Next-item recommendation has been a hot research topic in academia and industry, which aims to help users discover the next interesting item. In this article, we propose a novel solution, namely graph-augmented social translation model (GAST), which investigates the utility of dynamic social influence for the task of next-item recommendation. Specifically, we introduce a gated graph convolution module to better model long-term user preference. Furthermore, we design a cogating module to capture dynamic patterns at both sequential level and social level. In addition, a social-enhanced translation mechanism is devised to measure the intensity of user-item relationships. Extensive experiments under different recommendation scenarios demonstrate the rationality and effectiveness of our proposed GAST method over several state-ofthe-art methods.

...

7.Graph-Augmented Co-Attention Model for Socio-Sequential Recommendation

关键词：
Social networking (online); Motion pictures; Convolution; Representationlearning; Recurrent neural networks; Predictive models; Matrixdecomposition; Attention mechanisms; graph convolutional networks;sequential recommendation; social influence;FACTORIZATION

Wu, Bin;He, Xiangnan;Wu, Le;Zhang, Xue;Ye, Yangdong
《IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS》
2023年
卷
期
期刊

A sequential recommendation has become a hot research topic, which seeks to predict the next interesting item for each user based on his action sequence. While previous methods have made many efforts to capture the dynamics of sequential patterns, we contend that they still suffer from two inherent limitations: 1) they fail to model item transition patterns in an efficient and time-sensitive manner and 2) they are unaware of the importance of dynamically capturing social influence, resulting in suboptimal performance. We introduce a new concept dubbed socio-sequential recommendation, where the challenge mainly lies in dynamically modeling social influences and capturing item-to-item transition patterns in a time-sensitive manner. In light of this, we contribute a novel solution named GCARec (short for graph-augmented co-attention model), which takes into account the joint effect of dynamic sequential patterns and dynamic social influences. GCARec decomposes socio-sequential recommendation workflow into two steps. First, we adopt a light graph embedding module to model long-term user preference. Then, we propose a time-sensitive attention mechanism and a social-aware attention mechanism to capture dynamic patterns at sequential-level and social-level, respectively. Extensive experiments have been conducted on eight real-world datasets from different scenarios, demonstrating the superiority of GCARec against several state-of-the-art methods. The codes and datasets have been released at: https://github.com/wubinzzu/GCARec.

...

8.Multiview Clustering With Propagating Information Bottleneck

关键词：
Information bottleneck (IB); information prop-agation; multiviewclustering (MVC); self-guided learning;REPRESENTATIONS

Hu, Shizhe;Shi, Zenglin;Yan, Xiaoqiang;Lou, Zhengzheng;Ye, Yangdong
《IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS》
2023年
卷
期
期刊

In many practical applications, massive data are observed from multiple sources, each of which contains multiple cohesive views, called hierarchical multiview (HMV) data, such as image-text objects with different types of visual and textual features. Naturally, the inclusion of source and view relationships offers a comprehensive view of the input HMV data and achieves an informative and correct clustering result. However, most existing multiview clustering (MVC) methods can only process single-source data with multiple views or multisource data with single type of feature, failing to consider all the views across multiple sources. Observing the rich closely related multivariate (i.e., source and view) information and the potential dynamic information flow interacting among them, in this article, a general hierarchical information propagation model is first built to address the above challenging problem. It describes the process from optimal feature subspace learning (OFSL) of each source to final clustering structure learning (CSL). Then, a novel self-guided method named propagating information bottleneck (PIB) is proposed to realize the model. It works in a circulating propagation fashion, so that the resulting clustering structure obtained from the last iteration can "self-guide " the OFSL of each source, and the learned subspaces are in turn used to conduct the subsequent CSL. We theoretically analyze the relationship between the cluster structures learned in the CSL phase and the preservation of relevant information propagated from the OFSL phase. Finally, a two-step alternating optimization method is carefully designed for optimization. Experimental results on various datasets show the superiority of the proposed PIB method over several state-of-the-art methods.

...

9.一种联合成对排序的协同过滤推荐算法研究

关键词：
物品推荐;成对排序;协同过滤;隐式反馈;矩阵分解

陈允
指导老师：郑州大学叶阳东
0年
学位论文

随着互联网和信息技术的快速发展,大量信息快速地涌入互联网,丰富的信息在给用户带来便利的同时,也导致了信息过载。信息检索领域一直被认为是解决信息过载问题的有效方法之一,帮助用户快速地从海量信息中获取有价值的信息。相比于传统的信息检索技术而言,推荐系统能够主动向用户提供可能感兴趣的信息且无需用户的明确需求等特性,而成为缓解信息过载问题的重要工具之一,并在业界得到了广泛研究与应用。众多电子商务网站和多媒体平台在现有的系统基础上能够较为容易地嵌入个性化推荐技术,例如,亚马逊购物网站可以帮助用户推送感兴趣的商品信息,今日头条向用户推送当日其可能感兴趣的新闻信息。推荐技术的使用在一定程度上不仅增加了用户的参与度及对应用的信任度和依赖度,而且为使用该技术应用上的商家带来可观的收入,例如电影,新闻和POI推荐等。在现实生活中,用户的消费行为复杂而多样,通常受到许多因素的影响。用户所做出购买决策不仅出于自身喜好,还会考虑历史购买物品与即将购买的物品在功能上的关系。本文主要贡献点如下:（1）现有的推荐算法大多仅从用户的角度更细粒度地构建模型,而忽略了物品之间在功能上的互补关系对用户做出购买决策的影响。针对此问题,本文从用户和物品这两个角度,依据用户-物品之间的交互关系和物品之间在功能上的互补关系分别对特定用户和物品构建样本对,提出了联合成对排序推荐模型。（2）对于成对排序方法而言,负样本的选取将直接影响模型的收敛速度和推荐精度。在依据上述两种关系分别对用户和物品构建样本对时,为加快模型的收敛速度,构建了一种新颖的排序感知采样策略。该策略根据正样本排名位置,选取更为有效的负样本并定义了权重函数动态控制模型学习梯度。（3）设计了一种高效的协同过滤推荐算法CPR。在四个数据集上的实验结果表明,本文算法在在多个指标（Precision、Recall、MAP和NDCG）及收敛速度上均优于当前主流的推荐算法。

...

10.Gating augmented capsule network for sequential recommendation

Zhang, Qi ; Wu, Bin ; Sun, Zhongchuan ; Ye, Yangdong
《Knowledge-Based Systems》
2022年
247卷
期
期刊

Sequential recommendation has become a popular and indispensable component of various online services, which aims to predict the next interested item based on the sequence of a certain user. To deduce users’ actual interests, sequential recommenders concentrate on analyzing the complex transition dependency from the user's recent action sequence. The key types of item transition patterns can be generally divided into item-level and factor-level. However, most existing works directly focus on a one-channel chain of interaction sequence, and only capture item co-occurrence patterns from item-level. They neglect the availability of transitions among items’ latent attributes. Toward this end, we propose a Gating Augmented Capsule Network (GAC), which models both personalized item- and factor-level transitions in a fine-grained manner. Specifically, to distill user-specific information, we present a personalized gating module to replace the convolution operation of the traditional capsule network, so as to augment the links between the user and each item. Moreover, we design an item-routing component and a factor-routing component to build a two-channel routing module for capturing item- and factor-level interactions, respectively, while preserving the relative order of items in the action sequence. Extensive experiments on four public benchmarks demonstrate the effectiveness of our proposed GAC compared to several state-of-the-art baselines. © 2022 Elsevier B.V.

...

排序方式：时间相关性
显示方式：列表摘要