Self-configured framework for scalable link prediction in twitter: Towards autonomous spark framework
Document Type
Article
Publication Date
11-1-2022
Abstract
Scalable link prediction in social networks allow dynamic social interaction gathering, potential friend suggestions, and community detection. Distributed open-source frameworks such as Hadoop and Spark facilitate efficient link prediction especially in large-scale social networks. The frameworks provide different kinds of tunable properties for users to manually configure the parameters for the applications. However, manual configurations are open to performance issues when the applications start scaling tremendously, which are hard to set up and are exposed to human errors. This paper proposes a novel Self-Configured Framework (SCF) to provide an autonomous feature in Spark that predicts and sets the best configuration instantly before the application execution using the XGBoost classifier. The framework with a self-configuration setting demonstrates a 40% reduction in prediction time as well as a balanced resource consumption that makes full use of resources, especially for limited number and size of clusters. The presented framework establishes its efficiency for link prediction in large-scale social networks by automatically configuring the best configuration suitable for a specific application given the varying dataset size of the Twitter social network, workload, and cluster specification. (C) 2022 Elsevier B.V. All rights reserved.
Keywords
Self-Configured Framework, Link prediction, Social network, Large-scale
Divisions
fsktm
Funders
Ministry of Education, Malaysia [FRGS/1/2017/ICT04/UM/02/2]
Publication Title
Knowledge-Based Systems
Volume
255
Publisher
Elsevier
Publisher Location
RADARWEG 29, 1043 NX AMSTERDAM, NETHERLANDS