Forecasting Publication’s Success Using Machine Learning

Rand Alchokr (Corresponding author), Rayed Haider (Corresponding author), Yusra Shakeel (Corresponding author), Thomas Leich, Gunter Saake, Jacob Krüger

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

97 Downloads (Pure)

Abstract

Measuring the success and impact of a scientific publication is an important, thus controversial matter. Despite all the criticism, it is widespread that citation counts is considered a popular indication of a publication‘s success. Therefore, in this paper, we use a machine learning framework to test the ability of alternative metrics (altmetrics) to predict the future impact of papers reflected in the citation counts. To achieve the experiment, we extracted 7,588 papers from 10 computer science journals. To build the feature space for the prediction problem, 14 different altmetric indices were collected, 3 feature selection approaches, namely, Variance threshold, Pearson’s Correlation, and Mutual information method, were used to minimize the feature space and rank the features according to their contribution to the original dataset. To identify the classification performance of these features, three classifiers were used: Decision Tree, Random Forest, and Support Vector Machines. According to the experimental data, altmetrics can predict future citations and the most useful altmetrics indications are social media count, tweets, news count, capture count, and full-text view, with Random Forest outperforming the other classifiers.
Original languageEnglish
Title of host publicationBIR 2023 : Bibliometric-enhanced Information Retrieval
Subtitle of host publicationProceedings of the 13th International Workshop on Bibliometric-enhanced Information Retrieval co-located with 45th European Conference on Information Retrieval (ECIR 2023)
EditorsIngo Frommholz, Philipp Mayr, Guillaume Cabanac, Suzan Verberne, Jordan Brennan
PublisherCEUR-WS.org
Pages77-89
Number of pages13
Publication statusPublished - 2023
Event13th International Workshop on Bibliometric-enhanced Information Retrieval, BIR 2023 - Dublin, Ireland
Duration: 2 Apr 20232 Apr 2023

Publication series

NameCEUR Workshop Proceedings
Volume3617
ISSN (Electronic)1613-0073

Workshop

Workshop13th International Workshop on Bibliometric-enhanced Information Retrieval, BIR 2023
Abbreviated titleBIR 2023
Country/TerritoryIreland
CityDublin
Period2/04/232/04/23

Keywords

  • Bibliometric
  • Alternative metrics,
  • Machine learning
  • Computer Science

Fingerprint

Dive into the research topics of 'Forecasting Publication’s Success Using Machine Learning'. Together they form a unique fingerprint.

Cite this