Abstract
There are a vast number of quantitative and qualitative technological forecasting methods. In the last decade, advanced quantitative technological forecasting methods based on the various applications of data science approaches have been proposed. Text mining is one of the key approaches used to examine large datasets consisting of scientific publications and patent documents with the aim of offering foresight for a selected area. However, the existing related studies either perform a qualitative approach by analysing the recent data to identify the emerging topics or use extrapolation techniques to predict the future values of some statistical terms or the future frequency of some important keywords. In this study, different from such related studies, we propose a deep learning-based framework to predict future co-similarity matrix representing the possible new and disappearing interactions between the words in the future. For this purpose, word vectors are generated using a word embedding technique and the temporal changes of the associations between the words are modelled using Long Short-Term Memory networks for the future estimation of the word embedding matrix. The text mining area is chosen as a case study. The clusters of the terms extracted from the predicted word embedding matrices were analysed and potentially emerging areas were identified for different prediction horizon lengths. The accuracy of the proposed model was analysed based on a set of evaluation metrics that measure the amount of overlapping between the actual and predicted word maps. The quantitative analysis showed that the proposed system can successfully identify the emerging and disappearing areas and can be used as a decision-making tool for the future projection of other areas.
Original language | English |
---|---|
Article number | 122520 |
Number of pages | 16 |
Journal | Technological Forecasting and Social Change |
Volume | 191 |
Early online date | 28 Mar 2023 |
DOIs | |
Publication status | Published - 1 Jun 2023 |
Keywords
- Deep learning
- Emerging topics
- Natural language processing
- Technological forecasting
- Text mining
- Trend analysis