IDEAS home Printed from http://ideas.repec.org/a/spr/scient/v128y2023i11d10.1007_s11192-023-04819-x.html
   My bibliography  Save this article

Identification of emerging technology topics (ETTs) using BERT-based model and sematic analysis: a perspective of multiple-field characteristics of patented inventions (MFCOPIs)

Author

Listed:
  • Bowen Song

    (Dalian University of Technology)

  • Chunjuan Luan

    (Dalian University of Technology
    Dalian University of Technology)

  • Danni Liang

    (Dalian University of Technology)

Abstract

The proliferation of large language models (LLMs) has significantly expanded the landscape of research on technology opportunity identification. However, there remains a crucial need to enhance the accuracy and interpretability of results obtained through emerging technology topic identification. In this paper, we present a novel approach that leverages a BERT-based model and semantic analysis to identify emerging technology topics (ETTs) from the perspective of multiple-field characteristics of patented inventions (MFCOPIs). By utilizing a unique dataset encompassing MFCOPI, our methodology emphasizes an increased proportion of novel technical processes in the analysis content while mitigating the interference of redundant technical information. To enhance the interpretability of recognition results, our proposed model employs the BERT model for detecting potential content similarities in inventive characteristics and incorporates semantic structure analysis to expand the technical process content. We empirically validate our model by employing nanotechnology as a case study, demonstrating its effectiveness and accuracy. Through our research, we extend the existing methodologies for recognizing emerging technology, ultimately elevating the quality of recognition results.

Suggested Citation

  • Bowen Song & Chunjuan Luan & Danni Liang, 2023. "Identification of emerging technology topics (ETTs) using BERT-based model and sematic analysis: a perspective of multiple-field characteristics of patented inventions (MFCOPIs)," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(11), pages 5883-5904, November.
  • Handle: RePEc:spr:scient:v:128:y:2023:i:11:d:10.1007_s11192-023-04819-x
    DOI: 10.1007/s11192-023-04819-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-023-04819-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: http://libkey.io/10.1007/s11192-023-04819-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Janghyeok Yoon & Sungchul Choi & Kwangsoo Kim, 2011. "Invention property-function network analysis of patents: a case of silicon-based thin film solar cells," Scientometrics, Springer;Akadémiai Kiadó, vol. 86(3), pages 687-703, March.
    2. David J. TEECE, 2008. "Profiting from technological innovation: Implications for integration, collaboration, licensing and public policy," World Scientific Book Chapters, in: The Transfer And Licensing Of Know-How And Intellectual Property Understanding the Multinational Enterprise in the Modern World, chapter 5, pages 67-87, World Scientific Publishing Co. Pte. Ltd..
    3. Zhang, Yi & Lu, Jie & Liu, Feng & Liu, Qian & Porter, Alan & Chen, Hongshu & Zhang, Guangquan, 2018. "Does deep learning help topic extraction? A kernel k-means clustering method with word embedding," Journal of Informetrics, Elsevier, vol. 12(4), pages 1099-1117.
    4. Sıla Öcalan Özel & Julien Pénin, 2016. "Exclusive or open? An economic analysis of university intellectual property patenting and licensing strategies," Journal of Innovation Economics, De Boeck Université, vol. 0(3), pages 133-153.
    5. Florian Kreuchauff & Vladimir Korzinov, 2017. "A patent search strategy based on machine learning for the emerging field of service robotics," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(2), pages 743-772, May.
    6. Qi Wang, 2018. "A bibliometric model for identifying emerging research topics," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 69(2), pages 290-304, February.
    7. Sungchul Choi & Janghyeok Yoon & Kwangsoo Kim & Jae Yeol Lee & Cheol-Han Kim, 2011. "SAO network analysis of patents for technology trends identification: a case study of polymer electrolyte membrane technology in proton exchange membrane fuel cells," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(3), pages 863-883, September.
    8. Choudhury, Nazim & Faisal, Fahim & Khushi, Matloob, 2020. "Mining Temporal Evolution of Knowledge Graphs and Genealogical Features for Literature-based Discovery Prediction," Journal of Informetrics, Elsevier, vol. 14(3).
    9. Mendonca, Sandro & Pereira, Tiago Santos & Godinho, Manuel Mira, 2004. "Trademarks as an indicator of innovation and industrial change," Research Policy, Elsevier, vol. 33(9), pages 1385-1404, November.
    10. Adam B. Jaffe & Gaétan de Rassenfosse, 2017. "Patent citation data in social science research: Overview and best practices," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(6), pages 1360-1374, June.
    11. Sara Reardon, 2014. "Text-mining offers clues to success," Nature, Nature, vol. 509(7501), pages 410-410, May.
    12. R. Zhang & Y. Zhang & Z. C. Dong & S. Jiang & C. Zhang & L. G. Chen & L. Zhang & Y. Liao & J. Aizpurua & Y. Luo & J. L. Yang & J. G. Hou, 2013. "Chemical mapping of a single molecule by plasmon-enhanced Raman scattering," Nature, Nature, vol. 498(7452), pages 82-86, June.
    13. Rotolo, Daniele & Hicks, Diana & Martin, Ben R., 2015. "What is an emerging technology?," Research Policy, Elsevier, vol. 44(10), pages 1827-1843.
    14. Jan M. Gerken & Martin G. Moehrle, 2012. "A new instrument for technology monitoring: novelty in patents measured by semantic patent analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(3), pages 645-670, June.
    15. Hyunseok Park & Janghyeok Yoon & Kwangsoo Kim, 2012. "Identifying patent infringement using SAO based semantic technological similarities," Scientometrics, Springer;Akadémiai Kiadó, vol. 90(2), pages 515-529, February.
    16. Ma, Tingting & Zhou, Xiao & Liu, Jia & Lou, Zhenkai & Hua, Zhaoting & Wang, Ruitao, 2021. "Combining topic modeling and SAO semantic analysis to identify technological opportunities of emerging technologies," Technological Forecasting and Social Change, Elsevier, vol. 173(C).
    17. Lee, Changyong, 2021. "A review of data analytics in technological forecasting," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    18. Yuan Zhou & Fang Dong & Yufei Liu & Zhaofu Li & JunFei Du & Li Zhang, 2020. "Forecasting emerging technologies using data augmentation and deep learning," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(1), pages 1-29, April.
    19. Zhang, Yi & Wu, Mengjia & Miao, Wen & Huang, Lu & Lu, Jie, 2021. "Bi-layer network analytics: A methodology for characterizing emerging general-purpose technologies," Journal of Informetrics, Elsevier, vol. 15(4).
    20. Péter Érdi & Kinga Makovi & Zoltán Somogyvári & Katherine Strandburg & Jan Tobochnik & Péter Volf & László Zalányi, 2013. "Prediction of emerging technologies based on analysis of the US patent citation network," Scientometrics, Springer;Akadémiai Kiadó, vol. 95(1), pages 225-242, April.
    21. Saeed-Ul Hassan & Mubashir Imran & Sehrish Iqbal & Naif Radi Aljohani & Raheel Nawaz, 2018. "Deep context of citations using machine-learning models in scholarly full-text articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1645-1662, December.
    22. Ernst, Holger, 2003. "Patent information for strategic technology management," World Patent Information, Elsevier, vol. 25(3), pages 233-242, September.
    23. Yi Zhang & Guangquan Zhang & Donghua Zhu & Jie Lu, 2017. "Scientific evolutionary pathways: Identifying and visualizing relationships for scientific topics," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(8), pages 1925-1939, August.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Zhenfeng & Feng, Jian & Uden, Lorna, 2023. "Technology opportunity analysis using hierarchical semantic networks and dual link prediction," Technovation, Elsevier, vol. 128(C).
    2. Yuan Zhou & Fang Dong & Yufei Liu & Liang Ran, 2021. "A deep learning framework to early identify emerging technologies in large-scale outlier patents: an empirical study of CNC machine tool," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 969-994, February.
    3. Zhenyu Yang & Wenyu Zhang & Zhimin Wang & Xiaoling Huang, 2024. "A deep learning-based method for predicting the emerging degree of research topics using emerging index," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 4021-4042, July.
    4. Roman Jurowetzki, 2015. "Unpacking Big Systems - Natural Language Processing meets Network Analysis. A Study of Smart Grid Development in Denmark," SPRU Working Paper Series 2015-15, SPRU - Science Policy Research Unit, University of Sussex Business School.
    5. Huang, Lu & Chen, Xiang & Ni, Xingxing & Liu, Jiarun & Cao, Xiaoli & Wang, Changtian, 2021. "Tracking the dynamics of co-word networks for emerging topic identification," Technological Forecasting and Social Change, Elsevier, vol. 170(C).
    6. Richarz, Jan & Wegewitz, Stephan & Henn, Sarah & Müller, Dirk, 2023. "Graph-based research field analysis by the use of natural language processing: An overview of German energy research," Technological Forecasting and Social Change, Elsevier, vol. 186(PB).
    7. Jose M. Vicente-Gomila & Anna Palli & Begoña Calle & Miguel A. Artacho & Sara Jimenez, 2017. "Discovering shifts in competitive strategies in probiotics, accelerated with TechMining," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1907-1923, June.
    8. Wooseok Jang & Yongtae Park & Hyeonju Seol, 2021. "Identifying emerging technologies using expert opinions on the future: A topic modeling and fuzzy clustering approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6505-6532, August.
    9. Joung, Junegak & Kim, Kwangsoo, 2017. "Monitoring emerging technologies for technology planning using technical keyword based analysis from patent data," Technological Forecasting and Social Change, Elsevier, vol. 114(C), pages 281-292.
    10. Yoon, Janghyeok & Park, Hyunseok & Seo, Wonchul & Lee, Jae-Min & Coh, Byoung-youl & Kim, Jonghwa, 2015. "Technology opportunity discovery (TOD) from existing technologies and products: A function-based TOD framework," Technological Forecasting and Social Change, Elsevier, vol. 100(C), pages 153-167.
    11. Lu Huang & Xiang Chen & Yi Zhang & Changtian Wang & Xiaoli Cao & Jiarun Liu, 2022. "Identification of topic evolution: network analytics with piecewise linear representation and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5353-5383, September.
    12. Chao Yang & Donghua Zhu & Xuefeng Wang & Yi Zhang & Guangquan Zhang & Jie Lu, 2017. "Requirement-oriented core technological components’ identification based on SAO analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1229-1248, September.
    13. Zhang, Yi & Wu, Mengjia & Miao, Wen & Huang, Lu & Lu, Jie, 2021. "Bi-layer network analytics: A methodology for characterizing emerging general-purpose technologies," Journal of Informetrics, Elsevier, vol. 15(4).
    14. Yang, Chao & Huang, Cui & Su, Jun, 2018. "An improved SAO network-based method for technology trend analysis: A case study of graphene," Journal of Informetrics, Elsevier, vol. 12(1), pages 271-286.
    15. Zamani, Mehdi & Yalcin, Haydar & Naeini, Ali Bonyadi & Zeba, Gordana & Daim, Tugrul U, 2022. "Developing metrics for emerging technologies: identification and assessment," Technological Forecasting and Social Change, Elsevier, vol. 176(C).
    16. Ren, Haiying & Zhao, Yuhui, 2021. "Technology opportunity discovery based on constructing, evaluating, and searching knowledge networks," Technovation, Elsevier, vol. 101(C).
    17. Jung, Sukhwan & Segev, Aviv, 2022. "DAC: Descendant-aware clustering algorithm for network-based topic emergence prediction," Journal of Informetrics, Elsevier, vol. 16(3).
    18. Farshad Madani, 2015. "‘Technology Mining’ bibliometrics analysis: applying network analysis and cluster analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(1), pages 323-335, October.
    19. Puccetti, Giovanni & Giordano, Vito & Spada, Irene & Chiarello, Filippo & Fantoni, Gualtiero, 2023. "Technology identification from patent texts: A novel named entity recognition method," Technological Forecasting and Social Change, Elsevier, vol. 186(PB).
    20. Li, Munan & Porter, Alan L. & Suominen, Arho & Burmaoglu, Serhat & Carley, Stephen, 2021. "An exploratory perspective to measure the emergence degree for a specific technology based on the philosophy of swarm intelligence," Technological Forecasting and Social Change, Elsevier, vol. 166(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:128:y:2023:i:11:d:10.1007_s11192-023-04819-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.