Exploring Developers Discussion Forums for Quantum Software Engineering: A Fine-Grained Classification Approach Using Large Language Model (ChatGPT)

Mobashir Husain, Muhammad Sohail Khan, Javed Ali Khan, Nek Dil Khan,, Arif Khan, Muhammad Azeem Akbar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Quantum Software Engineering (QSE) has recently emerged as a potential research and development direction frequently practiced by many tech joints. However, quantum developers face challenges in optimizing quantum computing and QSE concepts. Quantum developers use the Stack Overflow (SO) platform to report and discuss quantum-related challenges. Also, quantum practitioners use specialized quantum tags to label quantum-related posts in developers' forums. However, these quantum tags referred to more technical quantum aspects than the developer posts. Therefore, categorizing quantum practitioners' questions based on quantum concepts can help quantum developers better identify frequently occurring challenges to QSE. For this purpose, we conducted qualitative and quantitative studies to classify quantum developers' questions into various frequently occurring quantum-related challenges. We extracted 2829 developers' questions from various Q&A platforms using queries and filters based on quantum-related tags. Next, the developers' posts on the Q&A forums were critically analyzed to identify frequently discussed quantum-related challenges and develop a novel grounded theory. The frequent quantum developer challenges identified by analyzing practitioners' discussions in the Q&A forums include Tooling, Theoretical, Learning, Conceptual, Errors, and API Usage. Moreover, using content analysis and grounded theory, the developers' discussions were annotated with commonly reported quantum challenges to develop a ground truth and a novel dataset. A Large Language model (ChatGPT) was used to validate the human annotation and overcome disagreements. Finally, various fine-tuned Deep and Machine learning (D&ML) classifiers automatically classify developer discussions into commonly reported quantum challenges. Additionally, to improve the classification results, we utilized textual data augmentation approaches, such as random deletion, swapping, and insertion with the D&ML classifiers. We obtained average accuracies of 89%, 86%, 84%, 84%, and 80% with FNN, CNN, LSTM, GRU, and RNN classifiers, respectively. This helps quantum researchers and vendors propose solutions and tools to frequently occurring issues for quantum developers.
Original languageEnglish
Title of host publicationFSE Companion '25: Proceedings of the 33rd ACM International Conference on the Foundations of Software Engineering
EditorsJingyue Li
PublisherACM Press
Pages1742-1755
Number of pages14
ISBN (Electronic)9798400712760
DOIs
Publication statusPublished - 28 Jul 2025
EventFSE Companion '25: 33rd ACM International Conference on the Foundations of Software Engineering - Trondheim, Norway
Duration: 23 Jun 202528 Jun 2025

Publication series

NameProceedings of the ACM SIGSOFT Symposium on the Foundations of Software Engineering
ISSN (Print)1539-7521

Conference

ConferenceFSE Companion '25: 33rd ACM International Conference on the Foundations of Software Engineering
Country/TerritoryNorway
Period23/06/2528/06/25

Keywords

  • Natural language processing
  • Quantum Software Engineering
  • Repository mining
  • Stack overflow
  • developer forums
  • machine and deep learning

Fingerprint

Dive into the research topics of 'Exploring Developers Discussion Forums for Quantum Software Engineering: A Fine-Grained Classification Approach Using Large Language Model (ChatGPT)'. Together they form a unique fingerprint.

Cite this