Data Mining from software project data in industries

We have applied various data mining techniques to data collected from industries in Japan. The aim of this study is to investigate significant factors to either of the quality of software product, cost for the development, and duration of the development.

So far, we used association rules mining, Bayesian belief network, decision trees, Bayesian classifier, factor analysis, and so on.

Related paper

  • J. Debari, O. Mizuno, T. Kikuno, N. Kikuchi, M. Hirayama, "Mining Project Improvement Hints from Cross-Company Data Using Association Rules," Trans. of Information Processing Society of Japan, 49(8), pp. 2791-2801, August 2008.
  • Y. Hamano, S. Amasaki, O. Mizuno, T. Kikuno, "Application of Association Rules Mining to Analysis of Risk Factors in Software Development Projects," JSSST Computer Software, 24(2), pp. 79-87, February 2007.
  • Y. Nakano, O. Mizuno, T. Kikuno, Y. Anan, M. Tanaka, "Analysis on Impact of Defect Density and Efficiency of Coding Review to Software Quality," SEC journal, 2(4), pp. 10-17, November 2006.
  • N. Kikuchi, T. Andou, O. Mizuno, T. Kikuno, "Key Factors in Process Management for Improving the Field Quality of Telecommunication Software Development," SEC journal, 2(1), pp. 26-35, January 2006.
  • S. Amasaki, Y. Takagi, O. Mizuno, and T. Kikuno, "Constructing a Bayesian Belief Network to Predict Final Quality in Embedded System Development," IEICE Trans. on Information and Systems, E88-D(6), pp. 1134-1141, June 2005. (JCR: 0.242 (2005))
  • S. Amasaki, T. Yoshitomi, O. Mizuno, Y. Takagi, and T. Kikuno, "A New Challenge for Applying Time Series Metrics Data to Software Quality Estimation," Software Quality Journal, 13(2), pp. 177-193, June 2005. (JCR: 0.529 (2005))
  • J. Debari, O. Mizuno, T. Kikuno, N. Kikuchi, and M. Hirayama, "On Deriving Actions for Improving Cost Overrun by Applying Association Rule Mining to Industrial Project Repository," In Proc. of International Conference on Software Process 2008 (ICSP2008), LNCS 5006, pp. 51-62, May 2008. (Leipzig, Germany) (Acceptance rate: 30%)
  • S. Amasaki, Y. Hamano, O. Mizuno, and T. Kikuno, "Characterization of Runaway Software Projects Using Association Rule Mining," In Proc. of 7th International Conference on Product Focused Software Process Improvement (PROFES2006), LNCS 4034, pp. 402-407, June 2006. (Amsterdam, The Netherlands) (Acceptance rate: 47.2%, 26/55)
  • S. Amasaki, Y. Takagi, O. Mizuno, and T. Kikuno, "A Bayesian Belief Network for Assessing the Likelihood of Fault Content," In Proc. of 14th International Symposium on Software Reliability Engineering (ISSRE2003), pp. 215-226, November 2003. (Denver, CO, USA) (Acceptance rate: 20%, 41/200)
  • O. Mizuno, E. Shigematsu, Y. Takagi, and T. Kikuno, "On Estimating Testing Effort Needed to Assure Field Quality in Software Development," In Proc. of 13th International Symposium on Software Reliability Engineering (ISSRE2002), pp. 139-146, November 2002. (Annapolis, MD, USA.) (Acceptance rate: 45%, 33/73)
  • N. Kikuchi, O. Mizuno, and T. Kikuno, "Identifying Key Attributes of Projects That Affect the Field Quality of Communication Software," In Proc. of 24th Annual International Computer Software and Applications Conference (COMPSAC2000), pp. 176-178, October 2000. (Taipei, Taiwan.)
  • O. Mizuno, T. Kikuno, K. Inagaki, Y. Takagi, and K. Sakamoto, "Analyzing Effects of Cost Estimation Accuracy on Quality and Productivity," In Proc. of 20th International Conference on Software Engineering (ICSE98), pp. 410-419, April 1998. (Kyoto, Japan.) (Acceptance rate: 19%, 41/209)
  • K. Inagaki, Y. Takagi, K. Sakamoto, and O. Mizuno, "Analyzing the Cost Estimation Accuracy in Software Project Respect to Productivity and Quality," In Proc. of International Symposium on Future Software Technology 97 (ISFST97), pp. 372-377, October 1997. (Xiamen, China.)
  • E. Choi and O. Mizuno, "Towards Quality Improvement and Analysis of Combinatorial Testing," In IPSJ/SIGSE Winter Workshop 2017 in Hida-Takayama (WWS2017), pp. 13-14, January 2017.
  • J. Debari, K. Ogata, T. Kikuno, O. Mizuno, N. Kikuchi, M. Hirayama, "A Reserch of the Cause of the Faults by Applying Association Rules to the Software Development Data," 情報処理学会研究報告 ソフトウェア工学(SE), 2010-SE-167(3), pp. 1-8, March 2010. (東京都)
  • J. Debari, K. Ogata, T. Kikuno, O. Mizuno, N. Kikuchi, M. Hirayama, "Extracting Relationships between Risk Factors of Software Projects with Association Rule Mining," 情報処理学会創立50周年記念全国大会(第72回全国大会), 5B-1, March 2010. (東京大学)
  • J. Debari, T. Kikuno, O. Mizuno, N. Kikuchi, M. Hirayama, "Extracting Risks of Software Projects by Clustering Association Rules," ウィンターワークショップ2010・イン・倉敷 論文集, pp. 115-116, January 2010. (倉敷)
  • T. Iida, O. Mizuno, T. Kikuno, S. Yoshioka, Y. Anan, M. Tanaka, "An Analysis of Causes of Faults After Release by Rule Mining on Software Metrics," Technical Report of IEICE, 108(384, KBSE2008-50), pp. 79-84, January 2009. (東京)
  • J. Debari, O. Mizuno, T. Kikuno, N. Kikuchi, M. Hirayama, "Analysis of Fault Density by Association Rule Mining Using Cross-Company Data," Technical Report of IEICE, 107(275, SS2007-36), pp. 35-40, October 2007. (宮城大学)
  • Y. Sasaki, S. Abe, O. Mizuno, T. Kikuno, S. Yoshioka, Y. Anan, M. Tanaka, "Selecting Metrics for Effective Software Quality Management Using Over-Sampling Method," Technical Report of IEICE, 107(4, SS2007-8), pp. 41-46, April 2007. (会津大学)
  • Y. Hamano, O. Mizuno, T. Kikuno, N. Kikuchi, M. Hirayama, "Software Productivity Analysis Using Association Rules Mining," IPSJ SIGSE Technical Report, 2007(33, 2007-SE-155), pp. 65-72, March 2007. (東京)
  • K. Kanemura, O. Mizuno, T. Kikuno, Y. Takagi, K. Sakamoto, "Studies on the Effects of Review Efficiency on Field Quality of Software Product," 電子情報通信学会技術研究報告, 99(682-683, SS99-69), pp. 1-7, March 2000.
  • K. Inagaki, Y. Takagi, K. Sakamoto, O. Mizuno, T. Kikuno, "Effects of Cost Estimation Accuracy on Quality and Productivity," 電子情報通信学会技術研究報告, 97(260-261, SS97-27), pp. 15-22, September 1997.