+ All Categories
Home > Documents > Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline...

Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline...

Date post: 19-Aug-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
14
Chair of Software Engineering for Business Information Systems (sebis) Faculty of Informatics Technische Universität München wwwmatthes.in.tum.de Master thesis: Automatic Extraction of Design Decision Relationships from a Task Management System Matthias Ruppel, 8 th of November 2017, Munich
Transcript
Page 1: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Chair of Software Engineering for Business Information Systems (sebis) Faculty of InformaticsTechnische Universität Münchenwwwmatthes.in.tum.de

Master thesis: Automatic Extraction of Design Decision Relationships from a Task Management SystemMatthias Ruppel, 8th of November 2017, Munich

Page 2: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

I. Introduction and MotivationII. Concepts

• Architectural Design Decision• Quality Attributes

III. OSS and NFR DatasetIV. Classification by KeywordsV. MethodologyVI. ResultsVII.Conclusion and Outlook

Outline

© sebis 2Matthias Ruppel, 8th of November 2017, Master thesis final presentation

Page 3: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Introduction | Motivation

• Many architectural design decisions are made during development & maintenance• Documenting takes a lot of effort, time & costs• Architectural design decisions are hard to capture• Current design decisions may interfere with previous design decisions

• Implicitly taken, not explicitly captured & documented• Rational / Cause / Concern is not evident in the documentation

© sebisMatthias Ruppel, 8th of November 2017, Master thesis final presentation 3

Page 4: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Architectural Design Decision

© sebis 4

Source: Jansen, A. G. J. (2008). Architectural design decisions s.n.Source: Zimmermann et. al. (2009). Managing architectural decision models with dependency relations, integrity constraints, and production rules

DefinitionA description of the choice and considered alternatives that (partially) realize one or more requirements. Alternatives consist of a set of architectural additions, subtractions and modifications to the software architecture, the rationale, and the design rules, design constraints and additional requirements.

Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Page 5: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Non-Functional Requirements: ISO 9126-1 – Types of Quality

© sebis 5

ISO 9126-1 classifies software quality in a structured set of characteristics and sub-characteristics. Each quality sub-characteristic is further divided into attributes.

Source: SO/IEC 9126-1:2001 Software engineering - Product quality - Part 1: Quality model.

Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Page 6: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

OSS and NFR Dataset

© sebis 6Matthias Ruppel, 10th of November 2017, Master thesis final presentation

0 50 100 150 200 250 300

Functinal (F)Availability (A)

Fault Tolerance (FT)Legal (L)

Look & Feel (LF)Maintainability (MN)

Operational (O)Performance (PE)

Portability (PO)Scalability (SC)

Security (SE)Usability (US)

DataExtraction

DataCuration

ManualLabeling

OSS Dataset

NFR Dataset- Dataset of requirements of a software project,

provided by PROMISE- Used by other scholars within text

classification publications- 40% FR and 60% NFR- Potential issue: underrepresentation of

certain Quality Attributes

OSS Dataset- Apache Spark and Apache Hadoop OSS- Public available Jira Issues- Complex and extensive open source

frameworks provided for Scala, Java & Python

- Limited documentation

Page 7: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Classification by Keywords

© sebis 7

Quality Attribute Keywords

Security Confidentiality, integrity, completeness, accuracy, perturbation, virus, access, authorization, rule, validation, audit, biometrics, card, key, password, alarm, encryption, noise

Performance Space, time, memory, storage, response,throughput, peak, mean, index, compress, uncompress, runtime, perform, execute, dynamic, offset, reduce, fixing, early, late

Source: Cleland-Huang et. Al. (2007). Automated classification of non-functional requirements

Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Results• Source were puplication, which extracted

keyword to predict quality attributes• Dependent on Context• Only for a few NFRs• Poor performance on OSS and NFR dataset:

• Very low precision rate i.e. 1% (Usability with keywords from Slankas et al., (2013)

• Recall rate on NFR dataset is very high, with mostly 92% - 100%

Design Decision

KeywordMatching

QualityAttribute

Keyword Classification

Page 8: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Methodology: Text Classification

© sebis 8Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Design Decision

Quality AttributeMachine

Learning

Algorithm

FeatureExtraction

remov authent test …

Features

Trai

ning

Documents to Categorize

Quality Attribute

Classifier

Model

FeatureExtraction

cooki token on …

Features

Pred

ictio

n

Source: Adapted from Witten (2016). Practical Machine Learning Tools and Techniques

Page 9: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Methodology: Feature Extraction and Selection

© sebis 9Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Quality AttributeMachine

Learning

Algorithm

FeatureExtraction

remov authent test …

Trai

ning

Source: Own Illustration

Removing Digits&

Punctuation MarksStemming

Removing

Stop WordsTokenizing Text

Feature Extraction

Feature Selection

with

InformationGain

Page 10: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Methodology: Tokenization and Machine Learning Algorithms

© sebis 10Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Tokenizing TextBag of Words

N-gram

Machine Learning Algorithm

SVM C4.5 Multinomial Naïve Bayes

OSSDataset

NFRDataset

Page 11: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Methodology: Features

© sebis 11Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Features

Bag of words

remov,authent,test,add,support,upgrad,configur,unsaferow,perform,column,renam,auth,token,credenti,spee,cooki, password

N-gramremov,add,authent,test,support,upgrad,perform,improv,unsaferow in, column, renam, unsaferow, support unsaferow, in,auth, support unsaferow,token,remove it, credenti, speed, improve perform,is based on, cooki, authentication mechan,the authent,http authent,password

Bag of words

second,onli,us,no,access,with,interfac,avail,user,than,oper,minut,time,year,compli,author,easi,more,under,90%,hour,player,allow,server,support,after,standard,respons,let,includ,updat,0,can,class,per,train,longer,regul,mainten,ensur,environ,successfulli,simultan,expect

N-gram

us,second,onli,user,the,shall,no,product shall b,access,with,interfac,interface with, avail, be avail, to,oper,minut,time,than,of,year,after,comply with, compli,author,updat,interface with th,easi,inund,under,allow,90%,90% of,shall interfac,hour,shall allow,shall be avail,be easi,to us,server,shall interface with,standard,and,the product must,product must,shall be easi,player,users shal,system shall let,5 second,response tim,shall let,let,be available for,available for, includ, by, 0, per, respons, train, longer, regul,class,displai,mainten,ensur,environ,for us,shall ensur,only author,shall ensure that,ensure that,easyto,be easy to,longer than,successfully,available for us,seconds th,using th,simultan,no mor,no more than,expect,expected to,have access to,have access,to successfulli,in under 5,be no mor,under 5,be no

OSSDataset

NFRDataset

Page 12: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Results: Performance Evaluation

© sebis 12Matthias Ruppel, 10th of November 2017, Master thesis final presentation

0

0,2

0,4

0,6

0,8

1

F A L LF MN O PE SC SE US FT PO

F-M

easu

re

NFR Dataset • Bag of Words

J4.8 NaiveBayesMult SVM

0

0,2

0,4

0,6

0,8

1

PO F PE MN FT O US SE A

F-M

easu

re

OSS Dataset • Bag of Words

J4.8 NaiveBayesMult SVM

0

0,2

0,4

0,6

0,8

1

F A L LF MN O PE SC SE US FT PO

F-M

easu

re

NFR Dataset • N-gram

J4.8 NaiveBayesMult SVM

0

0,2

0,4

0,6

0,8

1

PO F PE MN FT O US SE A

F-M

easu

re

OSS Dataset • N-gram

J4.8 NaiveBayesMult SVM

Page 13: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Conclusion and Outlook

© sebis 13

- Quality Attributes (QAs) are often considered as the most important decision drivers and have a positive influence on the satisfaction of stakeholders

- During the elicitation process, requirements are kept in various documents and different formats, and usually they are not properly categorized.Ø Information should kept on a central place

- A framework should be used to capture design decisionsØ How could be this included into the development process of a real project?

Matthias Ruppel, 10th of November 2017, Master thesis final presentation

Page 14: Master thesis:Automatic Extraction of Design Decision … · VII.Conclusion and Outlook Outline Matthias Ruppel, 8 th of November 2017, ... Automated classification of non-functional

Technische Universität MünchenFaculty of InformaticsChair of Software Engineering for Business Information Systems

Boltzmannstraße 385748 Garching bei München

Tel +49.89.289.Fax +49.89.289.17136

wwwmatthes.in.tum.de

Matthias [email protected]


Recommended