Date post: | 19-Dec-2015 |
Category: |
Documents |
View: | 222 times |
Download: | 0 times |
ThoughtsI want to combine Data Mining tools + Visualization
tools
I am motivated in using information in various forms to make informed decisions
I believe inherit software structure (compliable source code) has an advantage over free-form text from a data mining perspective
I wish to “mine” data from source code and “build” visual models of code representation that are useful from a software engineer’s perspective
Exploring “Code City” for Visual Representation
Classes are represented as buildings in the city. Packages are depicted as the districts in which the buildings reside.
CodeCity is programmed in VisualWorks Smalltalk on top of the Moose platform, uses OpenGL for rendering
Conceptual Model
SourceCode
DataMining
“Engine”
“Mining”Algorithms
DataOutput
Visualization“Engine”
VisualResults
Thesis Approach – Part iTheoretical Discussion
◦Data mining and visualization investigation
◦80’s and 90’s focus on program comprehension What worked What were dead-ends
(as important as what worked IMHO)
◦Literature review on program comprehension Gestalt principles were explored in previous class
◦Results of past empirical studies
Thesis Approach – Part 1Motivating Scenario
◦Problem that is not too big, but not too small ◦“Bob the programmer was given the assignment to
add enhancement X to legacy system Y.”
◦Bob has ability to mine data from source code and visualize results
◦Question: What information is MOST relevant for Bob to succeed? (bound problem)
Thesis Approach – Part 2Implementation
◦Moose tools for software analysis◦Code City for software visualization◦Source Code Analysis:
Public domain: Analyzing JHotDraw
Private domain: Analyzing 20+ year old legacy system at present
employer
Thesis Approach – Part 3• Empirical Study – Compare resultant artifacts
JHotDrawSource Code
DataMining
“Engine”+
Visualization“Engine”
JHotDrawArtifacts
Legacy SystemSource Code
Legacy SystemArtifacts
Compareto existing
JHotDraw artifacts
Compareto existing
Legacy System “expertise”
Thesis Approach – Part 4• Results and Conclusions…
“Rule of Thumb” Mathematical Model
“I am very curious how close to a workable mathematical model I can create based on the findings of my empirical study”