SlideShare ist ein Scribd-Unternehmen logo
1 von 19
Downloaden Sie, um offline zu lesen
Visualization of Knowledge Distribution
across Development Teams using
2.5D Semantic Software Maps
IVAPP | February 8th
2022, Vienna
Daniel Atzberger, Tim Cech, Adrian Jobst, Willy Scheibel,
Daniel Limberger, Matthias Trapp, and Jürgen Döllner
Hasso-Plattner-Institute, Digital Engineering Faculty, University of Potsdam, Germany
08.02.2022
Introduction | Motivation
”The people working in a software organization are its greatest
assets. It is expensive to recruit and retain good people, and it is up
to software managers to ensure that the engineers working on a
project are as productive as possible. In successful companies and
economies, this productivity is achieved when people are respected
by the organization and are assigned responsibilities that reflect
their skills and experience.”
I. Sommerville, Software Engineering. 9th Ed., Harlow 2016, p. 652
2 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Introduction | Problem Statement
• Focus of existing approaches: Mining expertise of developers from different domains, e.g.
source code (e.g. Linstead et al. (2007) Mining Developer Contributions via
author-topic models)
• In general, no interactive visualization provided for understanding raw analyses
• Idea: Visualize correlation between concepts and developer expertise on a 2,5D-map
3 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Introduction | Challenges
1 | Mining developer expertise
• Formal description of developer similarity based on their coding activities
• Extracting skill levels in general concepts, e.g., „machine learning“, or „blockchain“
2 | Visualization Requirements
• Displaying similarity between developers
• Displaying attributes of developers
• Interaction techniques for analyzing knowledge distributions
4 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Introduction | Idea & Approach
1 | Mining developer expertise
• Get meaningful corpus with natural language processing (NLP) techniques
• Application of Latent Dirichlet Allocation (LDA) on the commit history of developers
• Training an Labeled LDA (LLDA) model on a corpus of GitHub projects for extracting
vocabulary of a concept
2 | Visualization Requirements
• Based on extracted topics and document-topic distributions developers are placed on a
2D reference space
• Distances display semantic relatedness
• Data related to the expertise of developers mapped onto the visual variables of 3D glyphs
5 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Road to KnowhowMap | Process Overview
6 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Road to KnowhowMap | Mining Expertise
Assumption: Developers knowledge is directly encoded in the source code.
• Similar developers use a common vocabulary (e.g. Saxena and Pedanekar (2017): [...]
Mining candidate expertise from github repositories)
• Statistical language models can be used to describe developers as high-dimensional feature
vectors (e.g. Linstead et al. (2007): Mining eclipse developer contributions via
author-topic models)
7 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Road to KnowhowMap | Preprocessing – NLP
• Crawling source code files
• Remove symbols and split up words
• Remove very common words (stop words)
• Get corpus per concept and developer
8 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Road to KnowhowMap | Preprocessing – Vocabulary
Size of the vocabulary for number of GitHub projects that are tagged with the same concept
9 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Road to KnowhowMap | Preprocessing – LDA
10 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Road to KnowhowMap | Concept Mining with LLDA
• Document has only a non-zero value in a topic, when it is marked with its associated tag
• Training on a corpus of GitHub projects, leads to concept-specific vocabulary
• Locating keywords of a concept in the commit history of developer results in a skill level
Machine Learning Cryptocurrency Database Server Data Visualization
th order db request chart
tensor crypto table server series
self binance key header axis
cuda price name http pixi
model trade value body datum
license wallet opt response point
layer exchange sql message style
11 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Road to KnowhowMap | Layout Computation
• Input
• Vocabulary V
• Corpus of source code files C
• Topics ϕ1, . . . , ϕK as distributions over
the vocabulary V
• Document-topic-distributions θ1, . . . , θm
• Dissimilarity matrix according to
Jensen-Shannon distance Λ
• Output
• Reduced Topics ϕ̄1, . . . , ϕ̄K (with
Multidimensional Scaling over Λ)
• Position of a developer is given by
ÂŻ
di =
K
P
j=1
θ
(j)
i Ď•Ě„j
Topics visualized using LDAvis
12 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Visualization Approach | Visual Mapping
Exemplary atlas of 3D glyphs for representing developers and topics
13 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Visualization Approach | Annotations
On demand further details about the skills of a developer are displayed
14 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Visualization Approach | Example result
KnowhowMap for the Bitcoin Core project (github.com/bitcoin/bitcoin) based on 2000 commits.
15 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Conclusions
Contributions
• 2.5D visualization, showing semantic relatedness between developers based on their
source code activities
• Novel method for extracting skills in high-level concepts by training an LLDA model on a
dynamically generated corpus of GitHub projects
• Allows various visual mappings for different use cases
Future Work
• Further evaluation of the proposed expertise mining technique
• User study to evaluate the effectiveness of our visualization approach
16 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
Contact
• Daniel Atzberger
• Tim Cech,
tim.cech@hpi.uni-potsdam.de
• Adrian Jobst
• Willy Scheibel
• Daniel Limberger
• Dr. Matthias Trapp
• Prof. Dr. Jürgen Döllner
Acknowledgements
This work is part of the „Software-DNA“ project, which is funded
by the European Regional Development Fund (ERDF or EFRE in
German) and the State of Brandenburg (ILB). This work is part of
the KMU project „KnowhowAnalyzer“ (Förderkennzeichen
01IS20088B), which is funded by the German Ministry for
Education and Research (Bundesministerium fĂĽr Bildung und
Forschung).
17 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
References I
[Atzberger et al., 2022] Atzberger, D., Cech, T., Jobst, A., Scheibel, W., Limberger, D., Trapp,
M., and Döllner, J. (2022). Visualization of knowledge distribution across development
teams using 2.5d semantic software maps. In Proc. 13th International Conference on
Information Visualization Theory and Applications, IVAPP ’22. INSTICC, SciTePress.
[Blei et al., 2003] Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent dirichlet allocation.
Journal of Machine Learning Research, 3:993–1022.
[Cox and Cox, 2008] Cox, M. A. and Cox, T. F. (2008). Multidimensional scaling. In
Handbook of Data Visualization, pages 315–347. Springer.
[Kuhn et al., 2008] Kuhn, A., Loretan, P., and Nierstrasz, O. (2008). Consistent layout for
thematic software maps. In Proc. 15th Working Conference on Reverse Engineering, WCRE
’08, pages 209–218. IEEE.
[Linstead et al., 2007] Linstead, E., Rigor, P., Bajracharya, S., Lopes, C., and Baldi, P. (2007).
Mining eclipse developer contributions via author-topic models. In Proc. 4th International
Workshop on Mining Software Repositories, MSR ’07, pages 30:1–4.
18 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
References II
[Ramage et al., 2009] Ramage, D., Hall, D., Nallapati, R., and Manning, C. D. (2009).
Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. In
Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing,
pages 248–256, Singapore. Association for Computational Linguistics.
[Saxena and Pedanekar, 2017] Saxena, R. and Pedanekar, N. (2017). I know what you coded
last summer: Mining candidate expertise from github repositories. In Companion of the 2017
ACM Conference on Computer Supported Cooperative Work and Social Computing, pages
299–302.
[Sievert and Shirley, 2014] Sievert, C. and Shirley, K. (2014). Ldavis: A method for visualizing
and interpreting topics. In Proc. Workshop on Interactive Language Learning, Visualization,
and Interfaces, pages 63–70. ACL.
19 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021

Weitere ähnliche Inhalte

Ă„hnlich wie Visualizing Knowledge Distribution using 2.5D Semantic Maps

Understanding and Addressing Architectural Challenges of Cloud- Based Systems
Understanding and Addressing Architectural Challenges of Cloud- Based SystemsUnderstanding and Addressing Architectural Challenges of Cloud- Based Systems
Understanding and Addressing Architectural Challenges of Cloud- Based SystemsCREST @ University of Adelaide
 
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”Cisco DevNet
 
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...Dilnoza Bobokalonova
 
Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...Hans-Joerg Happel
 
Seminar VU Amsterdam 2015
Seminar VU Amsterdam 2015Seminar VU Amsterdam 2015
Seminar VU Amsterdam 2015Philipp Leitner
 
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...Dilnoza Bobokalonova
 
Neo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
Neo4j GraphTalk Oslo - Building Intelligent Solutions with GraphsNeo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
Neo4j GraphTalk Oslo - Building Intelligent Solutions with GraphsNeo4j
 
Sudipta_Mukherjee_Resume_APR_2023.pdf
Sudipta_Mukherjee_Resume_APR_2023.pdfSudipta_Mukherjee_Resume_APR_2023.pdf
Sudipta_Mukherjee_Resume_APR_2023.pdfsudipto801
 
Sodc 1 Introduction
Sodc 1 IntroductionSodc 1 Introduction
Sodc 1 IntroductionPrabhavathi P
 
3rd International Conference on Data Mining and Software Engineering (DMSE 2022)
3rd International Conference on Data Mining and Software Engineering (DMSE 2022)3rd International Conference on Data Mining and Software Engineering (DMSE 2022)
3rd International Conference on Data Mining and Software Engineering (DMSE 2022)ijdpsjournal
 
Sounak Gupta_CV
Sounak Gupta_CVSounak Gupta_CV
Sounak Gupta_CV08071997
 
What's new in the latest source{d} releases!
What's new in the latest source{d} releases!What's new in the latest source{d} releases!
What's new in the latest source{d} releases!source{d}
 
Sudipta_Mukherjee_Resume-Nov_2022.pdf
Sudipta_Mukherjee_Resume-Nov_2022.pdfSudipta_Mukherjee_Resume-Nov_2022.pdf
Sudipta_Mukherjee_Resume-Nov_2022.pdfSudipta Mukherjee
 
Government GraphSummit: And Then There Were 15 Standards
Government GraphSummit: And Then There Were 15 StandardsGovernment GraphSummit: And Then There Were 15 Standards
Government GraphSummit: And Then There Were 15 StandardsNeo4j
 

Ă„hnlich wie Visualizing Knowledge Distribution using 2.5D Semantic Maps (20)

Understanding and Addressing Architectural Challenges of Cloud- Based Systems
Understanding and Addressing Architectural Challenges of Cloud- Based SystemsUnderstanding and Addressing Architectural Challenges of Cloud- Based Systems
Understanding and Addressing Architectural Challenges of Cloud- Based Systems
 
Resume
ResumeResume
Resume
 
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”DEVNET-1125	Partner Case Study - “Project Hybrid Engineer”
DEVNET-1125 Partner Case Study - “Project Hybrid Engineer”
 
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
 
Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...Enabling Social Network Analysis in Distributed Collaborative Software Develo...
Enabling Social Network Analysis in Distributed Collaborative Software Develo...
 
Kunal bhatia resume mass
Kunal bhatia   resume massKunal bhatia   resume mass
Kunal bhatia resume mass
 
Introduction to MDE
Introduction to MDEIntroduction to MDE
Introduction to MDE
 
Seminar VU Amsterdam 2015
Seminar VU Amsterdam 2015Seminar VU Amsterdam 2015
Seminar VU Amsterdam 2015
 
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
Dilnoza Bobokalonova Resume | Embedded Systems Engineering | Backend Software...
 
Neo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
Neo4j GraphTalk Oslo - Building Intelligent Solutions with GraphsNeo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
Neo4j GraphTalk Oslo - Building Intelligent Solutions with Graphs
 
Sudipta_Mukherjee_Resume_APR_2023.pdf
Sudipta_Mukherjee_Resume_APR_2023.pdfSudipta_Mukherjee_Resume_APR_2023.pdf
Sudipta_Mukherjee_Resume_APR_2023.pdf
 
Sodc 1 Introduction
Sodc 1 IntroductionSodc 1 Introduction
Sodc 1 Introduction
 
3rd International Conference on Data Mining and Software Engineering (DMSE 2022)
3rd International Conference on Data Mining and Software Engineering (DMSE 2022)3rd International Conference on Data Mining and Software Engineering (DMSE 2022)
3rd International Conference on Data Mining and Software Engineering (DMSE 2022)
 
Sounak Gupta_CV
Sounak Gupta_CVSounak Gupta_CV
Sounak Gupta_CV
 
What's new in the latest source{d} releases!
What's new in the latest source{d} releases!What's new in the latest source{d} releases!
What's new in the latest source{d} releases!
 
ID 259 Poster
ID 259 PosterID 259 Poster
ID 259 Poster
 
ID 259 Poster
ID 259 PosterID 259 Poster
ID 259 Poster
 
Sudipta_Mukherjee_Resume-Nov_2022.pdf
Sudipta_Mukherjee_Resume-Nov_2022.pdfSudipta_Mukherjee_Resume-Nov_2022.pdf
Sudipta_Mukherjee_Resume-Nov_2022.pdf
 
Cirad Concours
Cirad ConcoursCirad Concours
Cirad Concours
 
Government GraphSummit: And Then There Were 15 Standards
Government GraphSummit: And Then There Were 15 StandardsGovernment GraphSummit: And Then There Were 15 Standards
Government GraphSummit: And Then There Were 15 Standards
 

Mehr von Matthias Trapp

Interactive Control over Temporal Consistency while Stylizing Video Streams
Interactive Control over Temporal Consistency while Stylizing Video StreamsInteractive Control over Temporal Consistency while Stylizing Video Streams
Interactive Control over Temporal Consistency while Stylizing Video StreamsMatthias Trapp
 
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...Matthias Trapp
 
A Framework for Interactive 3D Photo Stylization Techniques on Mobile Devices
A Framework for Interactive 3D Photo Stylization Techniques on Mobile DevicesA Framework for Interactive 3D Photo Stylization Techniques on Mobile Devices
A Framework for Interactive 3D Photo Stylization Techniques on Mobile DevicesMatthias Trapp
 
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...Matthias Trapp
 
A Service-based Preset Recommendation System for Image Stylization Applications
A Service-based Preset Recommendation System for Image Stylization ApplicationsA Service-based Preset Recommendation System for Image Stylization Applications
A Service-based Preset Recommendation System for Image Stylization ApplicationsMatthias Trapp
 
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...Matthias Trapp
 
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...Matthias Trapp
 
Efficient GitHub Crawling using the GraphQL API
Efficient GitHub Crawling using the GraphQL APIEfficient GitHub Crawling using the GraphQL API
Efficient GitHub Crawling using the GraphQL APIMatthias Trapp
 
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic VisualizationNon-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic VisualizationMatthias Trapp
 
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...Matthias Trapp
 
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...Matthias Trapp
 
Web-based and Mobile Provisioning of Virtual 3D Reconstructions
Web-based and Mobile Provisioning of Virtual 3D ReconstructionsWeb-based and Mobile Provisioning of Virtual 3D Reconstructions
Web-based and Mobile Provisioning of Virtual 3D ReconstructionsMatthias Trapp
 
Real-time Screen-space Geometry Draping for 3D Digital Terrain Models
Real-time Screen-space Geometry Draping for 3D Digital Terrain ModelsReal-time Screen-space Geometry Draping for 3D Digital Terrain Models
Real-time Screen-space Geometry Draping for 3D Digital Terrain ModelsMatthias Trapp
 
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & Morphing
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & MorphingFERMIUM - A Framework for Real-time Procedural Point Cloud Animation & Morphing
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & MorphingMatthias Trapp
 
Interactive Editing of Signed Distance Fields
Interactive Editing of Signed Distance FieldsInteractive Editing of Signed Distance Fields
Interactive Editing of Signed Distance FieldsMatthias Trapp
 
Integration of Image Processing Techniques into the Unity Game Engine
Integration of Image Processing Techniques into the Unity Game EngineIntegration of Image Processing Techniques into the Unity Game Engine
Integration of Image Processing Techniques into the Unity Game EngineMatthias Trapp
 
Interactive GPU-based Image Deformation for Mobile Devices
Interactive GPU-based Image Deformation for Mobile DevicesInteractive GPU-based Image Deformation for Mobile Devices
Interactive GPU-based Image Deformation for Mobile DevicesMatthias Trapp
 
Interactive Photo Editing on Smartphones via Intrinsic Decomposition
Interactive Photo Editing on Smartphones via Intrinsic DecompositionInteractive Photo Editing on Smartphones via Intrinsic Decomposition
Interactive Photo Editing on Smartphones via Intrinsic DecompositionMatthias Trapp
 
Service-based Analysis and Abstraction for Content Moderation of Digital Images
Service-based Analysis and Abstraction for Content Moderation of Digital ImagesService-based Analysis and Abstraction for Content Moderation of Digital Images
Service-based Analysis and Abstraction for Content Moderation of Digital ImagesMatthias Trapp
 
A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...
A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...
A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...Matthias Trapp
 

Mehr von Matthias Trapp (20)

Interactive Control over Temporal Consistency while Stylizing Video Streams
Interactive Control over Temporal Consistency while Stylizing Video StreamsInteractive Control over Temporal Consistency while Stylizing Video Streams
Interactive Control over Temporal Consistency while Stylizing Video Streams
 
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...
A Framework for Art-directed Augmentation of Human Motion in Videos on Mobile...
 
A Framework for Interactive 3D Photo Stylization Techniques on Mobile Devices
A Framework for Interactive 3D Photo Stylization Techniques on Mobile DevicesA Framework for Interactive 3D Photo Stylization Techniques on Mobile Devices
A Framework for Interactive 3D Photo Stylization Techniques on Mobile Devices
 
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...
ALIVE-Adaptive Chromaticity for Interactive Low-light Image and Video Enhance...
 
A Service-based Preset Recommendation System for Image Stylization Applications
A Service-based Preset Recommendation System for Image Stylization ApplicationsA Service-based Preset Recommendation System for Image Stylization Applications
A Service-based Preset Recommendation System for Image Stylization Applications
 
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...
Design Space of Geometry-based Image Abstraction Techniques with Vectorizatio...
 
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
A Benchmark for the Use of Topic Models for Text Visualization Tasks - Online...
 
Efficient GitHub Crawling using the GraphQL API
Efficient GitHub Crawling using the GraphQL APIEfficient GitHub Crawling using the GraphQL API
Efficient GitHub Crawling using the GraphQL API
 
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic VisualizationNon-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
Non-Photorealistic Rendering of 3D Point Clouds for Cartographic Visualization
 
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...
TWIN4ROAD - Erfassung Analyse und Auswertung mobiler Multi Sensorik im Strass...
 
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...
Interactive Close-Up Rendering for Detail+Overview Visualization of 3D Digita...
 
Web-based and Mobile Provisioning of Virtual 3D Reconstructions
Web-based and Mobile Provisioning of Virtual 3D ReconstructionsWeb-based and Mobile Provisioning of Virtual 3D Reconstructions
Web-based and Mobile Provisioning of Virtual 3D Reconstructions
 
Real-time Screen-space Geometry Draping for 3D Digital Terrain Models
Real-time Screen-space Geometry Draping for 3D Digital Terrain ModelsReal-time Screen-space Geometry Draping for 3D Digital Terrain Models
Real-time Screen-space Geometry Draping for 3D Digital Terrain Models
 
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & Morphing
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & MorphingFERMIUM - A Framework for Real-time Procedural Point Cloud Animation & Morphing
FERMIUM - A Framework for Real-time Procedural Point Cloud Animation & Morphing
 
Interactive Editing of Signed Distance Fields
Interactive Editing of Signed Distance FieldsInteractive Editing of Signed Distance Fields
Interactive Editing of Signed Distance Fields
 
Integration of Image Processing Techniques into the Unity Game Engine
Integration of Image Processing Techniques into the Unity Game EngineIntegration of Image Processing Techniques into the Unity Game Engine
Integration of Image Processing Techniques into the Unity Game Engine
 
Interactive GPU-based Image Deformation for Mobile Devices
Interactive GPU-based Image Deformation for Mobile DevicesInteractive GPU-based Image Deformation for Mobile Devices
Interactive GPU-based Image Deformation for Mobile Devices
 
Interactive Photo Editing on Smartphones via Intrinsic Decomposition
Interactive Photo Editing on Smartphones via Intrinsic DecompositionInteractive Photo Editing on Smartphones via Intrinsic Decomposition
Interactive Photo Editing on Smartphones via Intrinsic Decomposition
 
Service-based Analysis and Abstraction for Content Moderation of Digital Images
Service-based Analysis and Abstraction for Content Moderation of Digital ImagesService-based Analysis and Abstraction for Content Moderation of Digital Images
Service-based Analysis and Abstraction for Content Moderation of Digital Images
 
A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...
A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...
A Non-Photorealistic Rendering Techniquefor Art-directed Hatching of 3D Point...
 

KĂĽrzlich hochgeladen

Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSĂ©rgio Sacani
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCEPRINCE C P
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfSumit Kumar yadav
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 

KĂĽrzlich hochgeladen (20)

Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow đź’‹ Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCESTERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
STERILITY TESTING OF PHARMACEUTICALS ppt by DR.C.P.PRINCE
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 

Visualizing Knowledge Distribution using 2.5D Semantic Maps

  • 1. Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps IVAPP | February 8th 2022, Vienna Daniel Atzberger, Tim Cech, Adrian Jobst, Willy Scheibel, Daniel Limberger, Matthias Trapp, and JĂĽrgen Döllner Hasso-Plattner-Institute, Digital Engineering Faculty, University of Potsdam, Germany 08.02.2022
  • 2. Introduction | Motivation ”The people working in a software organization are its greatest assets. It is expensive to recruit and retain good people, and it is up to software managers to ensure that the engineers working on a project are as productive as possible. In successful companies and economies, this productivity is achieved when people are respected by the organization and are assigned responsibilities that reflect their skills and experience.” I. Sommerville, Software Engineering. 9th Ed., Harlow 2016, p. 652 2 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 3. Introduction | Problem Statement • Focus of existing approaches: Mining expertise of developers from different domains, e.g. source code (e.g. Linstead et al. (2007) Mining Developer Contributions via author-topic models) • In general, no interactive visualization provided for understanding raw analyses • Idea: Visualize correlation between concepts and developer expertise on a 2,5D-map 3 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 4. Introduction | Challenges 1 | Mining developer expertise • Formal description of developer similarity based on their coding activities • Extracting skill levels in general concepts, e.g., „machine learning“, or „blockchain“ 2 | Visualization Requirements • Displaying similarity between developers • Displaying attributes of developers • Interaction techniques for analyzing knowledge distributions 4 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 5. Introduction | Idea & Approach 1 | Mining developer expertise • Get meaningful corpus with natural language processing (NLP) techniques • Application of Latent Dirichlet Allocation (LDA) on the commit history of developers • Training an Labeled LDA (LLDA) model on a corpus of GitHub projects for extracting vocabulary of a concept 2 | Visualization Requirements • Based on extracted topics and document-topic distributions developers are placed on a 2D reference space • Distances display semantic relatedness • Data related to the expertise of developers mapped onto the visual variables of 3D glyphs 5 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 6. Road to KnowhowMap | Process Overview 6 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 7. Road to KnowhowMap | Mining Expertise Assumption: Developers knowledge is directly encoded in the source code. • Similar developers use a common vocabulary (e.g. Saxena and Pedanekar (2017): [...] Mining candidate expertise from github repositories) • Statistical language models can be used to describe developers as high-dimensional feature vectors (e.g. Linstead et al. (2007): Mining eclipse developer contributions via author-topic models) 7 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 8. Road to KnowhowMap | Preprocessing – NLP • Crawling source code files • Remove symbols and split up words • Remove very common words (stop words) • Get corpus per concept and developer 8 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 9. Road to KnowhowMap | Preprocessing – Vocabulary Size of the vocabulary for number of GitHub projects that are tagged with the same concept 9 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 10. Road to KnowhowMap | Preprocessing – LDA 10 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 11. Road to KnowhowMap | Concept Mining with LLDA • Document has only a non-zero value in a topic, when it is marked with its associated tag • Training on a corpus of GitHub projects, leads to concept-specific vocabulary • Locating keywords of a concept in the commit history of developer results in a skill level Machine Learning Cryptocurrency Database Server Data Visualization th order db request chart tensor crypto table server series self binance key header axis cuda price name http pixi model trade value body datum license wallet opt response point layer exchange sql message style 11 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 12. Road to KnowhowMap | Layout Computation • Input • Vocabulary V • Corpus of source code files C • Topics Ď•1, . . . , Ď•K as distributions over the vocabulary V • Document-topic-distributions θ1, . . . , θm • Dissimilarity matrix according to Jensen-Shannon distance Λ • Output • Reduced Topics Ď•Ě„1, . . . , Ď•Ě„K (with Multidimensional Scaling over Λ) • Position of a developer is given by ÂŻ di = K P j=1 θ (j) i Ď•Ě„j Topics visualized using LDAvis 12 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 13. Visualization Approach | Visual Mapping Exemplary atlas of 3D glyphs for representing developers and topics 13 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 14. Visualization Approach | Annotations On demand further details about the skills of a developer are displayed 14 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 15. Visualization Approach | Example result KnowhowMap for the Bitcoin Core project (github.com/bitcoin/bitcoin) based on 2000 commits. 15 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 16. Conclusions Contributions • 2.5D visualization, showing semantic relatedness between developers based on their source code activities • Novel method for extracting skills in high-level concepts by training an LLDA model on a dynamically generated corpus of GitHub projects • Allows various visual mappings for different use cases Future Work • Further evaluation of the proposed expertise mining technique • User study to evaluate the effectiveness of our visualization approach 16 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 17. Contact • Daniel Atzberger • Tim Cech, tim.cech@hpi.uni-potsdam.de • Adrian Jobst • Willy Scheibel • Daniel Limberger • Dr. Matthias Trapp • Prof. Dr. JĂĽrgen Döllner Acknowledgements This work is part of the „Software-DNA“ project, which is funded by the European Regional Development Fund (ERDF or EFRE in German) and the State of Brandenburg (ILB). This work is part of the KMU project „KnowhowAnalyzer“ (Förderkennzeichen 01IS20088B), which is funded by the German Ministry for Education and Research (Bundesministerium fĂĽr Bildung und Forschung). 17 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 18. References I [Atzberger et al., 2022] Atzberger, D., Cech, T., Jobst, A., Scheibel, W., Limberger, D., Trapp, M., and Döllner, J. (2022). Visualization of knowledge distribution across development teams using 2.5d semantic software maps. In Proc. 13th International Conference on Information Visualization Theory and Applications, IVAPP ’22. INSTICC, SciTePress. [Blei et al., 2003] Blei, D. M., Ng, A. Y., and Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3:993–1022. [Cox and Cox, 2008] Cox, M. A. and Cox, T. F. (2008). Multidimensional scaling. In Handbook of Data Visualization, pages 315–347. Springer. [Kuhn et al., 2008] Kuhn, A., Loretan, P., and Nierstrasz, O. (2008). Consistent layout for thematic software maps. In Proc. 15th Working Conference on Reverse Engineering, WCRE ’08, pages 209–218. IEEE. [Linstead et al., 2007] Linstead, E., Rigor, P., Bajracharya, S., Lopes, C., and Baldi, P. (2007). Mining eclipse developer contributions via author-topic models. In Proc. 4th International Workshop on Mining Software Repositories, MSR ’07, pages 30:1–4. 18 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021
  • 19. References II [Ramage et al., 2009] Ramage, D., Hall, D., Nallapati, R., and Manning, C. D. (2009). Labeled LDA: A supervised topic model for credit attribution in multi-labeled corpora. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing, pages 248–256, Singapore. Association for Computational Linguistics. [Saxena and Pedanekar, 2017] Saxena, R. and Pedanekar, N. (2017). I know what you coded last summer: Mining candidate expertise from github repositories. In Companion of the 2017 ACM Conference on Computer Supported Cooperative Work and Social Computing, pages 299–302. [Sievert and Shirley, 2014] Sievert, C. and Shirley, K. (2014). Ldavis: A method for visualizing and interpreting topics. In Proc. Workshop on Interactive Language Learning, Visualization, and Interfaces, pages 63–70. ACL. 19 Visualization of Knowledge Distribution across Development Teams using 2.5D Semantic Software Maps Tim Cech 08.02.2021