+ All Categories
Home > Documents > Scholar Trajectory: Visualizing the Migration of Talents · 2021. 2. 8. · Scholar Trajectory:...

Scholar Trajectory: Visualizing the Migration of Talents · 2021. 2. 8. · Scholar Trajectory:...

Date post: 11-Sep-2021
Category:
Upload: others
View: 15 times
Download: 0 times
Share this document with a friend
2
Scholar Trajectory: Visualizing the Migration of Talents Zhou Shao Nanjing University of Science and Technology [email protected] Jie Tang Tsinghua University [email protected] Yutao Zhang Tsinghua University [email protected] Bo Gao Tsinghua University [email protected] Yongli Wang Nanjing University of Science and Technology [email protected] ABSTRACT In this paper, we present Scholar Trajectory to visualize and analyze the migration of research scholars. We extract the temporal and spatial footprints of scholars from the affiliation strings in their pa- pers. The visual analyses are conducted at both individual level and group level. At individual level, we illustrate the location statistic and the academic achievements of each individual scholar. At group level, we explore the collective migration pattern of a research com- munity. Empirical case studies verifies the informativeness and intuitiveness of the system. ACM Reference Format: Zhou Shao, Jie Tang, Yutao Zhang, Bo Gao, and Yongli Wang. 2018. Scholar Trajectory: Visualizing the Migration of Talents. In Proceedings of 24th ACM SIGKDD Conference (KDD 2018). ACM, New York, NY, USA, 2 pages. https://doi.org/10.1145/nnnnnnn.nnnnnnn 1 INTRODUCTION The temporal and spatial trajectories exhibit the career mobility of a person, which is valuable for talent management, recruitment, and trend analysis. In this paper, we present Scholar Trajectory 1 for mining and visualizing migration trajectories of research scholars in different fields. Scholar Trajectory is a data-driven temporal-spatial mining sys- tem based on AMiner Academic Graph 2 . We generate migration trajectory for researchers based on the affiliations mentioned on their publications. There are three main challenges to build such a system: 1) Author disambiguation [2, 4], i.e. constructing scholar profiles from publication databases by distinguishing authors with same names. 2) Geo-location extraction [1], i.e. mapping the affili- ation string mentioned on publications to a specific geographical coordinate. 3) Trajectory visualization, i.e. designing informative and intuitive visualizations to illustrate the temporal-spatial pat- terns of individual scholars and research communities. In this work, Corresponding author (email: [email protected]) 1 https://traj.aminer.cn/trajectory-index 2 https://www.aminer.cn/ Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the owner/author(s). KDD 2018, August 19-23 2018, London, United Kingdom © 2018 Copyright held by the owner/author(s). ACM ISBN 978-x-xxxx-xxxx-x/YY/MM. https://doi.org/10.1145/nnnnnnn.nnnnnnn we leverage the disambiguated scholar profiles from AMiner [3] and focus on solving the latter two problems. Figure 1 gives an overview of the main user interface, where the yellow curves indicate the migration trajectories. Figure 1: An overview of Scholar Trajectory System 2 SYSTEM OVERVIEW There are two main components in the system, namely Individual Trajectory and Group Trajectory. Individual Trajectory focuses on the temporal-spatial footprint of an individual scholar, while Group Trajectory analyses the collective migration pattern of mul- tiple scholars. We generate the migration trajectories of scholars based on AMiner Academic Graph, which consists of more than 130 million researchers and 270 million publications. Specifically, we extract the name of the institution mentioned in the affiliation string, and leverage Google Map API to map the institution name to its corresponding geographic location. This is non-trivial due to the inherent ambiguity and variability of natural language. To this end, we formulate the problem as a sequential labeling task. An affiliation string is broken into a quadruplet scholarId, institution, year, geoLocation using CRF model, where scholarId is the unique identifier of a researcher, year is the publication year, institution is the extracted institution name from the affiliation string, and geoLocation is the corresponding geographic coordinates. 2.1 Visual Design To intuitively illustrate the temporal and spatial patterns, we design three essential visual elements: Point, Line, and Heat. Figure 2 gives an example of the three visual elements. Point denotes a geograph- ical location of scholars, where the color of a Point indicates the number of the scholars. Line is a directed curve indicating the mi- gration route of scholars from one location to another in a certain year. Heat shows the degree of influence of the scholars within certain area. Figure 2 gives an example of the three visual elements.
Transcript
Page 1: Scholar Trajectory: Visualizing the Migration of Talents · 2021. 2. 8. · Scholar Trajectory: Visualizing the Migration of Talents Zhou Shao Nanjing University of Science and Technology

Scholar Trajectory: Visualizing the Migration of TalentsZhou Shao

Nanjing University of Science andTechnology

[email protected]

Jie Tang∗Tsinghua University

[email protected]

Yutao ZhangTsinghua University

[email protected]

Bo GaoTsinghua [email protected]

Yongli WangNanjing University of Science and

[email protected]

ABSTRACTIn this paper, we present Scholar Trajectory to visualize and analyzethe migration of research scholars. We extract the temporal andspatial footprints of scholars from the affiliation strings in their pa-pers. The visual analyses are conducted at both individual level andgroup level. At individual level, we illustrate the location statisticand the academic achievements of each individual scholar. At grouplevel, we explore the collective migration pattern of a research com-munity. Empirical case studies verifies the informativeness andintuitiveness of the system.ACM Reference Format:Zhou Shao, Jie Tang, Yutao Zhang, Bo Gao, and Yongli Wang. 2018. ScholarTrajectory: Visualizing the Migration of Talents. In Proceedings of 24thACM SIGKDD Conference (KDD 2018). ACM, New York, NY, USA, 2 pages.https://doi.org/10.1145/nnnnnnn.nnnnnnn

1 INTRODUCTIONThe temporal and spatial trajectories exhibit the career mobility ofa person, which is valuable for talent management, recruitment,and trend analysis. In this paper, we present Scholar Trajectory1 formining and visualizing migration trajectories of research scholarsin different fields.

Scholar Trajectory is a data-driven temporal-spatial mining sys-tem based on AMiner Academic Graph2. We generate migrationtrajectory for researchers based on the affiliations mentioned ontheir publications. There are three main challenges to build such asystem: 1) Author disambiguation [2, 4], i.e. constructing scholarprofiles from publication databases by distinguishing authors withsame names. 2) Geo-location extraction [1], i.e. mapping the affili-ation string mentioned on publications to a specific geographicalcoordinate. 3) Trajectory visualization, i.e. designing informativeand intuitive visualizations to illustrate the temporal-spatial pat-terns of individual scholars and research communities. In this work,∗Corresponding author (email: [email protected])1https://traj.aminer.cn/trajectory-index2https://www.aminer.cn/

Permission to make digital or hard copies of part or all of this work for personal orclassroom use is granted without fee provided that copies are not made or distributedfor profit or commercial advantage and that copies bear this notice and the full citationon the first page. Copyrights for third-party components of this work must be honored.For all other uses, contact the owner/author(s).KDD 2018, August 19-23 2018, London, United Kingdom© 2018 Copyright held by the owner/author(s).ACM ISBN 978-x-xxxx-xxxx-x/YY/MM.https://doi.org/10.1145/nnnnnnn.nnnnnnn

we leverage the disambiguated scholar profiles fromAMiner [3] andfocus on solving the latter two problems. Figure 1 gives an overviewof the main user interface, where the yellow curves indicate themigration trajectories.

Figure 1: An overview of Scholar Trajectory System

2 SYSTEM OVERVIEWThere are two main components in the system, namely IndividualTrajectory and Group Trajectory. Individual Trajectory focuseson the temporal-spatial footprint of an individual scholar, whileGroup Trajectory analyses the collective migration pattern of mul-tiple scholars. We generate the migration trajectories of scholarsbased on AMiner Academic Graph, which consists of more than130 million researchers and 270 million publications. Specifically,we extract the name of the institution mentioned in the affiliationstring, and leverage Google Map API to map the institution nameto its corresponding geographic location. This is non-trivial due tothe inherent ambiguity and variability of natural language. To thisend, we formulate the problem as a sequential labeling task. Anaffiliation string is broken into a quadruplet ⟨ scholarId, institution,year, geoLocation ⟩ using CRF model, where scholarId is the uniqueidentifier of a researcher, year is the publication year, institutionis the extracted institution name from the affiliation string, andgeoLocation is the corresponding geographic coordinates.

2.1 Visual DesignTo intuitively illustrate the temporal and spatial patterns, we designthree essential visual elements: Point, Line, and Heat. Figure 2 givesan example of the three visual elements. Point denotes a geograph-ical location of scholars, where the color of a Point indicates thenumber of the scholars. Line is a directed curve indicating the mi-gration route of scholars from one location to another in a certainyear. Heat shows the degree of influence of the scholars withincertain area. Figure 2 gives an example of the three visual elements.

Page 2: Scholar Trajectory: Visualizing the Migration of Talents · 2021. 2. 8. · Scholar Trajectory: Visualizing the Migration of Talents Zhou Shao Nanjing University of Science and Technology

KDD 2018, August 19-23 2018, London, United Kingdom Zhou Shao, Jie Tang, Yutao Zhang, Bo Gao, and Yongli Wang

Figure 2: Three essential visual elements in Scholar Trajec-tory: Point, Line, and Heat

2.2 Individual TrajectoryIndividual Trajectory shows the migration pattern of an individualscholar. Besides visualizing the temporal-spatial trajectory of thescholar, we provide detailed analysis of their academic achieve-ments. Figure 3 illustrates an example trajectory analysis of anindividual scholar. We plot the migration history of a scholar ona timeline where different locations are color-coded (Figure 3a).Figure 3b shows the academic achievement of the scholar in differ-ent years. Figure 3c provides the statistic of locations throughoutthe career of the scholar, where the two levels in the chord graphindicates countries and cities respectively.

(a) Trajectory timeline (b) Academic achievement over-time

(c) Location statistics

Figure 3: Individual Trajectory analysis

2.3 Group TrajectoryGroup Trajectory visualizes the collective migration pattern of mul-tiple scholars which helps to analyze the behaviors and trends in aresearch community. Figure 1 and Figure 4 are the group trajectoryvisualization of Data Mining community. Figure 4a shows the tem-poral spatial distribution of scholars in the community, where eachbar in the 3D chart indicates the number of Data Mining researcherswithin a country in a certain year. Figure 4b analyzes the careerswitch behaviors in the research community, where affiliations ofthe scholars are categorized into four different classes: Academia(Universities), Research Institutes, Industry (Companies), and Oth-ers. We calculate the number of switch behaviors between thesefour classes within a research community. Based on our case study,we observe that most researchers moving from the United States toChina are switching from industry to academia. Figure 4c showsthe loss and gain of talents in different countries, where the rightand left bar indicate the number of researchers moving into and

out of the country. From the figure, we can observe that there aresignificant amount of researchers moving out of some countries,which reveals the potential "brain drain" problem.

(a) Scholar distribution. (b) Career switch analysis

(c) Loss and gain analysis

Figure 4: Group Trajectory analysis

To predict the scholar distribution in the future, GDP long-termforecast data from Organization for Economic Co-operation andDevelopment3 is used as an evidence. Specifically, we match theGDP curve with the scholar heatmap in the corresponding yearsusing a LSTM. Figure 5 shows the predicted scholar heatmap in theyear 2050.

Figure 5: Future prediction

3 ACKNOWLEDGEMENTFundamental Research Funds for the Central Univ.(30918015103),Nanjing Sci. Tech. Development Plan Project(201805036).

REFERENCES[1] Wu Kan, Tang Jie, Shao Zhou, Xu Xinyi, Gao Bo, and Zhao Shu. 2018. CareerMap:

Visualizing Career Trajectory. Science China accepted (2018).[2] Jie Tang, A. C. M. Fong, Bo Wang, and Jing Zhang. 2012. A Unified Probabilistic

Framework for Name Disambiguation in Digital Library. TKDE’12 24, 6 (2012),975–987.

[3] Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, and Zhong Su. 2008. Ar-netminer: extraction and mining of academic social networks. In KDD’08. ACM,990–998.

[4] Yutao Zhang, Fanjin Zhang, Peiran Yao, and Jie Tang. 2018. Name Disambiguationin AMiner: Clustering, Maintenance, and Human in the Loop. KDD’18 (2018).

3https://data.oecd.org/gdp/gdp-long-term-forecast.htm


Recommended