社群媒體中顧客知識之挖掘：意見探勘技術開發review.management.ntu.edu.tw/paper/6243-P.pdf ·...

1

臺大管理論叢 2017/5第 27 卷第 2S期 1-28DOI:10.6226/NTUMR.2017.JUN.F104-008

社群媒體中顧客知識之挖掘：意見探勘技術開發

Mining Consumer Knowledge from Social Media: Development of an Opinion Mining Technique

摘要

資訊科技與網際網路的普及，促成眾多新興應用的蓬勃發展，大量與多樣的資料迅速累積，為了有效地自大量資料中挖掘出有趣的知識，巨量資料分析的概念孕育而生。意見探勘是巨量資料分析的一項核心技術，其目的是自大量使用者產生資料中，分析使用者對某些有興趣的實體（例如，產品、服務等）的主觀看法（例如，意見、情感、評價等），並將這些資訊適當地摘要彙整，專換成結構化的顧客知識。本研究專注在意見探勘中意見句子識別的工作，為改善傳統監督式學習法在準備訓練資料上所需投入的大量人力與時間，本研究提出僅需要使用者提供少量的關鍵字，再輔以社群媒體抓取來，未經人工標註的使用者產生資料，便能夠進行半監督式的學習，產生與監督式學習相似甚至更佳的探勘結果。具體而言，本研究採用類別關聯規則演算法，達配本研究設計的半監督式學習法，提出規則式意見句子識別技術 (R-OSI)。根據實驗評估結果，本研究的 R-OSI技術能夠達到與監督式方法相近甚至更優良的效能。

【關鍵字】意見句子識別、意見探勘、使用者產生資料、社群媒體分析、巨量資料分析

Abstract

With the popularization of information and network technology, many emerging and interesting applications have been developed vigorously. The volume and variety of data accumulates rapidly. These data are considered vital assets for supporting crucial business intelligence applications. To better manage and use the valuable data, big data analytics, which is the process of examining large datasets containing a variety of data types to uncover hidden, previously unknown, and potentially useful patterns and knowledge, has become a crucial research issues. In this study, we concentrate on an important big data analytic task, namely opinion mining. We propose a rule-based opinion sentence identification (R-OSI) technique, which can retrieve relevant review sentences to a specific product feature of interest from a large volume of consumer reviews. The novelty of the proposed technique is that it adopts a semi-supervised learning approach by requesting a user to provide keywords to describe the target product feature. In addition, a set of unannotated consumer reviews are retrieved from various social media websites. On the basis of the user-provided keywords and the set of unannotated consumer reviews, the class association rule mining algorithm is applied to learn a set of opinion sentence identification rules for the target product feature. Our empirical evaluation results suggest that the proposed R-OSI technique achieves promising performance in opinion sentence identification, even when a supervised learning approach is adopted as the performance benchmark.【Keywords】opinion sentence identification, opinion mining, user generated content,

social media analytics, big data analytics

楊錦生 / 元智大學資訊管理學系助理教授Chin-Sheng Yang, Assistant Professor, Department of Information Management, Yuan Ze University

謝佩芸 / 元智大學資訊管理學系碩士Pei-Yun Xie, Master, Department of Information Management, Yuan Ze University

施曉萍 / 元智大學資訊管理學系碩士Hsiao-Ping Shih, Master, Department of Information Management, Yuan Ze University

Received, 2015/6, Final revision received 2016/6


2

壹、緒論

資訊科技與網際網路的成熟與普及，促成眾多新興應用的蓬勃發展，各式資料開

始快速地產生與累積，這些大量的資料中隱藏著許多有價值的資訊與知識，值得妥善

分析與利用。以 Google的流行性感冒傳染趨勢研究為例，藉由分析搜尋引擎的大量使用者查詢資料，發現某些流感相關查詢字詞與真實流感傳染趨勢高度相關，可做為

快速、有效的預測工具 (Ginsberg, Mohebbi, Patel, Brammer, Smolinski, and Brilliant, 2009)。然而，隨著資料數量 (Volume) 與格式多樣性 (Variety) 的成長，對資料分析的高時效 (Velocity) 需求，以及多元資料帶來的資料品質和可靠度的真實性 (Veracity) 議題，上述巨量資料的 4V特性，增加了資料分析的難度，使得企業與個人皆身處一個空有大量資料，卻無法有效找出有價值知識的情境，如同 Naisbitt (1982) 在他的名著Megatrends中提到的 “We are drowning in information (data) but starved for knowledge”。根據 EMC委託 IDC所進行的一項數位世界研究報告 (EMC Digital Universe with

Research & Anglysis by IDC, 2014)，自 2013年起的 10年間，全球資料量每年將以40%速度成長，由 2013年的 4.4ZB (Zettabyte) 急速攀升到 2020年 44ZB。此外，資料的產生來源也由企業主導，轉移到由個人使用者產生，2013年的數據顯示，有三分之二的資料是由個人使用者產生的。雖然資料量大量累積，但是在 2013年時，其中只有 22%的資料在經過適當的標註與分析後可能會有價值，而真正具有高價值的資料 (High Value Data) 則僅占 1.5%。除了國外的數據外，國內也呈現相似的高速成長趨勢。以全國性繳費即時交易為例，2012與 2013年的交易筆數分別為 1,061萬筆與 1,250萬筆，增加 189萬筆，成長率為 17.8%；而 2013與 2014年上半年的比較，更發現高達24.1%的成長率（鍾珍珠與郭玉慧，2014）。此外，財政部財政資訊中心（蘇俊榮，2015）的統計數據 1顯示，政府目前的財政數據約有 740億筆資料，約為 74TB的大小。為了解決從大量資料中挖掘有用知識的需求，各式新穎巨量資料分析 (Big Data Analytics) 技術蓬勃發展 (Manyika, Chui, Brown, Bughin, Dobbs, Roxburgh, and Byers, 2011; Chen, Chiang, and Storey, 2012)。為有效支援巨量資料分析的發展，需要同時兼顧底層基礎技術（例如，Hadoop、

Hbase、MapReduce等）與上層分析技術（例如，社群媒體分析、意見探勘、資訊視覺化等）的發展。本研究的重點在於發展巨量資料分析的上層分析技術，並聚焦在意

見探勘議題，其可以協助自大量使用者產生資料 (User Generated Content) 中，挖掘出使用者表達的主觀意見與看法。藉由快速、有效的使用者意見挖掘與彙整，可以轉換

成顧客知識，用以協助各式商業智慧應用的進行。例如，製造商可以取得消費者的電

1 財政部財政資訊中心，蘇俊榮主任 104年 1月 9日的演講資料。

臺大管理論叢第27卷第2S期

3

子口碑 (Electronic Word-of-mouth)，了解消費者對其產品的真正看法，進行產品設計的改進或是新產品的開發；零售業者可以了解消費者的購物偏好，進行行銷智慧策略

的訂定，遂行各式精準行銷的工作，增加整體銷售業績；消費者則可以利用別人累積

的顧客知識，進行產品與服務的快速挑選，增加購物決策的效率與品質。此外，也可

以協助其他研究主題的進行，例如，提升商品銷售預測、意見領袖分析等應用的效能。

意見探勘的目的是分析人們對某些有興趣的實體（例如，產品、服務、組織、事

件等）的主觀看法（例如，意見、情感、評價等）(Liu, 2010, 2012; Pang and Lee, 2008; Feldman, 2013)，並將這些資訊依據評論的對象（稱為產品特徵）與表達的主觀意見（稱為意見傾向）適當地摘要彙整，專換成結構化的顧客知識。意見探勘主要由產品特徵

萃取（也稱為意見句子識別）與意見傾向判定兩個核心工作組成。產品特徵萃取的目

的是自使用者產生資料中萃取出使用者討論了哪些產品特徵，或是識別出那些句子有

討論特定的產品特徵；而意見傾向判定則是協助識別出使用者對前述的產品特徵或意

見句子的主觀意見傾向 (Hu and Liu, 2004a, 2004b; Popescu and Etzioni, 2005; Pang and Lee, 2008; Yang, Wei, and Yang, 2009; Liu, 2010; Wei, Chen, Yang, and Yang, 2010)。本研究的重點為意見探勘中意見句子的識別，也就是當使用者表明對特定產品的

某個特徵（例如，汽車的引擎、旅館的餐飲等）有興趣時，自動從大量的使用者產生

資料中，找出有在討論該產品特徵的句子。傳統意見句子識別多採用監督式學習法來

進行 (Wong and Lam, 2005, 2008; Yang, Tang, Wong, and Wei, 2010)，通常可以達到不錯的識別效果，但是需要花費大量的人力與時間來準備監督式學習所需的訓練資料，且

訓練資料的品質通常顯著影響學習的結果。因此，本研究專注在提出一個半監督式的

學習法，用以減輕監督式學習法的大量人力與時間需求，同時希望不減低意見句子識

別的效果。本研究提出的方法稱為規則式意見句子識別 (Rule-based Opinion Sentence Identification; R-OSI) 技術，其只需要使用者提供幾個關鍵字（例如，「鏡頭」、「光圈」、「快門」）來描述有興趣的產品特徵（例如，相機的鏡頭），此外就是使用網

頁爬行器自社群媒體中抓取無須人工標註的使用者產生資料，進行類別關聯規則 (Class Association Rules) 的學習，達成意見句子識別的目的。本研究收集了真實的資料，進行評估實驗，以了解本研究所提規則式意見句子識別技術的效能。結果顯示，

相較於使用監督式類別關聯規則演算法的技術，本研究 R-OSI技術可以產出相似、甚至更好的識別效果。

本文章的架構如下：第一章說明研究的相關背景、動機與目的；第二章摘要、簡

介意見探勘相關的文獻成果；第三章詳細介紹本研究所提規則式意見句子識別技術的

設計；第四章說明實驗評估的設計，並討論重要的實驗結果；最後，第五章彙整本研

究的重要結論與貢獻，並簡述未來的研究方向。


4

貳、文獻探討

意見探勘 (Opinion Mining)，也稱為情感分析 (Sentiment Analysis)，其目的是分析人們對某些有興趣的實體（例如，產品、服務、組織、事件等）的主觀看法（例如，

意見、情感、評價等）(Liu, 2010, 2012; Pang and Lee, 2008; Feldman, 2013)，並將這些資訊依據評論的對象（稱為產品特徵）與表達的主觀意見（稱為意見傾向）適當地摘

要彙整，專換成結構化的顧客知識，以協助企業更加了解廣大消費者的意見，並以之

發展重要的商業智慧應用，或提供一般使用者另一個訊息管道，避免被廠商官方資訊

誤導，做出更佳的消費決策。

以 “The battery life of this camera is good, but the price is unacceptable.” 這個句子為例，我們知道該句子評論的兩個產品特徵為電池壽命 (Battery Life) 與價格 (Price)，且使用者的意見傾向分別為正向 (Good) 與負向 (Unacceptable)。藉由對社群媒體上數量龐大的使用者產生資料進行上述的情感分析，並進行適當彙整，可以協助使用者快速

瞭解社群媒體上人們感興趣的實體與主觀看法。為有效完成上述的分析與彙整工作，

意見探勘一般由兩個核心的工作組成，分別是 (1)產品特徵萃取 (Product Feature Extraction)：也稱為意見句子識別 (Opinion Sentence Identification)，目的是自使用者產生資料中萃取出使用者討論了哪些產品特徵（開放式分析，沒有預設的產品特徵分析

對象），或是識別出那些句子有討論特定的產品特徵（聚焦式分析，已經訂好要分析

的產品特徵）；(2)意見傾向判定 (Opinion Orientation Determination)：判定使用者在前述找出的產品特徵或意見句子上的意見傾向（例如，正向、負向、中性）(Hu and Liu, 2004a, 2004b; Popescu and Etzioni, 2005; Pang and Lee, 2008; Yang et al., 2009; Liu, 2010; Wei et al., 2010)。

一、產品特徵擷取

產品特徵擷取技術大致上可分類成二個主要的方法：開放式 (Discovery-based) 和聚焦式 (Focus-based) (Yang et al., 2009)。開放式的產品特徵擷取技術並沒有預定要分析的產品特徵對象，聚焦式的產品特徵擷取技術需事先給定要分析的產品特徵對象。

開放式方法一般不需要準備訓練樣本，因此也可以稱為非監督式方法。例如，Hu and Liu (2004a, 2004b) 發展的技術，假設產品特徵必須是名詞或名詞片語，並採用關聯規則探勘演算法 (Agrawal and Srikant, 1994; Srikant and Agrawal, 1995) 來擷取在特定的一個顧客評論集中的所有產品特徵。Wei et al. (2010) 延續 Hu and Liu (2004a, 2004b) 的研究，加入一組正向和負向情感字詞 (Semantic Words) 和三個啟發式規則 (Heuristic Rules)，以提高產品特徵擷取的效能。除了關聯規則探勘之外，資訊擷取 (Information Extraction)技術提供了另一種可行解決方案，例如，Popescu and Etzioni (2005) 採用一個名為 KnowItAll (Etzioni, Cafarella, Downey, Popescu, Shaked, Soderland, Weld, and


5

Yates, 2005) 的網際網路資訊擷取系統，提出了 OPINE的技術來從顧客評論自動擷取產品特徵。此外，Kobayashi, Iida, Inui, and Matsumoto (2005) 與 Kobayashi, Inui, Matsumoto, Tateishi, and Fukushima (2004) 也提出了其他以資訊擷取為基礎的產品特徵擷取技術，其過程與 OPINE類似，不同點在於擷取模板 (Pattern) 的表示和建構，及採用的信心度測試。

而聚焦式的方法因為已經指定要分析的產品特徵是什麼，通常需要準備標註好的

訓練資料，因此也稱為監督式方法。因為有訓練資料集，因此聚焦式方法通常採用機

器學期的監督式學習法 (Supervised Learning Algorithms)，來建構一個識別（分類）模型，之後就可以使用這個識別模型，從未分類的新資料中，尋找有哪些句子討論了特

定的產品特徵，本研究給聚焦式方法另一個名稱，即意見句子識別 (Opinion Sentence Identification)，以利和開放式的產品特徵擷取技術做區分。例如，Wong and Lam (2005, 2008) 採用 Hidden Markov Model和 Conditional Random Field學習演算法，自拍賣網站中擷取產品的特徵。Yang et al. (2010) 採用兩種監督式學習演算法，即類別關聯規則 (Class Association Rules) 和貝式分類器 (Naive Bayes Classifier)，來判斷意見句子討論了那些產品特徵。

開放式和聚焦式的技術各有優缺點。聚焦式意見句子識別通常可以達到較好的分

析結果（正確率高），但準備訓練樣本是非常耗費時間和人力的。相反地，開放式的

產品特徵擷取的好處是不需要人工準備訓練樣本，但是卻面臨另一個大問題，使用者

可能使用不同的字來表示同一個產品特徵，因此需要對找到的產品特徵進行分群，以

避免異字同義的問題，例如，當擷取到 ‘Zoom,’ ‘Aperture,’ ‘Magnification,’ ‘Focal-length,’ 等字詞時，我們有很高的信心相信這幾個字詞都是在描述「鏡頭」這個產品特徵，可是如果沒有將這些字詞群聚在一起，他們會被視為不同的產品特徵。因此，另

有學者採用中庸的做法，即半監督式的學習法 (Semi-supervised Learning Algorithms) 來進行。例如，Yang et al. (2009) 採用資訊檢索 (Information Retrieval) 與協同過濾 (Collaborative Filtering) 的概念去識別意見句子，作法是使用者先提供一個查詢問句 (Query) 來表達感興趣的產品特徵，再利用協同過濾概念去抓取其他使用者的類似查詢問句，進行查詢問句擴充 (Query Expansion)，最後進行資訊檢索，找出相關的句子作為意見句子。

在上述的既有研究中，與本研究最相關的技術是 Yang et al. (2010) 的方法。本研究與 Yang et al. (2010) 的技術都使用類別關聯規則演算法來進行意見句子識別（分類）模型的學習，不同的地方主要在訓練資料的準備。Yang et al. (2010) 的方法採用完全監督式的做法，也就是需要人工將資料進行標註，作為訓練資料，再進行意見句子識

別模型的學習；而本研究的訓練資料準備是採用半監督式的作法，不需要進行人工資

料標註，相反地，僅需使用者提供少量的產品特徵關鍵字，之後系統會自動去尋找虛


6

擬訓練資料 (Pseudo-training Data)，進行識別模型學習。彙整而言，本研究的方法相較於 Yang et al. (2010) 方法的優勢是所需的人工成本低很多，以第四章的實驗設定為例，當要對某個產品特徵進行識別模型學習時，本研究的方法僅需使用者提供 1~3個關鍵字，而 Yang et al. (2010) 的方法需要人工閱讀所有 4,500個評論句子，並標註那些句子有討論特定的產品特徵。

此外，本研究與 Yang et al. (2009) 的方法也有一定的相似性，都是半監督式學習法，也就是都不需人工標註所有資料，但在分析的演算法上是不相同的，Yang et al. (2009) 是結合資訊檢索與協同過濾來進行意見句子識別，而本研究是使用類別關聯規則演算法。更重要的差異是，Yang et al. (2009) 的方法需要收集其他使用者的查詢期間 (Query Sessions)，才能夠進行協同過濾，而本研究的方法完全不需要，也不會面臨協同過濾常見的冷啟動 (Cold Start) 問題 (Schein, Popescul, Ungar, and Pennock, 2002; Zhou, Yang, and Zha, 2011)。

二、意見傾向判定

在產品特徵擷取完成後，意見傾向判定著重於決定產品特徵的情感類別，也就是

使為者的主觀評價是正向或負向。意見傾向判定最直覺的一個作法，就是將它視為句

子層級的情感分類 (Sentiment Classification)，也就是將每一個產品特徵所在的句子（或意見句子），給予一個適當的情感類別（通常分成正向、副向兩個類別） (Chen and Zhou, 2010; Das and Chen, 2007; Feldman, 2013; Liu, 2010, 2012; Pang, Lee, and Vaithyanathan, 2002; Pang and Lee, 2008; Stepinski and Mittal, 2007; Turney, 2002; Yang, Chen, and Chang, 2014)。情感分類的核心工作是利用一組標註好情感類別的訓練資料，以及一個監督式學習演算法（例如，類神經網路、決策樹、支援向量機、貝氏網路等），

去歸納學習一個分類器，可用來對未知類別的資料進行類別的預測。然而，一個句子

通常可能含有超過一個產品特徵，且針對同一個句子內的多個產品特徵，消費者可能

表達不同的情感傾向，若以情感分類的方式來處理，將無法分辨這種單一句子卻表達

不同意見傾向的現象。例如，本章第二段的例子，good是正向的意見字，用來描述battery life這個產品特徵；而 unacceptable 是負向意見字，用來描述 price這個產品特徵。因為他們同在一個句子中，傳統的情感分類技術不易處理這個現象。

另一種作法是以詞彙 (Lexicon) 為基礎的方法 (Ding, Liu, and Yu, 2008; Hu and Liu, 2004a, 2004b; Taboada, Brooke, Tofiloski, Voll, and Stede, 2011)，其採用一組正向與負向意見字詞（例如，分別使用好 (Good) 和壞 (Bad) 來表示正向和負向的意見字詞），和一些語言學規則 (Linguistic Rules) （例如，一個意見字詞（例如，bad）和否定字（例如，not）同時出現時，意見字詞的傾向需相反，即 bad的語意傾向是負向的，但 not bad的語意傾向卻是正向的），以決定在意見句子中產品特徵的語意傾向。


7

參、意見探勘技術開發

為協助自社群媒體中大量的使用者產生資料裡，挖掘出有價值的顧客知識，本研

究提出以半監督式學習的方式，自動產生某一產品特徵的類別關聯規則，用以自未經

標註的使用者產生資料中，找出討論該特定產品特徵的句子。本研究的規則式意見句

子識別 (Rule-based Opinion Sentence Identification; R-OSI) 技術架構如圖 1所示，主要由產品特徵識別規則學習 (Product Feature Identification Rule Learning) 與意見句子識別 (Opinion Sentence Identification) 兩個階段組成。以下詳述此兩階段的設計細節，相關變數的意義彙整如表 1。

圖 1 規則式意見句子識別技術架構

產品特徵識別規則學習 (Product Feature Identification Rule Learning)

意見句子識別 (Opinion Sentence Identification)

網頁爬行(Web Crawling)

前處理(Preprocessing)

規則學習(Rule Learning)

候選意見句子檢索(Candidate Opinion Sentence Retrieval)

網際網路(www)

產品特徵 fj

(Product Feature fj)

社群媒體文集(Social Media

Corpus)

產品特徵識別規則(Product Feature

Identification Rules)

使用者產生資料(User Generated

Content)

意見句子識別(Opinion Sentence

Identification)相關意見句子

(Relevant Opinion Sentences)


8

表 1 相關變數意義變數中文名稱意義

fj 產品特徵使用者感興趣的產品特徵，例如「電池」。一個產品可以有多項感興

趣的產品特徵。

SMC 社群媒體文集Social Media Corpus 的縮寫，為一組自社群媒體爬取，且未經標註的

使用者產生資料，用於產品特徵識別規則學習。

Qj 產品特徵關鍵字

一組描述產品特徵 fj 的關鍵字集合，例如，使用「「電池」、「充電」、

「鋰」、「鎳氫」」等關鍵字來描述產品特徵「電池」。Qj 可由一個

或多個關鍵字組成。

si 句子社群媒體文集或其他使用者產生資料中的任一句子。

COSj 候選意見句子集社群媒體文集中，包含 Qj 裡任一關鍵字的所有句子的集合。(COSj

SMC)

Wi 有意義字詞集合句子 si 在經過前處理步驟（包含詞性標註、詞幹還原、有意義字詞選

擇）後的字詞集合。wk 為 Wi 集合中的任一字詞。

PFIRj 產品特徵識別規則集合Product Feature Identification Rules 的縮寫，為類別關聯規則演算法

分析後的結果，一組可用來識別產品特徵 fj 的規則。

UGC 使用者產生資料文集

User Generated Content Corpus 的縮寫，為系統真正要進行情感分

析的目標文集，類似 Machine Learning 中的測試資料集。UGC 與

SMC 同樣都是自社群媒體爬取來的使用者產生資料，其交集可是空集

合或非空集合。

OSj 意見句子集Opinion Sentences 的縮寫，為 UGC 中系統判斷有討論產品特徵 fj 的

句子集合。(OSj UGC)

一、產品特徵識別規則學習

產品特徵識別規則學習階段的目的在利用使用者提供的少量產品特徵 fj描述，搭

配網際網路上擷取的未經標註的社群媒體文集，進行半監督式的學習，產生可以用來

識別句子是否討論產品特徵 fj的類別關聯規則。本階段主要有四個步驟，分別為網頁

爬行 (Web Crawling)、候選意見句子檢索 (Candidate Opinion Sentence Retrieval)、前處理 (Preprocessing) 與規則學習 (Rule Learning)。

（一）網頁爬行

為進行社群媒體的分析，首先得要撰寫網頁爬行器 (Web Crawler)，去抓取社群媒體上的使用者產生資料，形成社群媒體文集 (SMC)，以供後續分析之用。本研究自行撰寫網頁爬行器，根據研究主題的不同，至適當的網站，抓取相關主題的使用者評論，

作為社群媒體文集。此外，會先去除社群媒體文集中無意義的資料（例如，html標籤、廣告、無關外部連結等），並進行斷句，以句子作為文集中資料儲存與分析的單位。

（二）候選意見句子檢索

社群媒體文集 SMC中的句子可能討論各式各樣的產品特徵（例如，手機的電池、螢幕、價格⋯等），也可能是客觀的描述，因此要建構特定產品特徵 fj（例如，「電池」）


9

的識別規則前，得先從社群媒體文集眾多的句子中，挑選出確實討論產品特徵 fj的句

子來，這個過程稱為意見句子檢索。最簡單的進行方式為人工挑選，然而，社群媒體

文集的句子數量相當龐大，且要分析的產品與特徵也相當多，以人工挑選方式來進行

太過費時費工，因此本研究設計一個自動化的候選意見句子檢索機制。

本步驟的輸入資料為之前取得的社群媒體文集，加上一組由使用者提供的產品特

徵 fj描述關鍵字。假設我們要分析的對象為手機的「電池」特徵，則使用者可以提供「電池」、「充電」、「鋰」、「鎳氫」這類的關鍵字作為描述。我們將使用者提供

的產品特徵描述稱為 Qj，並利用 Qj對社群媒體文集 SMC進行布林檢索，也就是針對SMC中的每個句子 si，檢查是否出現 Qj中的關鍵字（Qj中如包含多個關鍵字，以 OR連接），如果有，則稱 si為候選意見句子。完成本步驟後，可得到產品特徵 fj的候選

意見句子集 COSj。

因為自動檢索的結果只是有較高的機會是在討論特定產品特徵 fj的句子，並不必

然是百分之百正確，因此稱之為候選意見句子。例如，“AIS lenses are a joy to use with the D70, even in the manual flash mode.” 這個句子雖然出現 “flash” 這個關鍵字，但該句子事實上是討論鏡頭 (Lens) 產品特徵，而非閃光燈 (Flash)。此外，自動檢索的候選意見句子無法涵蓋未出現產品特徵關鍵字的句子，例如 “I was amazed to see that it was sharp and relatively well exposed even with poor lighting.” 這個句子雖然沒有出現圖像品質 (Image Quality) 的關鍵字，但確實是在討論圖像品質這個產品特徵。以本研究第四章使用的資料為例，相較人工標註的正確結果，自動候選意見句子檢索的 Macro-precision與Macro-recall為 81.1%與 49.4%。

（三）前處理

前處理步驟的目的是將非結構化的候選意見句子，轉換成一組有意義的字詞，同

時豐富這組字詞的語意。本步驟包含三個子工作：詞性標註 (Part-of-speech (POS) Tagging)、詞幹還原 (Stemming)、有意義字詞選擇 (Meaningful Word Selection)。詞性標註的目的在對一串字元組成的句子，進行斷字以及詞性標註，以利後續分

析。因為本研究分析的文本為英文，因此採用 TreeTagger (Schmid, 1994, 1999) 來對候選意見句子集 COSj中的每個句子 si作詞性標註。假設要進行標註的句子為 “The battery life of this camera is good, but the price is unacceptable.”，在經過 TreeTagger標註後，其結果如圖 2。TreeTagger會將句子斷成每個字詞一行，每行含有三部分資訊，即原始字詞 (Original Word)、詞性 (POS) 與詞幹 (Stem)。緊接著，詞幹還原負責將同一個字詞的不同形式轉換成原形，以避免語意相同的

字詞被誤判為不同字詞的錯誤。例如，“takes,” “took,” “taken,” “taking” 等字詞都會以原形字詞 “take” 取代。由圖 2可以發現，TreeTagger也包含詞幹還原的功能（每一行的第三項資訊即為詞幹），因此本研究一樣使用 TreeTagger來作詞幹還原。


10

此外，由圖 2還可以發現，一個句子中存在許多經常出現，卻沒有太多語意含量的字詞，例如，定冠詞 “the”、介係詞 “of”、連接詞 “but” 等。一般而言，名詞用來表示被評論的對象（例如，電池 (Battery)）、動詞用來表示各類動作（例如，耗盡 (Drain)），而形容詞（例如，可再充電的 (Rechargeable)）和副詞（例如，電地 (Electrically)）分別用來修飾名詞和動詞，因此有意義字詞選擇步驟僅選取標註為名詞、動詞、形容詞與副詞 (Taboada et al., 2011) 的字詞。完成前處理後，每一個句子將改以一組由名詞、動詞、形容詞與副詞表示的字詞集合，即 Wi = {w

1, w

2, ⋯, w

ni}，其

中 ni為句子 si中有意義的字詞數。以圖 2為例，該句子 si將轉換成Wi = {“battery,” “life,” “camera,” “be,” “good,” “price,” “unacceptable”}。

• Original Word POS Stem

• The DT the

• battery NN battery

• life NN life

• Of IN of

• this DT this

• camera NN camera

• Is VBZ be

• good JJ good

• , , ,

• but CC but

• the DT the

• price NN price

• Is VBZ be

• unacceptable JJ unacceptable

• . SENT .

圖 2 TreeTagger詞性標註範例

（四）規則學習

本步驟的目的是對產品特徵 fj歸納出一組產品特徵識別規則，以利對未知的社群

媒體文章進行產品特徵的識別。之所以選擇類別關聯規則演算法作為 R-OSI技術的識別模型學習法，是因為 R-OSI技術並沒有完整的訓練資料集，其中需包含人工標註的正負範例（有討論／沒討論某個產品特徵）；相反地，只有系統自動檢索來的候選意


11

見句子 COSj，相當於正範例。在只有單一類別訓練資料的情況下，大多分類分析方法

都無法有效學習分類模式，但類別關聯規則演算法可以在只有正範例的情況下，依然

歸納出有效的分類規則，所以本研究選擇類別關聯規則演算法。

由於每個候選意見句子 si的 Wi必然包含產品特徵 fj的描述 Qj中的關鍵字，我們

可以學習一組符合 a→ b形式的識別規則，其中 a是任一個在候選意見句子集 COSj

中的有意義字詞（即），而 b則為產品特徵 fj的描述Qj中的任一關鍵字（即

b Qj），因為 Qj中的關鍵字可能超過一個，本研究將屬於同一個產品特徵 fj的所有

規則彙整（聯集）在一起，再進行後續的支持度和信心度的計算。這樣的規則意味著

當字詞 a出現時，此評論句子有很高的機率是產品特徵 fi的意見句子。例如，考慮下

面的識別規則：

鋰離子 (Lithium-ion)→ 電池 (Battery)毫安時 (mAh)→ 電池 (Battery)可再充電的 (Rechargeable)→ 電池 (Battery)它們指出，假如字詞「鋰離子」、「毫安時」，或「可再充電的」出現在一個句

子中時，我們有較高的信心相信這個句子是有討論產品特徵「電池」的句子，因此可

以將這個句子視為「電池」的意見句子。

為從候選意見句子 COSj中自動學習需要的產品特徵識別規則，本研究採用類別

關聯規則 (Class Association Rules) 演算法 (Yang et al., 2010) 來進行分析。該演算法跟關聯規則 (Association Rules) 演算法 (Agrawal and Srikant, 1994; Srikant and Agrawal, 1995) 類似，唯一個差別是限定規則的右邊須為產品特徵描述 Qj中的關鍵字，而規則

左邊則可以是候選意見句子集 COSj中的任意字詞。具體而言，在給定最小支持度 (Min-supp) 和最小信心度 (Min-conf) 門檻值的前提下，若一個規則 a→ b的支持度 (Support)（式 1）與信心度 (Confidence)（式 2）皆大於 Min-supp與 Min-conf門檻值，則該規則可視為一個有趣的規則並保留下來，反之，則視為無意義的規則並忽略之。

（式 1）

（式 2）

其中，n(a

∩

b)為同時出現字詞a和字詞b的句子數，n(a)為僅出現字詞a的句子數，N為總句子數。

在完成規則學習步驟後，可以針對每一個產品特徵 fj產生一組產品特徵識別規則 (Product Feature Identification Rules)，稱為PFIRj，其為所有有趣規則的左邊字詞（即a）的集合。


12

二、意見句子識別

意見句子識別階段的目的在於利用前一階段產生的產品特徵識別規則 PFIRj，來

針對一組自社群媒體抓取來的未經標註的使用者產生資料 (User Generated Content) 文集 UGC，自動識別那些句子有討論特定產品特徵 fj。

目前本研究採用簡單直覺的識別方式。假設要識別的是產品特徵 fj，則針對 UGC中的每個句子 sk，檢查 Wk中是否含有 PFIRj中的字詞，如果有，則視句子 sk為討論產

品特徵 fj的意見句子，最後產出一組產品特徵 fj的意見句子集 OSj。一個句子 sk可以

同時被歸類為多個產品特徵 fj的意見句子，以圖 2個句子為例，其可能同時被歸類為討論「電池」與「價格」的意見句子。

肆、實驗評估

本章詳述本研究的實驗評估方式，包括資料收集、評估指標以及重要實驗結果的

探討。

一、資料收集

為評估本研究的規則式意見句子識別技術，主要需要兩組資料集。第一個是產品

特徵識別規則學習階段的社群媒體文集，第二個是意見句子識別階段的使用者產生資

料文集。

為簡化資料收集以及後續分析的複雜度，本研究將分析的主題聚焦在數位相機

上，因此在社群媒體文集的收集上，選定三個重要的數位相機評論網站，分別是

RateItAll2、Epinions3與 Amazon4，自網站上抓取歸類在數位相機類別下的評論文章。

且為了分析文集大小對產品特徵識別規則學習成果的影響，特別將收集來的資料分成

小文集與大文集。小文集僅含自 RateItAll抓取的資料，總共有 442,509個未經標註的數位相機評論句子。大文集則混和了 RateItAll、Epinions與 Amazon三個資料來源所抓到的所有資料，總共有 2,318,823個句子。在意見句子識別階段的使用者產生資料文集方面，實際使用時只須準備尚未標註

且有興趣分析的資料，讓產品特徵識別規則來進行自動識別。但為了評估技術的效能，

所以在實驗中，需對這些這些句子進行人工標註，以得到標準答案，方便後續評估指

標的計算。本研究採用的是 Yang et al. (2010) 使用的資料集，該資料集原本包含 3,000個自 Amazon取得，且標註好的句子，涵蓋的產品特徵有電池 (Battery)、閃光燈

2 http://www.rateitall.com/3 http://www.epinions.com/4 http://www.amazon.com/


13

(Flash)、圖像 (Image)、鏡頭 (Lens)、記憶體 (Memory)、價格 (Price)、螢幕 (Screen) 與視頻 (Video)等八類。為增加實證評估結果一般化的能力，本研究另外自 Google Shopping5網站的數位相機類別中抓取了 1,500個句子，並參考相同的八類產品特徵來進行資料標註。彙總而言，用於評估技術效能的使用者產生資料文集包含 4,500個句子，各產品特徵的意見句子數為：電池 182句、閃光燈 99句、圖像 499句、鏡頭 324句、記憶體 111句、價格 284句、螢幕 133句與視頻 115句。表二為社群媒體文集與使用者產生資料文集兩個資料集的摘要說明。

另外，本研究還需要人工給定產品特徵的描述關鍵字（即 Qj），以便進行識別規

則學習。為分析產品特徵描述關鍵字完整程度對學習效果的影響，本研究針對每個產

品特徵分別給予一到三個關鍵字作為描述，各產品特徵的三個關鍵字依序為：電池 (Battery, Rechargeable, Lithium)、閃光燈 (Flash, Temperature, Synchronization)、圖像 (Image, Quality, Resolution)、鏡頭 (Lens, Zoom, Aperture)、記憶體 (Memory, Storage, Card)、價格 (Price, Money, Cost)、螢幕 (Screen, LCD, Inch)與視頻 (Video, Movie, FullHD)。

表 2 資料集摘要資料集資料來源描述

社群媒體文集 (SMC)

小文集 RateItAll 442,509 個未經標註的評論句子。

大文集RateItAll、Epinions、Amazon

2,318,823 個未經標註的評論句子。

使用者產生資料文集 (UGC)Amazon、Google Shopping

4,500 個經人工標註的評論句子（3,000 句來自

Amazon、1,500 句來自 Google Shopping），

作為驗證系統效能的測試資料。

二、評估指標

本研究採用常見的 F-measure當成評估指標。但由於我們的資料集涵蓋了 8個產品特徵，因此將 8個產品特徵的 F-measure值採用巨觀 (Macro) 和微觀 (Micro) 兩個方式來進行彙總 (Yang et al., 2010)，做為整體表現的評估指標。以表 3的混淆矩陣為例，進行評估指標的定義。假設真實類別為討論產品特徵 f且被技術標註為討論產品特徵 f的句子有 tpf句、真實類別為討論產品特徵 f且被技術標註為不是討論產品特徵 f的句子有 fnf句、真實類別為不是討論產品特徵 f且被技術標註為討論產品特徵 f的句子有fpf句、真實類別為不是討論產品特徵 f且被技術標註為不是討論產品特徵 f的句子有tnf句，首先定義準確率 (Precision) 與召回率 (Recall) 如式 3與式 4。而 F-measure為準

5 http://www.google.com/shopping


14

確率與召回率的加權調和平均數，定義如式 5。

（式 3）

（式 4）

（式 5）

表 3 混淆矩陣預測類別

Positive Negative

真實類別Positive tpf fnf

Negative fpf tnf

在計算完各個產品特徵個別的 F-measure後，緊接著利用巨觀 (Macro) 和微觀 (Micro) 兩個方式來進行彙總，以簡化實驗數據，方便解讀。假設所有產品特徵的集合為 F，則彙整後的Macro-precision, Macro-recall與Macro-F-measure定義如式 6、式 7 與式 8。

（式 6）

（式 7）

（式 8）

而彙整後的 micro-precision, micro-recall與 micro-F-measure，則分別定義於式 9、式 10與式 11。

（式 9）

（式 10）

（式 11）


15

三、實驗結果

最小支持度 (Min-supp) 和最小信心度 (Min-conf) 的值，對於本研究的規則式意見句子識別技術有很大的影響，此外，最小支持度和最小信心度的最佳值通常都是隨著

資料集而改變的，並沒有單一的建議最佳值，因此，我們首先對這兩個參數進行分析，

以找出適用本研究資料集的最佳門檻值。根據本研究初期的研究結果，在使用小的社

群媒體文集，一個產品特徵描述關鍵字的前提下，我們分析最小支持度在 0.0001、0.0003、0.0005、0.0007、0.0009、0.001、0.003、0.005、0.007和 0.009，以及最小信心度從 0.1到 0.5每次增加 0.05，不同參數組合對規則式意見句子識別 (R-OSI) 技術的影響。Macro-F-measure與Micro-F-measure的結果分別如圖 3、圖 4所示。R-OSI技術的最佳Macro-F-measure值為最小支持度門檻值設為 0.005、最小信心度門檻值設為0.15時達到的 74.8%，而最佳Micro-F-measure一樣是在最小支持度門檻值設為 0.005、最小信心度門檻值設為 0.15時達到，值為的 68.0%。而 Yang et al. (2010) 在人工準備訓練樣本，且只使用 3,000句測試句子的前提下，達到的最佳 Macro-F-measure與Micro-F-measure值分別為 72.3%與 69.0%，本研究的 R-OSI技術在Macro-F-measure上高出 2.5%，而 Micro-F-measure則低 1.0%。如果用本研究的 R-OSI技術，分析Yang et al. (2010) 的 3,000句子，最佳Macro-F-measure和Micro-F-measure分別為72.0% 與 68.9%，差異不大。彙整而言，考量 R-OSI技術只需要人工為每一個產品特徵提供一個關鍵字，這樣的表現是相當優異的。

0.0001

0.0007

0.003

0.009

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5

Min-Supp

F-m

easu

re

Min-conf

圖 3 R-OSI技術在不同最小支持度與最小信心度下 Macro-F-measure值


16

此外，R-OSI技術確實可以自動地學習出一些有意義，而且有趣的產品特徵識別規則。表 4顯示了在數位相機中「電池 (Battery)」產品特徵的識別規則，這些規則與相關支持度和信心度數值都是系統的真實產出，因此，如果以 R-OSI的最佳參數門檻值 0.005 和 0.15 為例，“Rechargeable” → “Battery,” “AA” → “Battery,” “Life” → “Battery,” “Charger” → “Battery” 等四個規則支持度都大於 0.005，且信心度也高於

0.15，因此會被用來進行意見句子識別。

表 4 R-OSI技術產生的產品特徵識別規則產品特徵識別規則支持度信心度

Rechargeable → Battery 0.006 0.906

Drain → Battery 0.002 0.877

Lithium → Battery 0.002 0.869

Alkaline → Battery 0.002 0.867

AA → Battery 0.006 0.839

Life → Battery 0.011 0.826

NiMH → Battery 0.003 0.819

Charger → Battery 0.005 0.765

0.0001

0.0007

0.003

0.009

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.1 0.15 0.2 0.25 0.3 0.35 0.4 0.45 0.5

Min-Supp

F-m

easu

re

Min-conf

圖 4 R-OSI技術在不同最小支持度與最小信心度下 Micro-F-measure值


17

因為上述的Macro-F-measure與Micro-F-measure值是 8個產品特徵平均過後的結果，無法看出不同產品特徵間是否存在差異，因此我們將 8個產品特徵各別的表現與Macro-F-measure與Micro-F-measure的結果彙整在表 5中。在 8個產品特徵中，電池、閃光燈、鏡頭、記憶體與螢幕等 5個產品特徵的各別 F-measure值高於Macro-average與Micro-average F-measure；而圖像、價格、視頻等 3個產品特徵的各別 F-measure值則低於Macro-average與Micro-average F-measure。造成這些差異的主要原因是可用於描述特定產品特徵的概念數量，尤其當隱性描述的數量多時，通常效果會比較差。以

圖像為例，除了易懂的 “Image,” “Picture,” “Quality,” “Resolution” 等關鍵字外，還有很多未出現特定關鍵字的隱性描述，例如，“Well, this is so sharp that I can’t imagine how 7 megapixels would be like.” 與 “Colors are vibrant.” 這兩個句子沒有出現常見的重要關鍵字，因此不容易正確的識別為討論圖像產品特徵。

表 5 R-OSI技術在 8個產品特徵的各別表現產品特徵 Precision Recall F-measure

電池 (Battery) 87.9% 96.2% 91.9%*

閃光燈 (Flash) 70.7% 91.9% 79.5%

圖像 (Image) 53.5% 35.1% 42.4%

鏡頭 (Lens) 86.8% 68.8% 76.8%

記憶體 (Memory) 73.9% 89.2% 80.8%

價格 (Price) 87.3% 43.5% 58.1%

螢幕 (Screen) 80.4% 92.4% 86.0%

視頻 (Video) 78.9% 61.7% 69.2%

Macro-Average 77.3% 72.4% 74.8%

Micro-Average 75.4% 61.9% 68.0%

*：粗體表示各別產品特徵的 F-measure 值高於 Macro- 與 Micro-Average。

緊接著，我們分析社群媒體文集大小對 R-OSI技術的影響。表 6為使用一個產品特徵關鍵字的情況下，大文集與 RateItAll、Epinions、Amazon等三個小文集對 R-OSI技術的影響。可以發現，大文集與 Epinions小文集在Macro-F-measure與Micro-F-measure都有最佳的效能，兩個文集間僅有 0.1%差異；其次為 RateItAll小文集，Macro-F-measure與Micro-F-measure分別低 Epinions小文集 1.1%與 2.2%；效果最差的是Amazon小文集，Macro-F-measure與Micro-F-measure分別低Epinions小文集 1.2%與2.4%。由上述結果可以發現，最佳的小文集與大文集的效果幾乎相等，都能讓R-OSI技術達到最佳的效能，但最佳的小文集需要經過實證評估才能找出，而大文集不需評

估即可直接採用，但在運算效率上會略低於小文集。


18

表 6 社群媒體文集大小對 R-OSI技術的影響門檻值 # Precision Recall F-measure

小文集 (RateItAll)Macro 0.005; 0.15 77.3% 72.4% 74.8%

Micro 0.005; 0.15 75.4% 61.9% 68.0%

小文集 (Epinions)Macro 0.007; 0.15 77.2% 74.5% 75.9%

Micro 0.005; 0.15 75.1% 66.0% 70.2%

小文集 (Amazon)Macro 0.005; 0.15 77.6%* 71.9% 74.7%

Micro 0.005; 0.15 75.7% 61.5% 67.8%

大文集 (R ∩A ∩E┴)Macro 0.005; 0.2 77.1% 74.6% 75.8%

Micro 0.005; 0.2 74.4% 65.5% 70.1%

#：依序為最小支持度 (Min-supp) 門檻值與最小信心度 (Min-conf) 門檻值。

┴：R ∩A ∩E 表示將 RateItAll、Amazon 與 Epinions 三個文集做聯集。

*：粗體字代表各指標在大小文集間最佳的表現。

除了社群媒體文集大小的影響外，R-OSI技術的效能是否會隨著社群媒體文集中句子數的減少而隨之下降，也是一項重要的議題。因此，我們自原本包含 44萬個句子的 RateItAll社群媒體文集中，分別抽出 30萬、20萬和 10萬個句子，降地社群媒體文集的句子數，來評估對 R-OSI技術的影響，結果如表 7所示。可以發現，RateItAll文集的句子數降到 30萬和 20萬句時，R-OSI技術的效能完全沒有改變，當句子數降到 10萬句時，Macro-F-measure與Micro-F-measure的值才分別下降 0.1%與0.2%，可見 R-OSI技術的穩定性相當好。

表 7 社群媒體文集句子數對 R-OSI技術的影響（使用 RateItAll文集）句子筆數門檻值 # Precision Recall F-measure

44 萬Macro 0.005; 0.15 77.3% 72.4% 74.8%

Micro 0.007; 0.15 75.4% 61.9% 68.0%

30 萬Macro 0.005; 0.15 77.3% 72.4% 74.8%

Micro 0.005; 0.15 75.5% 61.8% 68.0%

20 萬Macro 0.005; 0.15 77.3% 72.4% 74.8%

Micro 0.005; 0.15 75.5% 61.8% 68.0%

10 萬Macro 0.005; 0.15 77.6% 71.9% 74.7%

Micro 0.005; 0.15 75.7% 61.5% 67.8%

最後，我們分析產品特徵描述關鍵字完整性對 R-OSI技術的影響，在使用小社群媒體文集的情況下（在大文集裡有相似的發現），分別利用一個、二個和三個產品特

徵描述關鍵字來學習產品特徵識別規則，結果如表 8所示。可以發現，當產品特徵描述完整性越高時（即關鍵字字數越多），多可以提升召回率 (Recall) 指標，因為較完


19

整的產品特徵描述產出的產品特徵識別規則數量會較多，所以可以有較高的召回率。

不過，較高的召回率通常會伴隨著準確率 (Precision) 下降，在我們的評估結果中確實也可以發現此現象。以一個關鍵字和三個關鍵字的Macro-average指標為例，三個關鍵字的Macro-recall值為 86.4%，比一個關鍵字的 72.4%高了 14.0%，但三個關鍵字的 Macro-precision值只有 74.8%，比一個關鍵字的 77.3%低了 2.5%。但從整體的Macro-F-measure指標來看，三個關鍵字相較於一個關鍵字還是提升的，分別為 80.2% 與 74.8%，相差 5.4%。

表 8 產品特徵描述完整性對 R-OSI技術的影響（小文集）門檻值 # Precision Recall F-measure

一個關鍵字Macro 0.005; 0.15 77.3% 72.4% 74.8%

Micro 0.005; 0.15 75.4% 61.9% 68.0%

二個關鍵字Macro 0.007; 0.15 74.8% 86.3% 80.2%

Micro 0.009; 0.2 75.9% 69.5% 72.5%

三個關鍵字Macro 0.007; 0.15 74.8% 86.4% 80.2%

Micro 0.005; 0.25 73.1% 72.5% 72.6%

#：依序為最小支持度 (Min-supp) 門檻值與最小信心度 (Min-conf) 門檻值。

* ：粗體字代表不同特徵描述完整性下較佳的表現。

伍、結論

本研究專注在意見探勘中意見句子識別的工作，為改善傳統監督式學習法在準備

訓練資料上所需投入的大量人力與時間，本研究提出僅需要使用者提供少量的關鍵

字，再輔以網路抓取來未經人工標註的使用者產生資料，便能夠進行半監督式的學習，

產生與監督式學習相似甚至更佳的探勘結果。具體而言，本研究採用類別關聯規則演

算法來進行半監督式學習，提出規則式意見句子識別技術 (R-OSI)。根據實驗評估結果，本研究的 R-OSI在只使用一個關鍵字來描述產品特徵的前提下，與先前研究的監督式方法相比，Macro-F-measure高出 2.5%，而Micro-F-measure則低了 1.0%。如果將關鍵字數量從一個增加到三個，可以發現 R-OSI的效果在 Macro-F-measure與Micro-F-measure指標上分別可以提升 5.4%與 4.6%，優於先前監督式學習法的結果。彙整而言，本研究所提R-OSI方法屬於聚焦式技術，且採用的是半監督式學習法，

因此相較文獻中的相關工作，R-OSI技術的第一個優點是不需要花費大量的人力進行訓練資料的標註與準備。此外，本研究需要人工給予的資料僅有少量用於描述產品特

徵的關鍵字，相較於其他半監督式技術，需要的輸入也是相對簡單的。因此，R-OSI技術的實用性是比較高的，也比較不會受到領域相依這類情感分析常見問題的影響。

然而，R-OSI技術也不是在任何類型的產品或評論文章上都可以有相似的效能表現的。


20

根據本研究的觀察，如果屬於網路討論量豐富的產品種類（例如，汽機車、3C商品、飯店、美妝用品等），且有明確的產品特徵，則適合使用 R-OSI技術來進行分析。討論量少的產品種類（例如，量販賣場、DIY家具），因根本缺少分析的資料，所以不論使用哪種分析技術，都無法取得有代表性的結果；而產品特徵明確的部分，因為

R-OSI技術是使用關鍵字來進行候選意見句子檢索，並據以進行後續的意見句子識別模型學習，所以如果無法明確地用幾個關鍵字來描述某個產品特徵的話，R-OSI技術通常無法有良好的表現，但如果是採用監督式學習法，則仍然可能有不錯的效果。

本研究仍有下列的研究限制與未來研究方向：(1)本研究只使用一組數位相機資料來評估所提規則式意見句子識別技術的效能，未來應收集更多類的資料來進行評估，

以增強結論的一般化能力。(2)目前使用的類別關聯規則只採用最小支持度與最小信心度來衡量規則的有趣性，文獻中已有許多改良式的類別關聯規則探勘演算法，實證

評估比較不同類別關聯規則或有趣性指標在意見句子識別上的效果，是一個重要而且

有趣的研究方向。(3)目前本研究所提方法只解決了意見探勘的第一項核心工作，未來可延伸所提方法，同樣利用規則式的技術，設計搭配的意見傾向判定技術。


21

Mining Consumer Knowledge from Social Media: Development of an Opinion Mining Technique

1. Purpose/ObjectiveSocial media and user-generated content have been considered as vital assets for

supporting crucial business intelligence applications. The knowledge gained from social media can potentially lead to the development of novel services or products that are tailored more effectively to users’ needs, and can also be used to meet the objectives of the businesses offered to them. Online consumer reviews are a crucial type of user-generated content. Appropriate analysis of consumer reviews not only provides valuable information to facilitate the purchase decisions of consumers, but also helps merchants or product manufacturers understand the general responses of customers regarding their services or products for marketing campaign improvement. However, the rapidly increasing number of consumer reviews hinders both consumers and businesses from obtaining a comprehensive view of consumer opinions pertaining to products of interest when manual analysis techniques are used. Therefore, developing techniques for analyzing the exceedingly high number of online consumer reviews and summarizing their sentiments pertaining to specific products is desirable and essential. The objective of this study was to design a novel and effective opinion sentence identification technique, which can analyze and summarize consumer sentiments pertaining to specific products from online consumer reviews.

2. Design/Methodology/ApproachWe propose a rule-based opinion sentence identification (R-OSI) technique, which can

retrieve relevant review sentences to a specific product feature of interest from a large volume of consumer reviews. Assume that we are interested in the “battery” product feature of a digital camera; the R-OSI technique finds review sentences that discuss the battery product feature from the selected digital camera consumer review corpus. Specifically, the R-OSI technique adopts a semi-supervised learning approach by requesting a user to provide keywords (e.g., battery, rechargeable, lithium) to describe the target product feature (e.g., battery). In addition, a set of unannotated consumer reviews are retrieved from various social media websites. On the basis of the user-provided keywords and the set of unannotated

Chin-Sheng Yang, Assistant Professor, Department of Information Management, Yuan Ze University

Pei-Yun Xie, Master, Department of Information Management, Yuan Ze University

Hsiao-Ping Shih, Master, Department of Information Management, Yuan Ze University


22

consumer reviews, the class association rule mining algorithm is applied to learn a set of opinion sentence identification rules for the target product feature. The identification rules can be employed to distinguish between opinionated sentences and non-opinionated sentences of the target product feature.

3. FindingsTo evaluate the effectiveness of the proposed R-OSI technique, a data set comprising of

4,500 consumer review sentences regarding digital cameras was collected from Amazon and Google Shopping. We manually labeled each review sentence as containing zero or more product features. Across the whole data set, we identified eight product features: battery, flash, image quality, lens, memory, price, screen, and video; for which 149, 83, 273, 241, 95, 156, 102, and 67 sentences were respectively identified as opinion sentences. A supervised learning approach proposed by Yang et al. (2010) was adopted as our performance benchmark. A class association rule mining algorithm was also adopted as its underlying induction learning method. According to our empirical evaluation results, the R-OSI technique, with one keyword for each product feature, achieved comparable performance with the fully supervised benchmark approach, in which the manual preparation of training data is necessary. Furthermore, when we increased the number of keywords for each product feature from one to three, the performance of the R-OSI technique improved appreciably. The macro- and micro-F-measure values increased by 5.4% and 4.6%, respectively. We also investigated the sensitivity of the R-OSI technique to the size of the set of unannotated consumer reviews. The proposed technique demonstrated stable performance for different sizes of unannotated consumer review corpora. Overall, these findings suggest that the proposed R-OSI technique achieves promising performance in opinion sentence identification, even when a supervised learning approach is adopted as the performance benchmark.

4. Research Limitations/ImplicationsIn addition to theoretical contributions to the literature on opinion mining, our study

offers several practical implications. First, social media can be a crucial resource for companies to explore and develop business intelligence. However, understanding consumer-oriented business insights embedded in the considerable amount of online consumer reviews can be highly challenging. Effective opinion mining techniques or tools can increase the


23

chance of success for a social media listening project. Second, in addition to opinion sentence identification, many essential opinion mining tasks remain (e.g., opinion orientation determination and opinion searching). The proposed semi-supervised learning concept can be easily extended to support other opinion mining tasks.

Nonetheless, several limitations of this study must be acknowledged. First, only one data set is employed to evaluate the performance of the R-OSI technique. To increase the reliability of the evaluation results, it is essential to collect additional data sets from diverse sources. Second, the traditional association rule mining algorithm with minimum support and minimum confidence as interest measures is adopted to learn the opinion sentence identification rules. Some novel interest measures or extended association rule mining algorithms are proposed in the literature. Extending the R-OSI technique to novel designs represents a compelling future research direction. Finally, only the task of opinion sentence identification was addressed in this study. The concept of the R-OSI technique must be applied to other vital opinion mining tasks such as opinion orientation determination.

5. Originality/ContributionMost relevant studies have adopted either a supervised learning or an unsupervised

learning approach for the target opinion sentence identification task. A supervised learning approach generally demonstrates higher performance but requires manual preparation of training data. By contrast, an unsupervised learning approach requires no time- or labor-consuming process for preparing training data at the cost of lower identification effectiveness. Therefore, this study proposes a semi-supervised learning approach. The R-OSI technique can achieve comparable or even higher performance than that from using a supervised learning approach without the need for preparing training data. Only some (one to three in this study) keywords are provided by the user. The cognitive effort of keyword preparation should be lower than that of training data preparation. This represents the major contribution and novelty of this study to the literature on opinion mining. Furthermore, the R-OSI technique has additional advantages. First, it can identify opinion sentences with implicit product features. Second, the R-OSI technique can group opinion sentences with different product feature synonyms into the same category.


24

參考文獻

鍾珍珠與郭玉慧，2014，從「巨量資料」綜觀全國性繳費即時交易的成長遠景，財金資訊季刊，80期：2-8。(Chung, Chen-Chu, and Kuo, Yu-Hui. 2014. A big data perspective on the growth vision of the nationwide e-payment transactions. Financial Information Quarterly, 80: 2-8.)

蘇俊榮，2015，財政巨量資料的未來與挑戰，http://www.cse.yzu.edu.tw/qpi/download/speech/1040109_PDF.pdf，搜尋日期：2015年 5月 31日。(Su, Chun-Jung. 2015. Future opportunities and challenges in financial big data. http://www.cse.yzu.edu.tw/qpi/download/speech/1040109_PDF.pdf. Accessed May. 31, 2015.)

Agrawal, R., and Srikant, R. 1994. Fast algorithms for mining association rules in large databases. In Bocca, J. B., Jarke, M., and Zaniolo, C. (Eds.), Proceedings of the 20th International Conference on Very Large Data Bases: 487-499. San Francisco, CA: Morgan Kaufmann.

Chen, H., Chiang, R. H. L., and Storey, V. C. 2012. Business intelligence and analytics: From big data to big impact. MIS Quarterly, 36 (4): 1165-1188.

Chen, W., and Zhou, J. 2010. A text classifier with domain adaptation for sentiment classification. In Cheng, P. J., Kan, M. Y., Lam, W., and Nakov, P. (Eds.), Lecture Note in Computer Science: Vol. 6458. Information Retrieval Technology: 61-72. Berlin, Germamy: Springer. doi: 10.1007/978-3-642-17187-1_6

Das, S., and Chen, M. 2007. Yahoo! for Amazon: Sentiment extraction from small talk on the web. Management Science, 53 (9): 1375-1388. doi: 10.1287/mnsc.1070.0704

Ding, X., Liu, B., and Yu, P. S. 2008. A holistic lexicon-based approach to opinion mining. In Proceedings of the 2008 International Conference on Web Search and Data Mining: 231-240. New York, NY: Association for Computing Machinery. doi: 10.1145/1341531.1341561

EMC Digital Universe with Research & Analysis by IDC. 2014. The digital universe of opportunities: Rich data and the increasing value of the internet of things. http://www.emc.com/leadership/digital-universe/2014iview/index.htm. Accessed May. 26, 2015.

Etzioni, O., Cafarella, M., Downey, D., Popescu, A., Shaked, T., Soderland, S., Weld, D. S., and Yates, A. 2005. Unsupervised named-entity extraction from the web: An experimental study. Artificial Intelligence, 165 (1): 91-134. doi: 10.1016/j.artint. 2005.03.001


25

Feldman, R. 2013. Techniques and applications for sentiment analysis. Communications of the ACM, 56 (4): 82-89. doi: 10.1145/2436256.2436274

Ginsberg, J., Mohebbi, M. H., Patel, R. S., Brammer, L., Smolinski, M. S., and Brilliant, L. 2009. Detecting influenza epidemics using search engine query data. Nature, 457: 1012-1014. doi: 10.1038/nature07634

Hu, M., and Liu, B. 2004a. Mining and summarizing customer reviews. In Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining: 168-177. New York, NY: Association for Computing Machinery. doi: 10.1145/1014052.1014073. 2004b. Mining opinion features in customer reviews. In Cohn, A. G. (Ed.), Proceedings of the 19th National Conference on Artificial Intelligence: 755-760. San Jose, CA: American Association for Artificial Intelligence.

Kobayashi, N., Iida, R., Inui, K., and Matsumoto, Y. 2005. Opinion extraction using a learning-based anaphora resolution technique. Paper presented at the Second International Joint Conference on Natural Language Processing, Jeju, Korea.

Kobayashi, N., Inui, K., Matsumoto, Y., Tateishi, K., and Fukushima, T. 2004. Collecting evaluative expressions for opinion extraction. In Su, K., Tsujii, J., Lee, J., and Kwong, O. Y. (Eds.), Proceedings of the First International Joint Conference on Natural Language Processing: 596-605. Berlin, Germany: Springer-Verlag. doi: 10.1007/978-3-540-30211-7_63

Liu, B. 2010. Sentiment analysis and subjectivity. In Indurkhya, N., and Damerau, F. J. (Eds.), Handbook of Natural Language Processing (2nd ed.): 627-666. Boca Raton, FL: Chapman & Hall/CRC.. 2012. Sentiment Analysis and Opinion Mining. San Rafael, CA: Morgan & Claypool.

Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., and Byers, A. H. 2011. Big data: The next frontier for innovation, competition, and productivity. http://www.mckinsey.com/insights/business_technology/big_data_the_next_frontier_for_innovation. Accessed May. 26, 2015.

Naisbitt, J. 1982. Megatrends: Ten New Directions Transforming Our Lives. New York, NY: Warner Books.

Pang, B., and Lee, L. 2008. Opinion mining and sentiment analysis. Foundations and Trends in Information Retrieval, 2 (1-2): 1-135. doi: 10.1561/1500000011


26

Pang, B., Lee, L., and Vaithyanathan, S. 2002. Thumbs up?: Sentiment classification using machine learning techniques. In Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing: 79-86. Stroudsburg, PA: Association for Computational Linguistics. doi: 10.3115/1118693.1118704

Popescu, A., and Etzioni, O. 2005. Extracting product features and opinions from reviews. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing: 339-346. Stroudsburg, PA: Association for Computational Linguistics. doi: 10.3115/1220575.1220618

Schein, A. I., Popescul, A., Ungar, L. H., and Pennock, D. M. 2002. Methods and metrics for cold-start recommendations. In Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval: 253-260. New York, NY: Association for Computing Machinery. doi: 10.1145/564376.564421

Schmid, H. 1994. Probabilistic part-of-speech tagging using decision trees. Paper presented at the International Conference on New Methods in Language Processing, Manchester, UK.

. 1999. Improvements in part-of-speech tagging with an application to German. In Armstrong, S., Church, K., Isabelle, P., Manzi, S., Tzoukermann, E., and Yarowsky, D. (Eds.), Text, Speech and Language Technology: Vol. 11. Natural Language Processing Using Very Large Corpora: 13-25. Dordrecht, Netherlands: Springer Netherlands. doi: 10.1007/978-94-017-2390-9_2

Srikant, R., and Agrawal, R. 1995. Mining generalized association rules. In Dayal, U., Gray, P. M. D., and Nishio, S. (Eds.), Proceedings of the 21st International Conference on Very Large Data Bases: 407-419. San Francisco, CA: Morgan Kaufmann.

Stepinski, A., and Mittal, V. 2007. A fact/opinion classifier for news articles. In Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval: 807-808. New York, NY: Association for Computing Machinery. doi: 10.1145/1277741.1277919

Taboada, M., Brooke, J., Tofiloski, M., Voll, K., and Stede, M. 2011. Lexicon-based methods for sentiment analysis. Computational Linguistics, 37 (2): 267-307. doi: 10.1162/COLI_a_00049

Turney, P. D. 2002. Thumbs up or thumbs down?: Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the 40th Annual Meeting on Association for Computational Linguistics: 417-424. Stroudsburg, PA: Association for Computational Linguistics. doi: 10.3115/1073083.1073153


27

Wei, C. P., Chen, Y. M., Yang, C. S., and Yang, C. C. 2010. Understanding what concerns consumers: A semantic approach to product feature extraction from consumer reviews. Information Systems and e-Business Management, 8 (2): 149-167. doi: 10.1007/s10257-009-0113-9

Wong, T. L., and Lam, W. 2005. Hot item mining and summarization from multiple auction web sites. In Proceedings of the Fifth IEEE International Conference on Data Mining: 797-800. Washington, DC: IEEE Computer Society. doi: 10.1109/ICDM.2005.78

. 2008. Learning to extract and summarize hot item features from multiple auction web sites. Knowledge and Information Systems, 14 (2): 143-160. doi: 10.1007/s10115-007-0078-2

Yang, C. C., Tang, X., Wong, Y. C., and Wei, C. P. 2010. Understanding online consumer review opinions with sentiment analysis using machine learning. Pacific Asia Journal of the Association for Information Systems, 2 (3): 73-89.

Yang, C. S., Chen, C. H., and Chang, P. C. 2014. Harnessing consumer reviews for marketing intelligence: A domain-adapted sentiment classification approach. Information Systems and e-Business Management, 13 (3): 403-419. doi: 10.1007/s10257-014-0266-z

Yang, C. S., Wei, C. P., and Yang, C. C. 2009. Extracting customer knowledge from online consumer reviews: A collaborative-filtering-based opinion sentence identification approach. In Proceedings of the 11th International Conference on Electronic Commerce: 64-71. New York, NY: Association for Computing Machinery. doi: 10.1145/1593254.1593264

Zhou, K., Yang, S. H., and Zha, H. 2011. Functional matrix factorizations for cold-start recommendation. In Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval: 315-324. New York, NY: Association for Computing Machinery. doi: 10.1145/ 2009916.2009961


28

作者簡介*楊錦生國立中山大學資訊管理學系博士，目前為元智大學資訊管理學系助理教授。研究

專長為文字與網頁探勘、社群媒體分析、商業智慧，學術論文曾發表於 Journal of Management Information Systems、Decision Sciences、Decision Support Systems、Information Processing and Management、Information Systems and E-Business Management與資訊管理學報。

謝佩芸

元智大學資訊管理學系碩士，研究專長為意見探勘。

施曉萍

元智大學資訊管理學系碩士，研究專長為意見探勘。

致謝：本研究獲行政院科技部與國家科學委員會專題研究計畫部份經費補助 (NSC 100-2410-H-155-013-MY3、MOST 103-2410-H-155-027-MY3)，謹此致謝。

* E-mail: [email protected]

Date post:	30-Aug-2019
Category:	Documents
Upload:	others
View:	0 times
Download:	0 times

社群媒體中顧客知識之挖掘：意見探勘技術開發review.management.ntu.edu.tw/paper/6243-P.pdf ·...

Documents