Text mining self-disclosing health information for public health service

Yungchang Ku, Chaochang Chiu, Yulei Zhang, Hsinchun Chen, Handsome Su

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

Understanding specific patterns or knowledge of selfdisclosing health information could support public health surveillance and healthcare. This study aimed to develop an analytical framework to identify selfdisclosing health information with unusual messages on web forums by leveraging advanced text-mining techniques. To demonstrate the performance of the proposed analytical framework, we conducted an experimental study on 2 major human immunodeficiency virus (HIV)/acquired immune deficiency syndrome (AIDS) forums in Taiwan. The experimental results show that the classification accuracy increased significantly (up to 83.83%) when using features selected by the information gain technique. The results also show the importance of adopting domain-specific features in analyzing unusual messages on web forums. This study has practical implications for the prevention and support of HIV/AIDS healthcare. For example, public health agencies can re-allocate resources and deliver services to people who need help via social media sites. In addition, individuals can also join a social media site to get better suggestions and support from each other.

Original languageEnglish (US)
Pages (from-to)928-947
Number of pages20
JournalJournal of the Association for Information Science and Technology
Volume65
Issue number5
DOIs
StatePublished - 2014

Fingerprint

public health services
Public health
health information
social media
Viruses
public health
Health
surveillance
Taiwan
resources
performance
Text mining
Health information
Health services
Healthcare
World Wide Web
Social media

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Information Systems
  • Information Systems and Management
  • Library and Information Sciences

Cite this

Text mining self-disclosing health information for public health service. / Ku, Yungchang; Chiu, Chaochang; Zhang, Yulei; Chen, Hsinchun; Su, Handsome.

In: Journal of the Association for Information Science and Technology, Vol. 65, No. 5, 2014, p. 928-947.

Research output: Contribution to journalArticle

Ku, Yungchang ; Chiu, Chaochang ; Zhang, Yulei ; Chen, Hsinchun ; Su, Handsome. / Text mining self-disclosing health information for public health service. In: Journal of the Association for Information Science and Technology. 2014 ; Vol. 65, No. 5. pp. 928-947.
@article{8328503922f548398c946c1fa81e6cc5,
title = "Text mining self-disclosing health information for public health service",
abstract = "Understanding specific patterns or knowledge of selfdisclosing health information could support public health surveillance and healthcare. This study aimed to develop an analytical framework to identify selfdisclosing health information with unusual messages on web forums by leveraging advanced text-mining techniques. To demonstrate the performance of the proposed analytical framework, we conducted an experimental study on 2 major human immunodeficiency virus (HIV)/acquired immune deficiency syndrome (AIDS) forums in Taiwan. The experimental results show that the classification accuracy increased significantly (up to 83.83{\%}) when using features selected by the information gain technique. The results also show the importance of adopting domain-specific features in analyzing unusual messages on web forums. This study has practical implications for the prevention and support of HIV/AIDS healthcare. For example, public health agencies can re-allocate resources and deliver services to people who need help via social media sites. In addition, individuals can also join a social media site to get better suggestions and support from each other.",
author = "Yungchang Ku and Chaochang Chiu and Yulei Zhang and Hsinchun Chen and Handsome Su",
year = "2014",
doi = "10.1002/asi.23025",
language = "English (US)",
volume = "65",
pages = "928--947",
journal = "Journal of the Association for Information Science and Technology",
issn = "2330-1635",
publisher = "John Wiley and Sons Ltd",
number = "5",

}

TY - JOUR

T1 - Text mining self-disclosing health information for public health service

AU - Ku, Yungchang

AU - Chiu, Chaochang

AU - Zhang, Yulei

AU - Chen, Hsinchun

AU - Su, Handsome

PY - 2014

Y1 - 2014

N2 - Understanding specific patterns or knowledge of selfdisclosing health information could support public health surveillance and healthcare. This study aimed to develop an analytical framework to identify selfdisclosing health information with unusual messages on web forums by leveraging advanced text-mining techniques. To demonstrate the performance of the proposed analytical framework, we conducted an experimental study on 2 major human immunodeficiency virus (HIV)/acquired immune deficiency syndrome (AIDS) forums in Taiwan. The experimental results show that the classification accuracy increased significantly (up to 83.83%) when using features selected by the information gain technique. The results also show the importance of adopting domain-specific features in analyzing unusual messages on web forums. This study has practical implications for the prevention and support of HIV/AIDS healthcare. For example, public health agencies can re-allocate resources and deliver services to people who need help via social media sites. In addition, individuals can also join a social media site to get better suggestions and support from each other.

AB - Understanding specific patterns or knowledge of selfdisclosing health information could support public health surveillance and healthcare. This study aimed to develop an analytical framework to identify selfdisclosing health information with unusual messages on web forums by leveraging advanced text-mining techniques. To demonstrate the performance of the proposed analytical framework, we conducted an experimental study on 2 major human immunodeficiency virus (HIV)/acquired immune deficiency syndrome (AIDS) forums in Taiwan. The experimental results show that the classification accuracy increased significantly (up to 83.83%) when using features selected by the information gain technique. The results also show the importance of adopting domain-specific features in analyzing unusual messages on web forums. This study has practical implications for the prevention and support of HIV/AIDS healthcare. For example, public health agencies can re-allocate resources and deliver services to people who need help via social media sites. In addition, individuals can also join a social media site to get better suggestions and support from each other.

UR - http://www.scopus.com/inward/record.url?scp=84901508230&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84901508230&partnerID=8YFLogxK

U2 - 10.1002/asi.23025

DO - 10.1002/asi.23025

M3 - Article

VL - 65

SP - 928

EP - 947

JO - Journal of the Association for Information Science and Technology

JF - Journal of the Association for Information Science and Technology

SN - 2330-1635

IS - 5

ER -