Gender classification for web forums

Yulei Zhang, Yan Dang, Hsinchun Chen

Research output: Contribution to journalArticle

20 Citations (Scopus)

Abstract

More and more women are participating in and exchanging opinions through community-based online social media. Questions concerning gender differences in the new media have been raised. This paper proposes a feature-based text classification framework to examine online gender differences between Web forum posters by analyzing writing styles and topics of interest. Our experiment on an Islamic women's political forum shows that feature sets containing both content-free and content-specific features perform significantly better than those consisting of only content-free features, feature selection can improve the classification results significantly, and female and male participants have significantly different topics of interest.

Original languageEnglish (US)
Article number5723017
Pages (from-to)668-677
Number of pages10
JournalIEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans.
Volume41
Issue number4
DOIs
StatePublished - Jul 2011

Fingerprint

Feature extraction
Experiments

Keywords

  • Gender classification
  • online gender differences

ASJC Scopus subject areas

  • Control and Systems Engineering
  • Electrical and Electronic Engineering
  • Computer Science Applications
  • Human-Computer Interaction
  • Information Systems
  • Software

Cite this

Gender classification for web forums. / Zhang, Yulei; Dang, Yan; Chen, Hsinchun.

In: IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans., Vol. 41, No. 4, 5723017, 07.2011, p. 668-677.

Research output: Contribution to journalArticle

@article{08521dcbbcd64e02bb545f62adc30edb,
title = "Gender classification for web forums",
abstract = "More and more women are participating in and exchanging opinions through community-based online social media. Questions concerning gender differences in the new media have been raised. This paper proposes a feature-based text classification framework to examine online gender differences between Web forum posters by analyzing writing styles and topics of interest. Our experiment on an Islamic women's political forum shows that feature sets containing both content-free and content-specific features perform significantly better than those consisting of only content-free features, feature selection can improve the classification results significantly, and female and male participants have significantly different topics of interest.",
keywords = "Gender classification, online gender differences",
author = "Yulei Zhang and Yan Dang and Hsinchun Chen",
year = "2011",
month = "7",
doi = "10.1109/TSMCA.2010.2093886",
language = "English (US)",
volume = "41",
pages = "668--677",
journal = "IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans",
issn = "1083-4427",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "4",

}

TY - JOUR

T1 - Gender classification for web forums

AU - Zhang, Yulei

AU - Dang, Yan

AU - Chen, Hsinchun

PY - 2011/7

Y1 - 2011/7

N2 - More and more women are participating in and exchanging opinions through community-based online social media. Questions concerning gender differences in the new media have been raised. This paper proposes a feature-based text classification framework to examine online gender differences between Web forum posters by analyzing writing styles and topics of interest. Our experiment on an Islamic women's political forum shows that feature sets containing both content-free and content-specific features perform significantly better than those consisting of only content-free features, feature selection can improve the classification results significantly, and female and male participants have significantly different topics of interest.

AB - More and more women are participating in and exchanging opinions through community-based online social media. Questions concerning gender differences in the new media have been raised. This paper proposes a feature-based text classification framework to examine online gender differences between Web forum posters by analyzing writing styles and topics of interest. Our experiment on an Islamic women's political forum shows that feature sets containing both content-free and content-specific features perform significantly better than those consisting of only content-free features, feature selection can improve the classification results significantly, and female and male participants have significantly different topics of interest.

KW - Gender classification

KW - online gender differences

UR - http://www.scopus.com/inward/record.url?scp=79959627244&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79959627244&partnerID=8YFLogxK

U2 - 10.1109/TSMCA.2010.2093886

DO - 10.1109/TSMCA.2010.2093886

M3 - Article

VL - 41

SP - 668

EP - 677

JO - IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans

JF - IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans

SN - 1083-4427

IS - 4

M1 - 5723017

ER -