Crowdguard: Characterization and early detection of collective content polluters in online social networks

Ke Li; Bin Guo; Qiuyun Zhang; Jianping Yuan; Zhiwen Yu

doi:10.1145/3308560.3316452

Crowdguard: Characterization and early detection of collective content polluters in online social networks

Ke Li, Bin Guo, Qiuyun Zhang, Jianping Yuan, Zhiwen Yu

Northwestern Polytechnical University Xian

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

6 引用（Scopus）

摘要

Recently, content polluters post malicious information in Online Social Networks (OSNs), which is a more and more serious problem that poses a serious threat to the privacy information, account security, user experience, etc. They continuously simulate the behaviors of legitimate accounts in various ways, and evade detection systems against them. In this paper, we focus on one kind of content polluter, namely collective content polluter (hereinafter referred to as CCP). Existing works either focus on individual polluters or require long periods of data records for detection, making their detection methods less robust and lagging behind. It is thus necessary to analyze the characteristics of collective content polluters and study the methods for early detection. This paper proposes a CCP early detection method called CrowdGuard. It analyzes the crowd behaviors of collective content polluters and legitimate accounts, extracts distinctive features, and leverages the Gaussian Mixture Model (GMM) method to cluster the two groups of accounts (legitimate users and polluters) to achieve early detection. Using the public dataset including thousands of collective content polluters on Twitter about a political election, we design an experimental scenario simulating early detection and evaluate the performance of CrowdGuard. The results show that CrowdGuard outperforms existing methods and is adequate for early detection.

源语言	英语
主期刊名	The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019
出版商	Association for Computing Machinery, Inc
页	1063-1070
页数	8
ISBN（电子版）	9781450366755
DOI	https://doi.org/10.1145/3308560.3316452
出版状态	已出版 - 13 5月 2019
活动	2019 World Wide Web Conference, WWW 2019 - San Francisco, 美国期限: 13 5月 2019 → 17 5月 2019

出版系列

姓名	The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019

会议

会议	2019 World Wide Web Conference, WWW 2019
国家/地区	美国
市	San Francisco
时期	13/05/19 → 17/05/19

访问文件

10.1145/3308560.3316452

其它文件与链接

链接到 Scopus 的出版物

引用此

Li, K., Guo, B., Zhang, Q., Yuan, J., & Yu, Z. (2019). Crowdguard: Characterization and early detection of collective content polluters in online social networks. 在 The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019 (页码 1063-1070). (The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019). Association for Computing Machinery, Inc. https://doi.org/10.1145/3308560.3316452

Li, Ke ; Guo, Bin ; Zhang, Qiuyun 等. / Crowdguard : Characterization and early detection of collective content polluters in online social networks. The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019. Association for Computing Machinery, Inc, 2019. 页码 1063-1070 (The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019).

@inproceedings{c2d97b9449e1452c8229e7b7e738dc12,

title = "Crowdguard: Characterization and early detection of collective content polluters in online social networks",

abstract = "Recently, content polluters post malicious information in Online Social Networks (OSNs), which is a more and more serious problem that poses a serious threat to the privacy information, account security, user experience, etc. They continuously simulate the behaviors of legitimate accounts in various ways, and evade detection systems against them. In this paper, we focus on one kind of content polluter, namely collective content polluter (hereinafter referred to as CCP). Existing works either focus on individual polluters or require long periods of data records for detection, making their detection methods less robust and lagging behind. It is thus necessary to analyze the characteristics of collective content polluters and study the methods for early detection. This paper proposes a CCP early detection method called CrowdGuard. It analyzes the crowd behaviors of collective content polluters and legitimate accounts, extracts distinctive features, and leverages the Gaussian Mixture Model (GMM) method to cluster the two groups of accounts (legitimate users and polluters) to achieve early detection. Using the public dataset including thousands of collective content polluters on Twitter about a political election, we design an experimental scenario simulating early detection and evaluate the performance of CrowdGuard. The results show that CrowdGuard outperforms existing methods and is adequate for early detection.",

keywords = "Collective Content Polluters, Crowd Computing, Early Detection, Gaussian Mixture Model, Social Media",

author = "Ke Li and Bin Guo and Qiuyun Zhang and Jianping Yuan and Zhiwen Yu",

note = "Publisher Copyright: {\textcopyright} 2019 IW3C2 (International World Wide Web Conference Committee), published under Creative Commons CC-BY 4.0 License.; 2019 World Wide Web Conference, WWW 2019 ; Conference date: 13-05-2019 Through 17-05-2019",

year = "2019",

month = may,

day = "13",

doi = "10.1145/3308560.3316452",

language = "英语",

series = "The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019",

publisher = "Association for Computing Machinery, Inc",

pages = "1063--1070",

booktitle = "The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019",

}

Li, K, Guo, B, Zhang, Q, Yuan, J & Yu, Z 2019, Crowdguard: Characterization and early detection of collective content polluters in online social networks. 在 The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019. The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019, Association for Computing Machinery, Inc, 页码 1063-1070, 2019 World Wide Web Conference, WWW 2019, San Francisco, 美国, 13/05/19. https://doi.org/10.1145/3308560.3316452

Crowdguard: Characterization and early detection of collective content polluters in online social networks. / Li, Ke; Guo, Bin; Zhang, Qiuyun 等.
The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019. Association for Computing Machinery, Inc, 2019. 页码 1063-1070 (The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019).

科研成果: 书/报告/会议事项章节 › 会议稿件 › 同行评审

TY - GEN

T1 - Crowdguard

T2 - 2019 World Wide Web Conference, WWW 2019

AU - Li, Ke

AU - Guo, Bin

AU - Zhang, Qiuyun

AU - Yuan, Jianping

AU - Yu, Zhiwen

PY - 2019/5/13

Y1 - 2019/5/13

N2 - Recently, content polluters post malicious information in Online Social Networks (OSNs), which is a more and more serious problem that poses a serious threat to the privacy information, account security, user experience, etc. They continuously simulate the behaviors of legitimate accounts in various ways, and evade detection systems against them. In this paper, we focus on one kind of content polluter, namely collective content polluter (hereinafter referred to as CCP). Existing works either focus on individual polluters or require long periods of data records for detection, making their detection methods less robust and lagging behind. It is thus necessary to analyze the characteristics of collective content polluters and study the methods for early detection. This paper proposes a CCP early detection method called CrowdGuard. It analyzes the crowd behaviors of collective content polluters and legitimate accounts, extracts distinctive features, and leverages the Gaussian Mixture Model (GMM) method to cluster the two groups of accounts (legitimate users and polluters) to achieve early detection. Using the public dataset including thousands of collective content polluters on Twitter about a political election, we design an experimental scenario simulating early detection and evaluate the performance of CrowdGuard. The results show that CrowdGuard outperforms existing methods and is adequate for early detection.

AB - Recently, content polluters post malicious information in Online Social Networks (OSNs), which is a more and more serious problem that poses a serious threat to the privacy information, account security, user experience, etc. They continuously simulate the behaviors of legitimate accounts in various ways, and evade detection systems against them. In this paper, we focus on one kind of content polluter, namely collective content polluter (hereinafter referred to as CCP). Existing works either focus on individual polluters or require long periods of data records for detection, making their detection methods less robust and lagging behind. It is thus necessary to analyze the characteristics of collective content polluters and study the methods for early detection. This paper proposes a CCP early detection method called CrowdGuard. It analyzes the crowd behaviors of collective content polluters and legitimate accounts, extracts distinctive features, and leverages the Gaussian Mixture Model (GMM) method to cluster the two groups of accounts (legitimate users and polluters) to achieve early detection. Using the public dataset including thousands of collective content polluters on Twitter about a political election, we design an experimental scenario simulating early detection and evaluate the performance of CrowdGuard. The results show that CrowdGuard outperforms existing methods and is adequate for early detection.

KW - Collective Content Polluters

KW - Crowd Computing

KW - Early Detection

KW - Gaussian Mixture Model

KW - Social Media

UR - http://www.scopus.com/inward/record.url?scp=85066901460&partnerID=8YFLogxK

U2 - 10.1145/3308560.3316452

DO - 10.1145/3308560.3316452

M3 - 会议稿件

AN - SCOPUS:85066901460

T3 - The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019

SP - 1063

EP - 1070

BT - The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019

PB - Association for Computing Machinery, Inc

Y2 - 13 May 2019 through 17 May 2019

ER -

Li K, Guo B, Zhang Q, Yuan J, Yu Z. Crowdguard: Characterization and early detection of collective content polluters in online social networks. 在 The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019. Association for Computing Machinery, Inc. 2019. 页码 1063-1070. (The Web Conference 2019 - Companion of the World Wide Web Conference, WWW 2019). doi: 10.1145/3308560.3316452

Crowdguard: Characterization and early detection of collective content polluters in online social networks

摘要

出版系列

会议

访问文件

其它文件与链接

指纹

引用此