Skip to main navigation Skip to search Skip to main content

Target Distribution Guided Network Sampling

  • Northwestern Polytechnical University Xian
  • Xi'an University of Science and Technology
  • University of Fribourg

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Studying public users' data on social networks to provide service and prediction for the society has been a widespread and effective way thanks to the rapid raise of social networks. However, users' population structure online is usually different from that of physical world, which may influence the researches significantly. Thus it may become an essential limitation for studies conducted by revealing knowledge from social media data owing to the biased network population structure. Tradition sample approaches are either resources-intensive or data-biased. In this paper, we proposed a target distribution guided sample process to solve the problem of imbalanced user data in the virtual space. We make intervention to the sampling procedure according to the real-Time divergence of the collected sample set against the target distribution, apply theory of homophily to discover the users with matched features and refine the samples with recursive sampling. Experiments show this method is able to successfully constrain samples' overall structure according to the given distribution within a given JS divergence of 0.1 while leaving the unrelated features distributed randomly. Moreover, it takes less times of access to collect a certain number of samples for the method proposed in this paper and thus save time and computer resources.

Original languageEnglish
Title of host publicationProceedings - 5th International Conference on Advanced Cloud and Big Data, CBD 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages374-379
Number of pages6
ISBN (Electronic)9781538610725
DOIs
StatePublished - 6 Sep 2017
Externally publishedYes
Event5th International Conference on Advanced Cloud and Big Data, CBD 2017 - Shanghai, China
Duration: 13 Aug 201716 Aug 2017

Publication series

NameProceedings - 5th International Conference on Advanced Cloud and Big Data, CBD 2017

Conference

Conference5th International Conference on Advanced Cloud and Big Data, CBD 2017
Country/TerritoryChina
CityShanghai
Period13/08/1716/08/17

Keywords

  • Online analytical processing
  • Social network analysis

Fingerprint

Dive into the research topics of 'Target Distribution Guided Network Sampling'. Together they form a unique fingerprint.

Cite this