Analysis of Topological Characteristics of Huge Online Social Networking Services

Yong-Yeol Ahn, Seungyeop Han, Haewoon Kwak, Sue Moon, Hawoong Jeong
16th International World Wide Web Conference (WWW), Banff, Alberta, CANADA, May 2007

Social networking services are a fast-growing business in the Internet. However, it is unknown if online relationships and their growth patterns are the same as in real-life social networks. In this paper, we compare the structures of three online social networking services: Cyworld, MySpace, and orkut, each with more than 10 million users, respectively. We have access to complete data of Cyworld’s ilchon (friend) relationships and analyze its degree distribution, clustering property, degree correlation, and evolution over time. We also use Cyworld data to evaluate the validity of snowball sampling method, which we use to crawl and obtain partial network topologies of MySpace and orkut. Cyworld, the oldest of the three, demonstrates a changing scaling behavior over time in degree distribution. The latest Cyworld data’s degree distribution exhibits a multi-scaling behavior, while those of MySpace and orkut have simple scaling behaviors with different exponents. Very interestingly, each of the two exponents corresponds to the different segments in Cyworld’s degree distribution. Certain online social networking services encourage online activities that cannot be easily copied in real life; we show that they deviate from close-knit online social networks which show a similar degree correlation pattern to real-life social networks.

[PDF (WWW 2007 paper)]
[PDF (Corrected & extended version)]

@inproceedings{2007-www-social-networking,
   author =    "Yong-Yeol Ahn and Seungyeop Han and Haewoon Kwak and Sue Moon and Hawoong Jeong",
   title =     "{Analysis of topological characteristics of huge online social networking services}",
   booktitle = {WWW '07: Proceedings of the 16th international conference on World Wide Web},
   year =      {2007},
   isbn =      {978-1-59593-654-7},
   pages =     {835--844},
   location =  {Banff, Alberta, Canada},
   doi =       {http://doi.acm.org/10.1145/1242572.1242685},
   publisher = {ACM},
   address =   {New York, NY, USA}
}

Data

We have obtained 100,000 user information from the MySpace friend network by crawling the MySpace online web site from September to October, 2006. The crawler randomly selects a starting user site, and crawl the user's friends' pages, their friends' pages, and so on. We have left out users who do not publicize their firends' list, and the amount of those users were about 23% out of all the nodes we have crawled. Unfortunately, we cannot provide the orkut dataset that was used in our paper, since it was lost in a crash of our data server. For more information on the traces, please refer to our paper.

MySpace

Contact

Yong-Yeol Ahn, Dept. of Physics, KAIST. Email: yongyeol@gmail.com