Social Media Mining: An Introduction

P1: WQS Trim: 6.125in×9.25in Top: 0.5in Gutter: 0.75in
CUUS2079-04 CUUS2079-Zafarani 978 1 107 01885 3 January 13, 2014 16:54

100 Network Models

is what we are interested in, which, when solved, gives

P(d)=

2 m^2 t d^3 (t+m 0 )

and the stationary solution (t→∞),

P(d)=

2 m^2 d^3

, (4.30)

which is a power-law degree distribution with exponentb=3. Note that in real-world networks, the exponent varies in a range (e.g., [2, 3]); however, there is no variance in the exponent of the introduced model. To overcome this issue, several other models are proposed. Interested readers can refer to the bibliographical notes for further references.

Clustering Coefficient In general, not many triangles are formed by the Bar ́abasi-Albert model, because edges are created independently and one at a time. Again, using a mean-field analysis, the expected clustering coefficient can be calculated as

C=

m 0 − 1 8

(lnt)^2 t

, (4.31)

wheretis the time passed in the system during the simulation. We avoid the details of this calculation due to techniques beyond the scope of this book. Unfortunately, as time passes, the clustering coefficient gets smaller and fails to model the high clustering coefficient observed in real-world networks.

Average Path Length

The average path length of the preferential attachment model increases logarithmically with the number of nodes present in the network:

l∼

ln|V| ln(ln|V|)

. (4.32)

This indicates that, on average, preferential attachment models generate shorter path lengths than random graphs. Random graphs are considered accurate in approximating the average path lengths. The same holds for preferential attachment models.

Social Media Mining: An Introduction

, (4.30)

C=

, (4.31)

. (4.32)

Get our desktop app

Company

Features

Documentation

Resources