P1: WQS Trim: 6.125in×9.25in Top: 0.5in Gutter: 0.75in
CUUS2079-END CUUS2079-Zafarani 978 1 107 01885 3 January 13, 2014 19:33
Notes
Chapter 1
- The data has a power-law distribution and more often than not, data is not inde-
pendent and identically distributed (i.i.d.) as generally assumed in data mining.
Chapter 2
- This is similar to plotting the probability mass function for degrees.
- Instead ofWin weighted networks,Cis used to clearly represent capacities.
- This edge is often called theweak link.
- The proof is omitted here and is a direct result from the minimum-cut/maximum
flow theorem not discussed in this chapter.
Chapter 3
- This constraint is optional and can be lifted based on the context.
- When det(I−αAT)=0, it can be rearranged as det(AT−α−^1 I)=0, which is
basically the characteristic equation. This equation first becomes zero when the
largest eigenvalue equalsα−^1 , or equivalentlyα= 1 /λ. - Whendoutj =0, we know that since the out-degree is zero,∀i,Aj,i=0, this makes
the term inside the summation^00. We can fix this problem by settingdoutj =1since
the node will not contribute any centrality to any other nodes. - Here, we start fromv 1 and follow the edges. One can start from a different node,
and the result should remain the same. - HITS stands for hypertext-induced topic search.
Chapter 4
- For a more detailed approach refer to [Clauset et al., 2009].
- Note that forc=1, the component size is stable, and in the limit, no growth will
be observed. The phase transition happens exactly atc=1. - Hint: The proof is similar to the proof provided for the likelihood of observingm
edges (Proposition 4.3).