CS224W Machine Learning with Graphs

Lecture 6

Title: Message Passing and Node Classification
Date: 2020. 04. 09 (THU) ~ 2020. 04. 13 (MON)
Materials: Slides YouTube

Main question: How to predict labels of unlabeled nodes with some labeled nodes
Collective classification
- Intuition: Correlations exist in networks
- Relational classification, Iterative classification, Belief propagation

Homophily
- Characteristic of nodes determine the edge(connection)
- Individuals to associate and bond with similar others
Influence
- Connections determine characteristic of nodes
- Social connections can influence the individual characteristics
Confounding
- Other factor(environment) determines both of characteristic of nodes and connections

Similar nodes are typically close together or directly connected
Guilt -by-association
- Using Feature of \(O\)(object in network), Label of the \(O\)’s neighborhood, Feature of \(O\)’s neighborhood
Markov Assumption
- the label \(Y_i\) depends on the labels of its neighbors \(N_i\)
- \[P(Y_i|i) = P(Y_i|N_i)\]

fake reviewer/review detection
Easy to fake: Behavioral analysis(individual behaviors), Language analysis(content of review)
Hard to fake: graph structure(relationships between reviewers, reviews, stores)
Bipartite rating graph(positive review is +1 rating, negative review is -1 rating)
Fairness(\(F(u)\)), Goodness(\(G(p)\)), Reliability(\(R(u, p)\))
- \[F(u) = { {\sum_{ {(u,p)}\in{Out(u)}} R(u, p)} \over {|out(u)|}}\]
- \[G(p) = { {\sum_{ {(u, p)} \in {In(p)}} \cdot score(u, p)}\over{|In(p)|}}\]
- \[R(u, p) = { { {1} \over {y_1+y_2}} (y_1 \cdot F(u)) + y_2 \cdot (1- { {|score(u, p) - G(p)|} \over {2}}) }\]

Fake Review

Receive the message(state, attribute, etc) from neighbors and update, then pass toward other neighbors
After receive from and pass toward every neighbors, node can calculate their own state
However, it doesn’t work loopy(cyclic) graph

Belief Propagation

Label-label potential matrix(\(\psi\)): \(\psi(Y_i, Y_j)\) is probability of a node \(j\) being in the state \(Y_j\) given that it has a \(i\) neighbor in state \(Y_i\)
Prior belief(\(\phi\)): Probability \(\phi_i(Y_i)\) of node \(i\) being in the state \(Y_i\)
\(m_{i \to j}(Y_j)\) is \(i\)’s estimate of \(j\) being in the state \(Y_j\)
\(\mathcal{L}\) is the set of all states

Loopy BP

Convergence

Online Auction Fraud(especially, Non-delivery fraud)
Fraudsters try to form near-bipartite cores (2 roles)
- Accomplices: trades with honest
- Fraudsters: trades with accomplice and fraud with honest
  - These actions maintain their feedback score over than fraudsters who fraud with honest only
Three groups have different provabilities