Introduction to Flow Model

Date:

Flow Model

Basic Ideas

Suppose a neural network generator G defines a distribution PG, that is

zπ(z),xPG(x),x=G(z)

The optimal model is

G=argmaxGmi=1logPG(xi)KL(Pdata||PG)

From the transformation relationship, we have

PG(xi)=π(zi)|det(JG1)|

where π(zi)=G1(xi). Apply log to both sides, we have

logPG(xi)=logπ(G1(xi))+log|det(JG1)|

The iteration of multiple models (the flow) is as follows

π(x)G1P1(x)G2P2(x)

And the we get distribution

PG(xi)=π(zi)|det(JG11)||det(JG12)||det(JG1K)|,i.e.,logPG(xi)=logπ(zi)+Kn=1|det(JG1n)|

Coupling Layer

coupling_layer

It is easy to find that it is a invertible transformation, on the one hand,

xid=zid,xi>d=βi>dzi>d+γi>d

on the other hand,

zid=xid,zi>d=xi>dγi>dβi>d

Now we can compute the Jacobian Matrix

[I0Diagonal] detJG=xd+1zd+1xd+2zd+2xDzD=βd+1βd+2βD

Coupling Layer Stacks

coupling_layers