Variational Inference for Stochastic Block Models from Sampled Data

Abstract : This paper deals with non-observed dyads during the sampling of a network and consecutive issues in the Stochastic Block Model (SBM) inference. We review sampling designs and recover Missing At Random (MAR) and Not Missing At Random (NMAR) conditions for SBM. We introduce several variants of the variational EM (VEM) algorithm for inferring the SBM under various sampling designs (MAR and NMAR). The sampling design must be taken into account only in the NMAR case. Model selection criteria based on Integrated Classification Likelihood (ICL) are derived for selecting both the number of blocks and the sampling design. We investigate the accuracy and the range of applicability of these algorithms with simulations. We finally explore two real-world networks from ethnology (seed circulation network) and biology (protein-protein interaction network), where the interpretations considerably depends on the sampling designs considered.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

https://hal-agroparistech.archives-ouvertes.fr/hal-01593183
Contributor : Pierre Barbillon <>
Submitted on : Monday, September 25, 2017 - 7:27:43 PM
Last modification on : Friday, July 20, 2018 - 11:13:33 AM

Links full text

Identifiers

  • HAL Id : hal-01593183, version 1
  • ARXIV : 1707.04141

Citation

Timothée Tabouy, Pierre Barbillon, Julien Chiquet. Variational Inference for Stochastic Block Models from Sampled Data. 2017. ⟨hal-01593183⟩

Share

Metrics

Record views

214