The Python pgmpy Library: Building and Running Inference on Probabilistic Graphical Models


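This article looks at how to build and apply Probabilistic Graphical Models (PGMs) with pgmpy, a Python library. A PGM represents the dependencies among random variables, which lets it express a joint distribution compactly and supports inference that is computationally cheaper than traditional methods. PGMs are widely used in speech recognition, information extraction, image segmentation, and biological applications such as modelling gene regulatory networks.

pgmpy is a library designed specifically for working with graphical models in Python. It lets users create their own graphical models and run inference on them, and it supports several inference algorithms, such as Variable Elimination and Belief Propagation.

The article first gives a short introduction to the basic concepts and principles of PGMs and to other related Python libraries. It then explains how to use pgmpy to build two common kinds of probabilistic graphical model: Bayesian Networks and Markov Networks. The Bayesian Network part covers how variables are represented as nodes, structure learning, and how to use pgmpy for conditional probability queries and posterior computation. The Markov Network part covers the properties of undirected graphs, energy functions, and how to run inference.

On the practical side, readers learn how to import pgmpy, create a basic model structure, set node attributes, and run various kinds of inference tasks. The article also shares some case studies showing how to solve practical problems with pgmpy.

The guide is aimed both at practitioners interested in probabilistic graphical models and at developers and researchers who want to extend their data analysis skills in Python. With practice, readers should be able to build and analyse probabilistic graphical models with pgmpy.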

PROC. OF THE 14th PYTHON IN SCIENCE CONF. (SCIPY 2015)

pgmpy: Probabilistic Graphical Models using Python

Ankur Ankan*, Abinash Panda

https://www.youtube.com/watch?v=Vcmjqx7lht0

Abstract—Probabilistic Graphical Models (PGMs) are a technique for compactly representing a joint distribution by exploiting dependencies between the random variables. They also allow us to do inference on joint distributions in a computationally cheaper way than traditional methods. PGMs are widely used in speech recognition, information extraction, image segmentation, and modelling gene regulatory networks.

pgmpy [pgmpy] is a Python library for working with graphical models. It allows the user to create their own graphical models and answer inference or MAP queries over them. pgmpy implements many inference algorithms, such as Variable Elimination and Belief Propagation.

This paper first gives a short introduction to PGMs and to the other Python packages available for working with them. We then discuss creating Bayesian Networks and Markov Networks and doing inference over them using pgmpy.

Index Terms—Graphical Models, Bayesian Networks, Markov Networks, Variable Elimination

Introduction

A Probabilistic Graphical Model (PGM) is a technique for representing joint distributions over random variables compactly by exploiting the dependencies between them. PGMs use a network structure to encode the relationships between the random variables, together with a set of parameters that represent the joint distribution.
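To make "compact" concrete, the sketch below counts the independent parameters needed by one table over all variables and by a factorization along a directed graph. The variable names, cardinalities and parent sets are assumptions chosen to resemble a student-style network; they are not taken from the paper's figures, and the helper functions are hypothetical plain Python rather than pgmpy API.

```python
from math import prod

# Hypothetical network: number of states per variable (an assumption).
cardinalities = {
    "difficulty": 2,
    "intelligence": 2,
    "grade": 3,
    "sat": 2,
    "letter": 2,
}

# Hypothetical parent sets; variables without an entry have no parents.
parents = {
    "grade": ["difficulty", "intelligence"],
    "sat": ["intelligence"],
    "letter": ["grade"],
}


def full_joint_params(cards):
    """Independent parameters of a single table over all variables."""
    return prod(cards.values()) - 1


def factorized_params(cards, parent_sets):
    """Independent parameters when each variable depends only on its parents."""
    total = 0
    for var, card in cards.items():
        # One distribution over `var` per joint assignment of its parents.
        parent_configs = prod(cards[p] for p in parent_sets.get(var, []))
        total += (card - 1) * parent_configs
    return total


n_full = full_joint_params(cardinalities)
n_factorized = factorized_params(cardinalities, parents)
print(n_full, n_factorized)
```

Under these assumptions the single table needs 47 independent parameters and the factorized form needs 15. The gap widens quickly as variables are added, because the full table grows exponentially with the number of variables.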

There are two major types of graphical models: Bayesian Networks and Markov Networks.

Bayesian Network: A Bayesian Network consists of a directed graph and a conditional probability distribution associated with each of the random variables. Bayesian Networks are mostly used when there is a causal relationship between the random variables. An example of a Bayesian Network representing a student [student] taking a course is shown in Fig 1.
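As a rough illustration of how a directed graph plus one conditional distribution per variable defines a joint distribution, the sketch below multiplies the distributions of a three-variable fragment (difficulty and intelligence as parents of grade) and answers a query by brute-force enumeration. The structure and all probabilities are illustrative assumptions, not values read from Fig 1, and plain Python is used instead of pgmpy.

```python
from itertools import product

# Illustrative (assumed) distributions for the parentless variables.
p_difficulty = {"easy": 0.6, "hard": 0.4}
p_intelligence = {"low": 0.7, "high": 0.3}

# Assumed P(grade | intelligence, difficulty); each row sums to 1.
p_grade = {
    ("low", "easy"): {"A": 0.3, "B": 0.4, "C": 0.3},
    ("low", "hard"): {"A": 0.05, "B": 0.25, "C": 0.7},
    ("high", "easy"): {"A": 0.9, "B": 0.08, "C": 0.02},
    ("high", "hard"): {"A": 0.5, "B": 0.3, "C": 0.2},
}

# Chain rule for this graph: P(d, i, g) = P(d) * P(i) * P(g | i, d).
joint = {
    (d, i, g): p_difficulty[d] * p_intelligence[i] * p_grade[(i, d)][g]
    for d, i, g in product(p_difficulty, p_intelligence, ["A", "B", "C"])
}


def posterior_intelligence(grade):
    """P(intelligence | grade) by summing the joint and normalizing."""
    unnormalized = {
        i: sum(p for (_, i2, g), p in joint.items() if i2 == i and g == grade)
        for i in p_intelligence
    }
    evidence = sum(unnormalized.values())
    return {i: value / evidence for i, value in unnormalized.items()}


posterior = posterior_intelligence("A")
print(posterior)
```

Enumeration touches every entry of the joint, so its cost grows exponentially with the number of variables. Algorithms such as Variable Elimination exist to avoid building the full joint.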

Markov Network: A Markov Network consists of an undirected graph with a set of Factors associated with it. Unlike a Conditional Probability Distribution, a Factor does not represent the probabilities of variables in the network; instead it represents the compatibility between random variables, that is, how likely a particular state of one random variable is to agree with a given state of another. An example of a Markov Network [markov] over four friends A, B, C and D agreeing on some concept is shown in Fig 2.
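The difference between a Factor and a probability can be sketched in a few lines. The example below assumes the four friends form a cycle A - B - C - D - A with one pairwise factor per edge; the graph shape, the state encoding and every factor value are illustrative assumptions rather than the contents of Fig 2, and the code is plain Python, not pgmpy.

```python
from itertools import product

STATES = (0, 1)  # 0 = disagrees, 1 = agrees (an assumed encoding)

# Assumed pairwise factors on the cycle A - B - C - D - A.
# Each maps a pair of states to a non-negative compatibility score.
factors = {
    ("A", "B"): {(0, 0): 30, (0, 1): 5, (1, 0): 1, (1, 1): 10},
    ("B", "C"): {(0, 0): 100, (0, 1): 1, (1, 0): 1, (1, 1): 100},
    ("C", "D"): {(0, 0): 1, (0, 1): 100, (1, 0): 100, (1, 1): 1},
    ("D", "A"): {(0, 0): 100, (0, 1): 1, (1, 0): 1, (1, 1): 100},
}
variables = ("A", "B", "C", "D")


def unnormalized_score(assignment):
    """Product of all factor values for one full assignment."""
    score = 1
    for (u, v), table in factors.items():
        score *= table[(assignment[u], assignment[v])]
    return score


# The partition function Z sums the scores of every joint assignment.
assignments = [dict(zip(variables, states))
               for states in product(STATES, repeat=len(variables))]
Z = sum(unnormalized_score(a) for a in assignments)


def probability(assignment):
    """Normalized probability of one full assignment."""
    return unnormalized_score(assignment) / Z


p = probability({"A": 0, "B": 1, "C": 1, "D": 0})
print(Z, p)
```

The individual factor values are scores, not probabilities: they only yield a distribution after the product over all factors is divided by Z.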

* Corresponding author: [email protected]

© 2015 Ankur Ankan et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

There are numerous open source packages available in Python for working with graphical models. eBay’s bayesian-belief-networks [bbn] focuses mostly on Bayesian models and implements a limited number of inference algorithms. Another package, pymc [pymc], focuses mainly on Markov Chain Monte Carlo (MCMC) methods. libpgm [libpgm] also focuses mainly on Bayesian Networks.

pgmpy tries to be a complete package for working with graphical models and gives the user full control over designing the model. The source code is documented with docstrings and doctests for each method so that users can quickly get up to speed. pgmpy is also easy to extend: users can write their own inference algorithms or elimination order algorithms without first having to become familiar with the rest of the source code.

Getting Source Code and Installing

pgmpy is released under the MIT License and is hosted on GitHub. We can simply clone the repository and install it:

git clone https://github.com/pgmpy/pgmpy

cd pgmpy

[sudo] python3 setup.py install

Dependencies: pgmpy runs only on Python 3 and depends on networkx, numpy, pandas and scipy, which can be installed using pip or conda:

pip install -r requirements.txt

or:

conda install --file requirements.txt
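After installing, one quick way to confirm that the listed dependencies are visible to the interpreter is a check like the one below. It is a minimal sketch using only the standard library; the helper name `missing_packages` is hypothetical and not part of pgmpy.

```python
from importlib.util import find_spec

# Dependencies named in the text above.
REQUIRED = ("networkx", "numpy", "pandas", "scipy")


def missing_packages(names=REQUIRED):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


missing = missing_packages()
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All listed dependencies are importable.")
```

The check only looks the packages up; it does not import them or verify their versions.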

Creating Bayesian Models using pgmpy

A Bayesian Network consists of a directed graph where nodes represent random variables and edges represent the relations between them. It is parameterized using Conditional Probability Distributions (CPDs). Each random variable in a Bayesian Network has a CPD associated with it. If a random variable has parents in the network, its CPD represents $P(var \mid Par_{var})$, i.e. the probability of that variable given its parents. When the random variable has no parents, the CPD simply represents $P(var)$, i.e. the probability of that variable.
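The two cases can be captured by a single table keyed on the parents' states. The class below is a hypothetical plain-Python stand-in written for illustration; it is not pgmpy's own CPD class, and the variables and probabilities are assumed values.

```python
from math import isclose


class SimpleCPD:
    """Hypothetical stand-in for a CPD table; not a pgmpy class."""

    def __init__(self, variable, parents, table):
        self.variable = variable
        self.parents = tuple(parents)
        # Maps a tuple of parent states to a distribution over `variable`.
        # A variable without parents uses the empty tuple as its only key.
        self.table = table

    def validate(self):
        """Check that every conditional distribution sums to 1."""
        for parent_states, dist in self.table.items():
            if len(parent_states) != len(self.parents):
                raise ValueError(f"bad parent assignment: {parent_states}")
            if not isclose(sum(dist.values()), 1.0, abs_tol=1e-9):
                raise ValueError(f"column {parent_states} does not sum to 1")
        return True

    def prob(self, state, parent_states=()):
        """Look up P(variable = state | parents = parent_states)."""
        return self.table[tuple(parent_states)][state]


# P(var): no parents, so the table holds a single distribution.
cpd_root = SimpleCPD("intelligence", (), {(): {"low": 0.7, "high": 0.3}})

# P(var | Par_var): one distribution per parent assignment (values assumed).
cpd_child = SimpleCPD(
    "sat",
    ("intelligence",),
    {
        ("low",): {"bad": 0.95, "good": 0.05},
        ("high",): {"bad": 0.2, "good": 0.8},
    },
)

print(cpd_root.validate(), cpd_child.prob("good", ("high",)))
```

Keying the table on parent states makes the normalization constraint explicit: each parent assignment owns one distribution over the variable, and each of those must sum to 1 on its own.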

For example, consider the student model represented in Fig 1. A possible CPD for the random variable grade is shown in Table 1.

We can represent the CPD shown in Table 1 in pgmpy as follows:
